COMPOSITIONS, KITS, AND METHODS FOR
IDENTIFICATION, ASSESSMENT, PREVENTION, AND THERAPY OF
OVARIAN CANCER

RELATED APPLICATIONS

The present application claims priority to U.S. provisional patent application 
serial no. 60/191,031 filed on March 21, 2000, U.S. provisional patent application serial 
no. 60/207,124, filed on May 25, 2000, U.S. provisional patent application serial no. 
60/211,940, filed on June 15, 2000, U.S. provisional patent application serial no. 
60/216,820, filed on July 7, 2000, U.S. provisional patent application serial no.
60/220,661, filed on July 25, 2000, and U.S. provisional patent application serial no. 
60/257,672, filed on December 21, 2000, all of which are expressly incorporated by 
reference. 

FIELD OF THE INVENTION

The field of the invention is ovarian cancer, including diagnosis, 
characterization, management, and therapy of ovarian cancer. 

BACKGROUND OF THE INVENTION 
Ovarian cancer is responsible for significant morbidity and mortality in
populations around the world. Ovarian cancer is classified, on the basis of clinical and
pathological features, in three groups, namely epithelial ovarian cancer (EOC; >90% of
ovarian cancer in Western countries), germ cell tumors (circa 2-3% of ovarian cancer),
and stromal ovarian cancer (circa 5% of ovarian cancer; Ozols et al, 1997, Cancer
Principles and Practice of Oncology, 5th ed., DeVita et al, Eds. pp. 1502). Relative to
EOC, germ cell tumors and stromal ovarian cancers are more easily detected and treated 
at an early stage, translating into higher survival rates for patients afflicted with
these two types of ovarian cancer. 

There are numerous types of ovarian tumors, some of which are benign, and 
others of which are malignant. Treatment (including non-treatment) options and
predictions of patient outcome depend on accurate classification of the ovarian cancer. 
Ovarian cancers are named according to the type of cells from which the cancer is 
derived and whether the ovarian cancer is benign or malignant. Recognized histological
tumor types include, for example, serous, mucinous, endometrioid, and clear cell 
tumors. In addition, ovarian cancers are classified according to recognized grade and 
stage scales.

In grade I, the tumor tissue is well differentiated from normal ovarian tissue. In
grade II, tumor tissue is moderately well differentiated. In grade III, the tumor tissue is
poorly differentiated from normal tissue, and this grade correlates with a less favorable
prognosis than grades I and II. Stage I is generally confined within the capsule
surrounding one (stage IA) or both (stage IB) ovaries, although in some stage I (i.e.
stage IC) cancers, malignant cells may be detected in ascites, in peritoneal rinse fluid, or
on the surface of the ovaries. Stage II involves extension or metastasis of the tumor 
from one or both ovaries to other pelvic structures. In stage IIA, the tumor extends or 
has metastasized to the uterus, the fallopian tubes, or both. Stage IIB involves extension 
of the tumor to the pelvis. Stage IIC is stage IIA or IIB in which malignant cells may be
detected in ascites, in peritoneal rinse fluid, or on the surface of the ovaries. In stage III,
the tumor comprises at least one malignant extension to the small bowel or the 
omentum, has formed extrapelvic peritoneal implants of microscopic (stage IIIA) or 
macroscopic (< 2 centimeter diameter, stage IIIB; > 2 centimeter diameter, stage IIIC) 
size, or has metastasized to a retroperitoneal or inguinal lymph node (an alternate
indicator of stage IIIC). In stage IV, distant (i.e. non-peritoneal) metastases of the tumor
can be detected. 

The durations of the various stages of ovarian cancer are not presently known, 
but are believed to be at least about a year each (Richart et al, 1969, Am. J. Obstet
Gynecol 105:386). Prognosis declines with increasing stage designation. For example,
5-year survival rates for patients diagnosed with stage I, II, III, and IV ovarian cancer
are 80%, 57%, 25%, and 8%, respectively. 

Despite being the third most prevalent gynecological cancer, ovarian cancer is 
the leading cause of death among those afflicted with gynecological cancers. The 
disproportionate mortality of ovarian cancer is attributable to a substantial absence of
symptoms among those afflicted with early-stage ovarian cancer and to difficulty
diagnosing ovarian cancer at an early stage. Patients afflicted with ovarian cancer most 
often present with non-specific complaints, such as abnormal vaginal bleeding, 
gastrointestinal symptoms, urinary tract symptoms, lower abdominal pain, and
generalized abdominal distension. These patients rarely present with paraneoplastic 
symptoms or with symptoms which clearly indicate their affliction. Presently, less than 
about 40% of patients afflicted with ovarian cancer present with stage I or stage II. 
Management of ovarian cancer would be significantly enhanced if the disease could be
detected at an earlier stage, when treatments are generally much more efficacious.

Ovarian cancer may be diagnosed, in part, by collecting a routine medical history 
from a patient and by performing physical examination, x-ray examination, and 
chemical and hematological studies on the patient. Hematological tests which may be
indicative of ovarian cancer in a patient include analyses of serum levels of proteins
designated CA125 and DF3 and plasma levels of lysophosphatidic acid (LPA).
Palpation of the ovaries and ultrasound techniques (particularly including endovaginal 
ultrasound and color Doppler flow ultrasound techniques) can aid detection of ovarian 
tumors and differentiation of ovarian cancer from benign ovarian cysts. However, a
definitive diagnosis of ovarian cancer typically requires performing exploratory
laparotomy of the patient. 

Potential tests for the detection of ovarian cancer (e.g., screening, reflex or 
monitoring) may be characterized by a number of factors. The "sensitivity" of an assay 
refers to the probability that the test will yield a positive result in an individual afflicted
with ovarian cancer. The "specificity" of an assay refers to the probability that the test
will yield a negative result in an individual not afflicted with ovarian cancer. The
"positive predictive value" (PPV) of an assay is the ratio of true positive results (i.e.
positive assay results for patients afflicted with ovarian cancer) to all positive results
(i.e. positive assay results for patients afflicted with ovarian cancer + positive assay
results for patients not afflicted with ovarian cancer). It has been estimated that in order
for an assay to be an appropriate population-wide screening tool for ovarian cancer the 
assay must have a PPV of at least about 10% (Rosenthal et al, 1998, Semin. Oncol
25:315-325). It would thus be desirable for a screening assay for detecting ovarian 
cancer in patients to have a high sensitivity and a high PPV. Monitoring and reflex tests
would also require appropriate specifications.
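As an illustration of the three quantities defined above, the following minimal Python sketch computes sensitivity, specificity, and PPV from the four possible assay outcomes. The class name `AssayCounts`, the function names, and the counts are hypothetical and are not taken from any study.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class AssayCounts:
    """Outcome counts for an assay checked against confirmed diagnoses."""
    true_pos: int   # afflicted, assay positive
    false_neg: int  # afflicted, assay negative
    true_neg: int   # not afflicted, assay negative
    false_pos: int  # not afflicted, assay positive


def sensitivity(c: AssayCounts) -> float:
    """Probability of a positive result in an afflicted individual."""
    return c.true_pos / (c.true_pos + c.false_neg)


def specificity(c: AssayCounts) -> float:
    """Probability of a negative result in an individual not afflicted."""
    return c.true_neg / (c.true_neg + c.false_pos)


def positive_predictive_value(c: AssayCounts) -> float:
    """Ratio of true positive results to all positive results."""
    return c.true_pos / (c.true_pos + c.false_pos)


# Hypothetical counts, for illustration only.
counts = AssayCounts(true_pos=8, false_neg=2, true_neg=900, false_pos=90)

if __name__ == "__main__":
    print(f"sensitivity = {sensitivity(counts):.3f}")          # 8 / (8 + 2)
    print(f"specificity = {specificity(counts):.3f}")          # 900 / (900 + 90)
    print(f"PPV         = {positive_predictive_value(counts):.3f}")  # 8 / (8 + 90)
```

With these made-up counts, an assay can show high sensitivity and specificity and still fall short of a 10% PPV, because positive results among the many unaffected individuals outnumber the true positives.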

Owing to the cost, limited sensitivity, and limited specificity of known methods
of detecting ovarian cancer, screening is not presently performed for the general 
population. In addition, the need to perform laparotomy in order to diagnose ovarian 
cancer in patients who screen positive for indications of ovarian cancer limits the 
desirability of population-wide screening, such that a PPV even greater than 10% would
be desirable. 

Prior use of serum CA125 level as a diagnostic marker for ovarian cancer 
indicated that this method exhibited insufficient specificity for use as a general 
screening method. Use of a refined algorithm for interpreting CA125 levels in serial
retrospective samples obtained from patients improved the specificity of the method
without shifting detection of ovarian cancer to an earlier stage (Skates, 1995, Cancer
76:2004). Screening for LPA to detect gynecological cancers including ovarian cancer 
exhibited a sensitivity of about 96% and a specificity of about 89%. However, CA125-
based screening methods and LPA-based screening methods are hampered by the
presence of CA125 and LPA, respectively, in the serum of patients afflicted with
conditions other than ovarian cancer. For example, serum CA125 levels are known to
be associated with menstruation, pregnancy, gastrointestinal and hepatic conditions such 
as colitis and cirrhosis, pericarditis, renal disease, and various non-ovarian malignancies. 
Serum LPA is known, for example, to be affected by the presence of non-ovarian
gynecological malignancies. A screening method having a greater specificity for
ovarian cancer than the current screening methods for CA125 and LPA could provide a
population-wide screening for early stage ovarian cancer. 
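The link between specificity and population-wide screening can be made concrete with Bayes' rule, which relates PPV to sensitivity, specificity, and disease prevalence. The sketch below takes the sensitivity (about 96%) and specificity (about 89%) quoted above for LPA at face value; the prevalence is a hypothetical placeholder, not a figure from this document, and the function names are illustrative only.

```python
def ppv_from_rates(sensitivity: float, specificity: float, prevalence: float) -> float:
    """PPV = P(afflicted | positive result), by Bayes' rule."""
    true_pos = sensitivity * prevalence
    false_pos = (1.0 - specificity) * (1.0 - prevalence)
    return true_pos / (true_pos + false_pos)


def specificity_needed(sensitivity: float, prevalence: float, target_ppv: float) -> float:
    """Smallest specificity for which the PPV reaches target_ppv."""
    max_false_pos_rate = (
        sensitivity * prevalence * (1.0 - target_ppv)
        / (target_ppv * (1.0 - prevalence))
    )
    return 1.0 - max_false_pos_rate


# Hypothetical prevalence in a screened population (assumption, illustration only).
ASSUMED_PREVALENCE = 0.0004

if __name__ == "__main__":
    ppv = ppv_from_rates(0.96, 0.89, ASSUMED_PREVALENCE)
    needed = specificity_needed(0.96, ASSUMED_PREVALENCE, target_ppv=0.10)
    print(f"PPV at the assumed prevalence: {ppv:.4f}")
    print(f"specificity needed for a 10% PPV: {needed:.4f}")
```

Under the assumed prevalence, the PPV falls far below the 10% threshold, and the required specificity is much closer to 100% than 89%. This is the arithmetic behind the call for a more specific marker.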

Presently greater than about 60% of ovarian cancers diagnosed in patients are 
stage III or stage IV cancers. Treatment at these stages is largely limited to
cytoreductive surgery (when feasible) and chemotherapy, both of which aim to slow the
spread and development of metastasized tumor. Substantially all late stage ovarian 
cancer patients currently undergo combination chemotherapy as primary treatment, 
usually a combination of a platinum compound and a taxane. Median survival for 
responding patients is about one year. Combination chemotherapy involving agents
such as doxorubicin, cyclophosphamide, cisplatin, hexamethylmelamine, paclitaxel, and
methotrexate may improve survival rates in these groups, relative to single-agent 
therapies. Various recently-developed chemotherapeutic agents and treatment regimens 
have also demonstrated usefulness for treatment of advanced ovarian cancer. For
example, use of the topoisomerase I inhibitor topotecan, use of amifostine to minimize
chemotherapeutic side effects, and use of intraperitoneal chemotherapy for patients 
having peritoneally implanted tumors have demonstrated at least limited utility. 
Presently, however, the 5-year survival rate for patients afflicted with stage III ovarian
cancer is 25%, and the survival rate for patients afflicted with stage IV ovarian cancer is 
8%. 

In summary, the earlier ovarian cancer is detected, the more the aggressiveness of
therapeutic intervention and the side effects associated with therapeutic intervention are
minimized. More importantly, the earlier the cancer is detected, the more the survival rate
and quality of life of ovarian cancer patients are enhanced. Thus, a pressing need exists for
methods of detecting ovarian cancer as early as possible. There also exists a need for 
methods of detecting recurrence of ovarian cancer as well as methods for predicting and 
monitoring the efficacy of treatment. The present invention satisfies these needs. 

SUMMARY OF THE INVENTION 
The invention relates to novel genes associated with ovarian cancer as well as 
methods of assessing whether a patient is afflicted with ovarian cancer. This method 
comprises the step of comparing the level of expression of a marker in a patient sample,
wherein the marker is listed in Tables 1-2, and the normal level of expression of the
marker in a control, e.g., a sample from a patient without ovarian cancer. A significant 
difference between the level of expression of the marker in the patient sample and the 
normal level is an indication that the patient is afflicted with ovarian cancer. Preferably, 
a protein corresponding to the marker is a secreted protein. Alternatively, the marker
can correspond to a protein having an extracellular portion, to one which is normally
expressed in ovarian tissue at a detectable level, or both. 

In one method, the marker(s) are preferably selected such that the positive 
predictive value of the method is at least about 10%. Also preferred are embodiments of 
the method wherein the marker is over- or under-expressed by at least two-fold in at
least about 20% of stage I ovarian cancer patients, stage II ovarian cancer patients, stage
III ovarian cancer patients, stage IV ovarian cancer patients, grade I ovarian cancer 
patients, grade II ovarian cancer patients, grade III ovarian cancer patients, epithelial 
ovarian cancer patients, stromal ovarian cancer patients, germ cell ovarian cancer
patients, malignant ovarian cancer patients, benign ovarian patients, serous neoplasm 
ovarian cancer patients, mucinous neoplasm ovarian cancer patients, endometrioid 
neoplasm ovarian cancer patients and/or clear cell neoplasm ovarian cancer patients. 
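The "at least two-fold in at least about 20% of patients" preference can be checked mechanically once expression levels have been measured. The following is a minimal sketch, assuming each patient in a subgroup has a single positive expression value for the marker and that one normal level is available; the function name and the data are hypothetical.

```python
from typing import Sequence


def fraction_with_fold_change(
    patient_levels: Sequence[float],
    normal_level: float,
    fold: float = 2.0,
) -> float:
    """Fraction of patients whose level is at least `fold` times above,
    or at least `fold` times below, the normal level."""
    if not patient_levels:
        raise ValueError("patient_levels must not be empty")
    if normal_level <= 0 or fold <= 1:
        raise ValueError("normal_level must be > 0 and fold must be > 1")
    altered = sum(
        1
        for level in patient_levels
        if level >= fold * normal_level or level <= normal_level / fold
    )
    return altered / len(patient_levels)


# Hypothetical expression levels for one patient subgroup (arbitrary units).
stage_one_levels = [1.0, 2.5, 0.4, 1.1, 3.0]

if __name__ == "__main__":
    frac = fraction_with_fold_change(stage_one_levels, normal_level=1.0)
    print(f"fraction altered at least two-fold: {frac:.2f}")
    print("meets the 20% preference:", frac >= 0.20)
```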

In one embodiment of the methods of the present invention, the patient sample is
an ovary-associated body fluid. Such fluids include, for example, blood fluids, lymph,
ascitic fluids, gynecological fluids, cystic fluids, urine, and fluids collected by peritoneal 
rinsing. In another embodiment, the sample comprises cells obtained from the patient. 
In this embodiment, the cells may be found in a fluid selected from the group consisting
of a fluid collected by peritoneal rinsing, a fluid collected by uterine rinsing, a uterine
fluid, a uterine exudate, a pleural fluid, and an ovarian exudate. In another embodiment, 
the patient sample is in vivo. 

In accordance with the methods of the present invention, the level of expression 
of the marker in a sample can be assessed, for example, by detecting the presence in the
sample of:

• a protein corresponding to the marker or fragment of the protein (e.g.
using a reagent, such as an antibody, an antibody derivative, or an antibody
fragment, which binds specifically with the protein);

• a transcribed polynucleotide (e.g. an mRNA or a cDNA), or fragment
thereof, having at least a portion with which the marker is substantially
homologous (e.g. by contacting a mixture of transcribed polynucleotides
obtained from the sample with a substrate having one or more of the markers
listed in Tables 1-2 fixed thereto at selected positions);

• a transcribed polynucleotide or fragment thereof, wherein the
polynucleotide anneals with the marker under stringent hybridization
conditions;

• a metabolite which is produced directly (i.e., catalyzed) or indirectly by a
protein corresponding to the marker.

The methods of the present invention are particularly useful for patients with an 
identified pelvic mass or symptoms associated with ovarian cancer. The methods of the
present invention can also be of particular use with patients having an enhanced risk of 
developing ovarian cancer (e.g., patients having a familial history of ovarian cancer, 
patients identified as having a mutant oncogene, and patients at least about 50 years of
age). The methods of the present invention may further be of particular use in 
monitoring the efficacy of treatment of an ovarian cancer patient (e.g. the efficacy of 
chemotherapy). 

The methods of the present invention may be performed using a plurality (e.g. 2,
3, 5, or 10 or more) of markers. According to a method involving a plurality of markers,
the level of expression in the sample of each of a plurality of markers independently 
selected from the markers listed in Tables 1-2 is compared with the normal level of 
expression of each of the plurality of markers in samples of the same type obtained from 
control humans not afflicted with ovarian cancer. The markers of Tables 1-2 may also
be used in combination with known ovarian cancer markers in the methods of the 
present invention. 
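A comparison across a plurality of markers might be organized as in the sketch below, which reports a log2 ratio of sample level to normal level for each marker measured in both. The marker names are placeholders rather than entries from Tables 1-2, and the use of a log2 ratio is an assumption made for illustration; on that scale a two-fold change corresponds to an absolute value of at least 1.

```python
import math
from typing import Dict, Mapping


def panel_log2_ratios(
    sample_levels: Mapping[str, float],
    normal_levels: Mapping[str, float],
) -> Dict[str, float]:
    """log2(sample / normal) for every marker measured in both inputs.

    Positive values indicate a higher level in the patient sample,
    negative values a lower level.
    """
    ratios = {}
    for marker, normal in normal_levels.items():
        if marker not in sample_levels:
            continue  # marker not measured in this sample
        level = sample_levels[marker]
        if level <= 0 or normal <= 0:
            raise ValueError(f"non-positive expression level for {marker}")
        ratios[marker] = math.log2(level / normal)
    return ratios


# Placeholder marker names and hypothetical levels (arbitrary units).
normal = {"marker_A": 1.0, "marker_B": 2.0, "marker_C": 5.0}
sample = {"marker_A": 4.0, "marker_B": 1.0}

if __name__ == "__main__":
    for marker, ratio in panel_log2_ratios(sample, normal).items():
        print(f"{marker}: log2 ratio = {ratio:+.2f}")
```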

In a preferred method of assessing whether a patient is afflicted with ovarian 
cancer (e.g., new detection ("screening"), detection of recurrence, reflex testing), the 
method comprises comparing:

a) the level of expression of a marker in a patient sample, wherein at least 
one marker is selected from the markers of Tables 1-2, and 

b) the normal level of expression of the marker in a control non-ovarian 
cancer sample. 

A significant difference between the level of expression of the marker in the patient
sample and the normal level is an indication that the patient is afflicted with ovarian 
cancer. 

The methods of the present invention further include a method of assessing the 
efficacy of a test compound for inhibiting ovarian cancer in a patient. This method 
comprises comparing:

a) expression of a marker in a first sample obtained from the patient and 
maintained in the presence of the test compound, wherein the marker is 
selected from the group consisting of the markers listed in Tables 1-2, 
and 

b) expression of the marker in a second sample obtained from the patient
and maintained in the absence of the test compound.

A significant difference in the level of expression of the marker in the first
sample, relative to the second sample, is an indication that the test compound is 
efficacious for inhibiting ovarian cancer in the patient. For example, the first and 
second samples can be portions of a single sample obtained from the patient or portions 
of pooled samples obtained from the patient.

The invention further relates to a method of assessing the efficacy of a therapy 
for inhibiting ovarian cancer in a patient. This method comprises comparing:

a) expression of a marker in a first sample obtained from the patient prior 
to providing at least a portion of the therapy to the patient, wherein the
marker is selected from the group consisting of the markers listed in
Tables 1-2, and

b) expression of the marker in a second sample obtained from the patient 
following provision of the portion of the therapy. 

A significant difference in the level of expression of the marker in the second
sample, relative to the first sample, is an indication that the therapy is efficacious for
inhibiting ovarian cancer in the patient. 

It will be appreciated that in these methods the "therapy" may be any traditional 
therapy for treating ovarian cancer including, but not limited to, chemotherapy, radiation 
therapy and surgical removal of tissue, e.g., an ovarian tumor. Thus, the methods of the 
invention may be used to evaluate a patient before, during and after therapy, for
example, to evaluate the reduction in tumor burden. 

The present invention therefore further comprises a method for monitoring the 
progression of ovarian cancer in a patient, the method comprising: 

a) detecting in a patient sample at a first time point, the expression of a marker,
wherein the marker is selected from the group consisting of the markers listed in Tables
1-2;

b) repeating step a) at a subsequent point in time; and

c) comparing the level of expression detected in steps a) and b), and therefrom 
monitoring the progression of ovarian cancer in the patient. 
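Steps a) through c) amount to tracking one marker across time points and comparing each later measurement with the first. A minimal sketch follows, assuming measurements are recorded as (time point, level) pairs with a positive baseline level; the function name, the time unit, and the values are hypothetical.

```python
from typing import List, Sequence, Tuple


def expression_relative_to_baseline(
    measurements: Sequence[Tuple[float, float]],
) -> List[Tuple[float, float]]:
    """Return (time_point, level / baseline_level) for each later time point.

    `measurements` holds (time_point, level) pairs; the earliest time point
    is treated as the baseline from step a).
    """
    if len(measurements) < 2:
        raise ValueError("need a first time point and at least one later one")
    ordered = sorted(measurements)
    _, baseline_level = ordered[0]
    if baseline_level <= 0:
        raise ValueError("baseline level must be positive")
    return [(t, level / baseline_level) for t, level in ordered[1:]]


# Hypothetical measurements: (days since first sample, expression level).
series = [(60, 2.5), (0, 10.0), (30, 5.0)]

if __name__ == "__main__":
    for day, ratio in expression_relative_to_baseline(series):
        print(f"day {day}: {ratio:.2f} x baseline")
```

Whether a rising or falling ratio indicates progression depends on whether the marker is over- or under-expressed in ovarian cancer, so the sketch reports the ratio and leaves the interpretation to the caller.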

The invention also includes a method of selecting a composition for inhibiting
ovarian cancer in a patient. This method comprises the steps of:

a) obtaining a sample comprising cancer cells from the patient; 

b) separately maintaining aliquots of the sample in the presence of a
plurality of test compositions; 

c) comparing expression of a marker listed in Tables 1-2 in each of the 
aliquots; and 

d) selecting one of the test compositions which alters the level of
expression of the marker in the aliquot containing that test composition,
relative to other test compositions.
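Steps a) through d) can be sketched as a simple selection over per-aliquot measurements. The sketch assumes one expression value per aliquot and assumes that the desired direction of change is known: a lower level for a marker over-expressed in ovarian cancer, a higher level for an under-expressed one. The composition names and the selection rule are hypothetical and are only one reading of "alters the level of expression relative to other test compositions".

```python
from typing import Mapping


def select_composition(
    level_by_composition: Mapping[str, float],
    marker_overexpressed_in_cancer: bool = True,
) -> str:
    """Pick the test composition whose aliquot shows the most favorable level.

    For a marker over-expressed in cancer, the lowest level is chosen;
    for an under-expressed marker, the highest level is chosen.
    """
    if not level_by_composition:
        raise ValueError("no aliquots were measured")
    choose = min if marker_overexpressed_in_cancer else max
    return choose(level_by_composition, key=level_by_composition.get)


# Hypothetical marker levels measured in each aliquot (arbitrary units).
aliquot_levels = {
    "composition_1": 8.2,
    "composition_2": 3.1,
    "composition_3": 7.9,
}

if __name__ == "__main__":
    print("selected:", select_composition(aliquot_levels))
```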

In addition, the invention includes a method of inhibiting ovarian cancer in a
patient. This method comprises the steps of:

a) obtaining a sample comprising cancer cells from the patient;

b) separately maintaining aliquots of the sample in the presence of a 
plurality of test compositions; 

c) comparing expression of a marker listed in Tables 1-2 in each of the 
aliquots; and 

d) administering to the patient at least one of the test compositions which
alters the level of expression of the marker in the aliquot containing that
test composition, relative to other test compositions.

The invention also includes a kit for assessing whether a patient is afflicted with
ovarian cancer. This kit comprises reagents for assessing expression of a marker listed
in Tables 1-2.

In another aspect, the invention relates to a kit for assessing the suitability of 
each of a plurality of compounds for inhibiting an ovarian cancer in a patient. The kit 
comprises a reagent for assessing expression of a marker listed in Tables 1-2, and may 
also comprise a plurality of compounds. 

In another aspect, the invention relates to a kit for assessing the presence of
ovarian cancer cells. This kit comprises an antibody, wherein the antibody binds
specifically with a protein corresponding to a marker listed in Tables 1-2. The kit may 
also comprise a plurality of antibodies, wherein each of the plurality binds specifically with a
protein corresponding to a different marker listed in Tables 1-2. 

The invention also includes a kit for assessing the presence of ovarian cancer
cells, wherein the kit comprises a nucleic acid probe. The probe binds specifically with
a transcribed polynucleotide corresponding to a marker listed in Tables 1-2. The kit
may also comprise a plurality of probes, wherein each of the probes binds specifically
with a transcribed polynucleotide corresponding to a different marker listed in Tables 1-2.

The invention further relates to a method of making an isolated hybridoma which 
produces an antibody useful for assessing whether a patient is afflicted with ovarian
cancer. The method comprises isolating a protein corresponding to a marker listed in 
Tables 1-2, immunizing a mammal using the isolated protein, isolating splenocytes from 
the immunized mammal, fusing the isolated splenocytes with an immortalized cell line 
to form hybridomas, and screening individual hybridomas for production of an antibody 
which specifically binds with the protein to isolate the hybridoma. The invention also
includes an antibody produced by this method. 

The invention further includes a method of assessing the ovarian carcinogenic 
potential of a test compound. This method comprises the steps of: 

a) maintaining separate aliquots of ovarian cells in the presence and 
absence of the test compound; and

b) comparing expression of a marker in each of the aliquots.

The marker is selected from those listed in Tables 1-2. A significantly altered
level of expression of the marker in the aliquot maintained in the presence of (or 
exposed to) the test compound, relative to the aliquot maintained in the absence of the 
test compound, is an indication that the test compound possesses ovarian carcinogenic
potential. 

Additionally, the invention includes a kit for assessing the ovarian carcinogenic 
potential of a test compound. The kit comprises ovarian cells and a reagent for 
assessing expression of a marker in each of the aliquots. The marker is selected from
those listed in Tables 1-2.

The invention further relates to a method of treating a patient afflicted with 
ovarian cancer or at risk of developing ovarian cancer. This method comprises 
enhancing expression of a marker listed in Tables 1-2 or providing to cells of the patient 
a protein corresponding to a marker listed in Tables 1-2, wherein the marker is
underexpressed in patients afflicted with ovarian cancer. The protein can be provided to
the cells, for example, by providing a vector comprising a polynucleotide encoding the 
protein to the cells. 

The invention includes another method of treating a patient afflicted with ovarian
cancer or at risk of developing ovarian cancer. This method comprises inhibiting 
expression or overexpression of a marker listed in Tables 1-2 by, e.g., providing to cells 
of the patient an antisense oligonucleotide complementary to a polynucleotide 
corresponding to a marker listed in Tables 1-2, wherein the marker is overexpressed in
patients afflicted with ovarian cancer. 

It will be appreciated that the methods and kits of the present invention may also 
include known cancer markers including known ovarian cancer markers. It will further 
be appreciated that the methods and kits may be used to identify cancers other than 
ovarian cancer.

DETAILED DESCRIPTION OF THE INVENTION 
The invention relates to newly discovered genes associated with the cancerous 
state of ovarian cells. It has been discovered that the level of expression of individual 
genes, also referred to as markers, and combinations of these genes, correlates with the
presence of ovarian cancer in a patient. Methods are provided for detecting the presence
of ovarian cancer in a sample, the absence of ovarian cancer in a sample, the stage of an
ovarian cancer, and other characteristics of ovarian cancer that are relevant to
prevention, diagnosis, characterization, and therapy of ovarian cancer in a patient. 

Definitions 

As used herein, each of the following terms has the meaning associated with it in 
this section. 

The articles "a" and "an" are used herein to refer to one or to more than one (i.e.
to at least one) of the grammatical object of the article. By way of example, "an
element" means one element or more than one element. 

A "marker" is a naturally-occurring polymer corresponding to at least one of the 
novel nucleic acids listed in Tables 1-2. For example, markers include, without 
limitation, sense and anti-sense strands of genomic DNA (i.e. including any introns 
occurring therein), RNA generated by transcription of genomic DNA (i.e. prior to
splicing), RNA generated by splicing of RNA transcribed from genomic DNA, and 
proteins generated by translation of spliced RNA (i.e. including proteins both before and 
after cleavage of normally cleaved regions such as transmembrane signal sequences).
As used herein, "marker" may also include a cDNA made by reverse transcription of an 
RNA generated by transcription of genomic DNA (including spliced RNA). 

The term "probe" refers to any molecule which is capable of selectively binding 
to a specifically intended target molecule, for example a marker of the invention.
Probes can be either synthesized by one skilled in the art, or derived from appropriate 
biological preparations. For purposes of detection of the target molecule, probes may be 
specifically designed to be labeled, as described herein. Examples of molecules that can 
be utilized as probes include, but are not limited to, RNA, DNA, proteins, antibodies, 
and organic monomers.

An "ovary-associated" body fluid is a fluid which, when in the body of a patient, 
contacts or passes through ovarian cells or into which cells or proteins shed from 
ovarian cells (e.g. ovarian epithelium) are capable of passing. Exemplary ovary-
associated body fluids include blood fluids, lymph, ascites, gynecological fluids, cystic
fluid, urine, and fluids collected by peritoneal rinsing.

The "normal" level of expression of a marker is the level of expression of the 
marker in ovarian cells of a patient, e.g. a human, not afflicted with ovarian cancer. 

"Over-expression" and "under-expression" of a marker refer to expression of the 
marker in a patient at a greater or lesser level, respectively, than the normal level of
expression of the marker (e.g. at least two-fold greater or lesser level).

As used herein, the term "promoter/regulatory sequence" means a nucleic acid 
sequence which is required for expression of a gene product operably linked to the 
promoter/regulatory sequence. In some instances, this sequence may be the core 
promoter sequence and in other instances, this sequence may also include an enhancer 
sequence and other regulatory elements which are required for expression of the gene
product. The promoter/regulatory sequence may, for example, be one which expresses 
the gene product in a tissue-specific manner. 

A "constitutive" promoter is a nucleotide sequence which, when operably linked 
with a polynucleotide which encodes or specifies a gene product, causes the gene 
product to be produced in a living human cell under most or all physiological conditions
of the cell. 

An "inducible" promoter is a nucleotide sequence which, when operably linked
with a polynucleotide which encodes or specifies a gene product, causes the gene 
product to be produced in a living human cell substantially only when an inducer which 
corresponds to the promoter is present in the cell. 

A "tissue-specific" promoter is a nucleotide sequence which, when operably
linked with a polynucleotide which encodes or specifies a gene product, causes the gene
product to be produced in a living human cell substantially only if the cell is a cell of the 
tissue type corresponding to the promoter. 

A "transcribed polynucleotide" is a polynucleotide (e.g. an RNA, a cDNA, or an
analog of one of an RNA or cDNA) which is complementary to or homologous with all
or a portion of a mature RNA made by transcription of a genomic DNA corresponding
to a marker of the invention and normal post-transcriptional processing (e.g. splicing), if
any, of the transcript.

"Complementary" refers to the broad concept of sequence complementarity
between regions of two nucleic acid strands or between two regions of the same nucleic
acid strand. It is known that an adenine residue of a first nucleic acid region is capable
of forming specific hydrogen bonds ("base pairing") with a residue of a second nucleic
acid region which is antiparallel to the first region if the residue is thymine or uracil.
Similarly, it is known that a cytosine residue of a first nucleic acid strand is capable of
base pairing with a residue of a second nucleic acid strand which is antiparallel to the
first strand if the residue is guanine. A first region of a nucleic acid is complementary to 
a second region of the same or a different nucleic acid if, when the two regions are 
arranged in an antiparallel fashion, at least one nucleotide residue of the first region is 
capable of base pairing with a residue of the second region. Preferably, the first region
comprises a first portion and the second region comprises a second portion, whereby,
when the first and second portions are arranged in an antiparallel fashion, at least about 
50%, and preferably at least about 75%, at least about 90%, or at least about 95% of the 
nucleotide residues of the first portion are capable of base pairing with nucleotide 
residues in the second portion. More preferably, all nucleotide residues of the first
portion are capable of base pairing with nucleotide residues in the second portion.
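The percentage figures in this definition can be computed directly. The following minimal sketch assumes two portions of equal length, both written 5' to 3', and arranges them antiparallel by reversing the second; the function name and the example sequences are illustrative only.

```python
# Base pairs named in the definition (A with T or U, C with G), listed in
# both orientations so either portion may supply either residue.
BASE_PAIRS = {
    ("A", "T"), ("T", "A"),
    ("A", "U"), ("U", "A"),
    ("C", "G"), ("G", "C"),
}


def percent_complementary(first: str, second: str) -> float:
    """Percentage of residues in `first` that can base pair with `second`
    when the two equal-length portions are arranged antiparallel.

    Both sequences are given 5'->3'; `second` is reversed so that position i
    of `first` faces position i of the reversed `second`.
    """
    if not first or len(first) != len(second):
        raise ValueError("portions must be non-empty and of equal length")
    first, second = first.upper(), second.upper()
    paired = sum(
        1 for a, b in zip(first, reversed(second)) if (a, b) in BASE_PAIRS
    )
    return 100.0 * paired / len(first)


if __name__ == "__main__":
    # GGCAAT is the reverse complement of ATTGCC, so every residue pairs.
    print(percent_complementary("ATTGCC", "GGCAAT"))
    print(percent_complementary("ATGC", "GCAA"))
```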

"Homologous" as used herein, refers to nucleotide sequence similarity between
two regions of the same nucleic acid strand or between regions of two different nucleic 
acid strands. When a nucleotide residue position in both regions is occupied by the 
same nucleotide residue, then the regions are homologous at that position. A first region 
is homologous to a second region if at least one nucleotide residue position of each
region is occupied by the same residue. Homology between two regions is expressed in 
terms of the proportion of nucleotide residue positions of the two regions that are 
occupied by the same nucleotide residue. By way of example, a region having the 
nucleotide sequence 5 r -ATTGCC-3' and a region having the nucleotide sequence 5'- 

10 TATGGC-3' share 50% homology. Preferably, the first region comprises a first portion 
and the second region comprises a second portion, whereby, at least about 50%, and 
preferably at least about 75%, at least about 90%, or at least about 95% of the nucleotide 
residue positions of each of the portions are occupied by the same nucleotide residue. 
More preferably, all nucleotide residue positions of each of the portions are occupied by 

15 the same nucleotide residue. 
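
By way of a non-limiting illustration, homology expressed as the proportion of positions occupied by the same residue can be computed as sketched below. The function name percent_homology is hypothetical, and the sketch assumes two ungapped regions of equal length compared position by position in the same 5' to 3' orientation.

```python
def percent_homology(first: str, second: str) -> float:
    """Percentage of positions at which two equal-length regions
    carry the same nucleotide residue."""
    first, second = first.upper(), second.upper()
    if not first or len(first) != len(second):
        raise ValueError("regions must be non-empty and of equal length")
    same = sum(a == b for a, b in zip(first, second))
    return 100.0 * same / len(first)


# The example from the text: 5'-ATTGCC-3' versus 5'-TATGGC-3'.
# Positions 3 (T), 4 (G), and 6 (C) match, i.e. 3 of 6 positions.
example = percent_homology("ATTGCC", "TATGGC")  # 50.0
```
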

A marker is "fixed" to a substrate if it is covalently or non-covalently associated 
with the substrate such that the substrate can be rinsed with a fluid (e.g. standard saline
citrate, pH 7.4) without a substantial fraction of the marker dissociating from the 
substrate. 

As used herein, a "naturally-occurring" nucleic acid molecule refers to an RNA
or DNA molecule having a nucleotide sequence that occurs in nature (e.g. encodes a
natural protein).

Expression of a marker in a patient is "significantly" higher or lower than the
normal level of expression of a marker if the level of expression of the marker is greater
or less, respectively, than the normal level by an amount greater than the standard error
of the assay employed to assess expression, and preferably at least twice, and more
preferably three, four, five or ten times that amount. Alternately, expression of the
marker in the patient can be considered "significantly" higher or lower than the normal
level of expression if the level of expression is at least about two, and preferably at least
about three, four, or five times, higher or lower, respectively, than the normal level of
expression of the marker.
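
By way of a non-limiting illustration, the two alternative tests for "significantly" higher or lower expression can be sketched as follows. The function names, the parameter names, and the reading of "N times lower" as the normal level divided by N are assumptions made for this sketch; expression levels are assumed to be non-negative numbers on a linear scale.

```python
from typing import Optional


def significance_by_standard_error(
    patient: float, normal: float, standard_error: float, multiple: float = 1.0
) -> Optional[str]:
    """First test: the difference from normal exceeds `multiple` times
    the standard error of the assay (1 by default; 2, 3, 4, 5 or 10
    for the preferred, more stringent cases)."""
    difference = patient - normal
    if abs(difference) > multiple * standard_error:
        return "higher" if difference > 0 else "lower"
    return None


def significance_by_fold(
    patient: float, normal: float, fold: float = 2.0
) -> Optional[str]:
    """Second test: the patient level is at least `fold` times higher or
    lower than normal. "Lower" is read here as normal / fold."""
    if normal <= 0 or patient < 0:
        raise ValueError("levels must be non-negative and normal must be positive")
    if patient >= fold * normal:
        return "higher"
    if patient <= normal / fold:
        return "lower"
    return None
```

Each function returns "higher", "lower", or None, so that either test can feed a downstream comparison without committing to one definition.
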

Ovarian cancer is "inhibited" if at least one symptom of the cancer is alleviated,
terminated, slowed, or prevented. As used herein, ovarian cancer is also "inhibited" if 
recurrence or metastasis of the cancer is reduced, slowed, delayed, or prevented. 

A kit is any manufacture (e.g. a package or container) comprising at least one
reagent, e.g. a probe, for specifically detecting a marker of the invention, the
manufacture being promoted, distributed, or sold as a unit for performing the methods of
the present invention.

Description 

The present invention is based, in part, on identification of novel markers which
are over-expressed in ovarian cancer cells as compared to their expression in normal (i.e.
non-cancerous) ovarian cells. The markers of the invention correspond to DNA, RNA,
and polypeptide molecules which can be detected in one or both of normal and
cancerous ovarian cells. The enhanced expression of one or more of these markers in
ovarian cells is herein correlated with the cancerous state of the tissue. The invention
thus includes compositions, kits, and methods for assessing the cancerous state of
ovarian cells (e.g. cells obtained from a human, cultured human cells, archived or
preserved human cells, and in vivo cells).

The compositions, kits, and methods of the invention have the following uses,
among others:

1) assessing whether a patient is afflicted with ovarian cancer;
2) assessing the stage of ovarian cancer in a human patient;
3) assessing the grade of ovarian cancer in a patient;
4) assessing the benign or malignant nature of ovarian cancer in a patient;
5) assessing the histological type of neoplasm (e.g. serous, mucinous,
endometrioid, or clear cell neoplasm) associated with ovarian
cancer in a patient;
6) making an isolated hybridoma which produces an antibody useful for
assessing whether a patient is afflicted with ovarian cancer;
7) assessing the presence of ovarian cancer cells;
8) assessing the efficacy of one or more test compounds for inhibiting
ovarian cancer in a patient;
9) assessing the efficacy of a therapy for inhibiting ovarian cancer in a
patient;
10) monitoring the progression of ovarian cancer in a patient;
11) selecting a composition or therapy for inhibiting ovarian cancer in a
patient;
12) treating a patient afflicted with ovarian cancer;
13) inhibiting ovarian cancer in a patient;
14) assessing the ovarian carcinogenic potential of a test compound;
and
15) inhibiting an ovarian cancer in a patient at risk for developing
ovarian cancer.

The invention thus includes a method of assessing whether a patient is afflicted
with ovarian cancer. This method comprises comparing the level of expression of a
marker in a patient sample and the normal level of expression of the marker in a control,
e.g., a non-ovarian cancer sample. A significant difference between the level of
expression of the marker in the patient sample and the normal level is an indication that
the patient is afflicted with ovarian cancer. The marker is selected from the group
consisting of the markers listed in Tables 1-2.

The polynucleotides set forth in Tables 1-2 represent previously unidentified
nucleotide sequences. These nucleotide sequences were identified through subtracted
library experiments described herein. Also provided by this invention are
polynucleotides that correspond to the polynucleotides of Tables 1-2. In one
embodiment, these polynucleotides are obtained by identification of a larger fragment or
full-length coding sequence of these polynucleotides. Gene delivery vehicles, host cells,
compositions and databases (all described herein) containing these polynucleotides are
also provided by this invention.

Any marker or combination of markers listed in Tables 1-2, as well as any
known markers in combination with the markers set forth in Tables 1-2, may be used in
the compositions, kits, and methods of the present invention. In general, it is preferable
to use markers for which the difference between the level of expression of the marker in
ovarian cancer cells and the level of expression of the same marker in normal ovarian
cells is as great as possible. Although this difference can be as small as the limit of
detection of the method for assessing expression of the marker, it is preferred that the
difference be at least greater than the standard error of the assessment method, and
preferably a difference of at least 2-, 3-, 4-, 5-, 6-, 7-, 8-, 9-, 10-, 15-, 20-, 25-, 100-,
500-, 1000-fold or greater.
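
By way of a non-limiting illustration, the preference for markers showing the greatest difference between cancerous and normal cells can be sketched as a ranking by fold difference. The function name rank_markers_by_fold, the dictionary inputs, and the marker labels used in the usage note are hypothetical and do not correspond to entries in the Tables.

```python
from typing import Dict, List, Tuple


def rank_markers_by_fold(
    cancer: Dict[str, float], normal: Dict[str, float], min_fold: float = 2.0
) -> List[Tuple[str, float]]:
    """Return (marker, fold) pairs with fold >= min_fold, largest first.

    `cancer` and `normal` map a marker label to its expression level in
    ovarian cancer cells and in normal ovarian cells, respectively.
    """
    ranked = []
    for marker, cancer_level in cancer.items():
        normal_level = normal.get(marker)
        if normal_level is None or normal_level <= 0:
            continue  # fold difference is undefined without a positive baseline
        fold = cancer_level / normal_level
        if fold >= min_fold:
            ranked.append((marker, fold))
    ranked.sort(key=lambda item: item[1], reverse=True)
    return ranked
```

For hypothetical levels {"M1": 50, "M2": 12, "M3": 300} in cancer cells and {"M1": 10, "M2": 10, "M3": 3} in normal cells, M3 (100-fold) ranks ahead of M1 (5-fold), and M2 (1.2-fold) falls below the 2-fold minimum.
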

It is recognized that certain markers correspond to proteins which are secreted
from ovarian cells (i.e. one or both of normal and cancerous cells) to the extracellular
space surrounding the cells. These markers are preferably used in certain embodiments
of the compositions, kits, and methods of the invention, owing to the fact that the protein
corresponding to each of these markers can be detected in an ovary-associated body
fluid sample, which may be more easily collected from a human patient than a tissue
biopsy sample. In addition, preferred in vivo techniques for detection of a protein
corresponding to a marker of the invention include introducing into a subject a labeled
antibody directed against the protein. For example, the antibody can be labeled with a
radioactive marker whose presence and location in a subject can be detected by standard
imaging techniques.

It is a simple matter for the skilled artisan to determine whether any particular
marker corresponds to a secreted protein. In order to make this determination, the
protein corresponding to a marker is expressed in a test cell (e.g. a cell of an ovarian cell
line), extracellular fluid is collected, and the presence or absence of the protein in the
extracellular fluid is assessed (e.g. using a labeled antibody which binds specifically
with the protein).

The following is an example of a method which can be used to detect secretion
of a protein corresponding to a marker of the invention. About 8 x 10^5 293T cells are
incubated at 37°C in wells containing growth medium (Dulbecco's modified Eagle's
medium {DMEM} supplemented with 10% fetal bovine serum) under a 5% (v/v) CO2,
95% air atmosphere to about 60-70% confluence. The cells are then transfected using a
standard transfection mixture comprising 2 micrograms of DNA comprising an
expression vector encoding the protein and 10 microliters of LipofectAMINE™
(GIBCO/BRL Catalog no. 18342-012) per well. The transfection mixture is maintained
for about 5 hours, and then replaced with fresh growth medium and maintained in an air
atmosphere. Each well is gently rinsed twice with DMEM which does not contain
methionine or cysteine (DMEM-MC; ICN Catalog no. 16-424-54). About 1 milliliter
of DMEM-MC and about 50 microcuries of Trans-35S™ reagent (ICN Catalog no.
51006) are added to each well. The wells are maintained under the 5% CO2 atmosphere
described above and incubated at 37°C for a selected period. Following incubation, 150
microliters of conditioned medium is removed and centrifuged to remove floating cells
and debris. The presence of the protein in the supernatant is an indication that the
protein is secreted.

Examples of ovary-associated body fluids include blood fluids (e.g. whole blood,
blood serum, blood having platelets removed therefrom, etc.), lymph, ascitic fluids,
gynecological fluids (e.g. ovarian, fallopian, and uterine secretions, menses, vaginal
douching fluids, fluids used to rinse cervical cell samples, etc.), cystic fluid, urine, and
fluids collected by peritoneal rinsing (e.g. fluids applied and collected during
laparoscopy or fluids instilled into and withdrawn from the peritoneal cavity of a human
patient). In these embodiments, the level of expression of the marker can be assessed by
assessing the amount (e.g. absolute amount or concentration) of the marker in an
ovary-associated body fluid obtained from a patient. The fluid can, of course, be subjected to a
variety of well-known post-collection preparative and storage techniques (e.g. storage,
freezing, ultrafiltration, concentration, evaporation, centrifugation, etc.) prior to
assessing the amount of the marker in the fluid.

Many ovary-associated body fluids (i.e. usually excluding urine) can have
ovarian cells, e.g. ovarian epithelium, therein, particularly when the ovarian cells are
cancerous, and, more particularly, when the ovarian cancer is metastasizing.
Cell-containing fluids which can contain ovarian cancer cells include, but are not limited to,
peritoneal ascites, fluids collected by peritoneal rinsing, fluids collected by uterine
rinsing, uterine fluids such as uterine exudate and menses, pleural fluid, and ovarian
exudates. Thus, the compositions, kits, and methods of the invention can be used to
detect expression of markers corresponding to proteins having at least one portion which
is displayed on the surface of cells which express it. Examples of such proteins are
indicated in the Tables herein. Although not every protein having at least one
cell-surface portion is indicated in the Tables, it is a simple matter for the skilled artisan to
determine whether the protein corresponding to any particular marker comprises a
cell-surface protein. For example, immunological methods may be used to detect such
proteins on whole cells, or well known computer-based sequence analysis methods (e.g.
the SIGNALP program; Nielsen et al., 1997, Protein Engineering 10:1-6) may be used
to predict the presence of at least one extracellular domain (i.e. including both secreted
proteins and proteins having at least one cell-surface domain). Expression of a marker
corresponding to a protein having at least one portion which is displayed on the surface
of a cell which expresses it may be detected without necessarily lysing the cell (e.g.
using a labeled antibody which binds specifically with a cell-surface domain of the
protein).

Expression of a marker of the invention may be assessed by any of a wide
variety of well known methods for detecting expression of a transcribed molecule or
protein. Non-limiting examples of such methods include immunological methods for
detection of secreted, cell-surface, cytoplasmic, or nuclear proteins, protein purification
methods, protein function or activity assays, nucleic acid hybridization methods, nucleic
acid reverse transcription methods, and nucleic acid amplification methods.

In a preferred embodiment, expression of a marker is assessed using an antibody
(e.g. a radio-labeled, chromophore-labeled, fluorophore-labeled, or enzyme-labeled
antibody), an antibody derivative (e.g. an antibody conjugated with a substrate or with
the protein or ligand of a protein-ligand pair {e.g. biotin-streptavidin}), or an antibody
fragment (e.g. a single-chain antibody, an isolated antibody hypervariable domain, etc.)
which binds specifically with a protein corresponding to the marker, such as the protein
encoded by the open reading frame corresponding to the marker or such a protein which
has undergone all or a portion of its normal post-translational modification.

In another preferred embodiment, expression of a marker is assessed by
preparing mRNA/cDNA (i.e. a transcribed polynucleotide) from cells in a patient
sample, and by hybridizing the mRNA/cDNA with a reference polynucleotide which is a
complement of a polynucleotide comprising the marker, and fragments thereof. cDNA
can, optionally, be amplified using any of a variety of polymerase chain reaction
methods prior to hybridization with the reference polynucleotide; preferably, it is not
amplified. Expression of one or more markers can likewise be detected using
quantitative PCR to assess the level of expression of the marker(s). Alternatively, any
of the many known methods of detecting mutations or variants (e.g. single nucleotide
polymorphisms, deletions, etc.) of a marker of the invention may be used to detect
occurrence of a marker in a patient.

In a related embodiment, a mixture of transcribed polynucleotides obtained from
the sample is contacted with a substrate having fixed thereto a polynucleotide
complementary to or homologous with at least a portion (e.g. at least 7, 10, 15, 20, 25,
30, 40, 50, 100, 500, or more nucleotide residues) of a marker of the invention. If
polynucleotides complementary to or homologous with a plurality of markers are
differentially detectable on the substrate (e.g. detectable using different chromophores
or fluorophores, or fixed to different selected positions), then the levels of expression of
a plurality of markers can be assessed simultaneously using a single substrate (e.g. a
"gene chip" microarray of polynucleotides fixed at selected positions). When a method
of assessing marker expression is used which involves hybridization of one nucleic acid
with another, it is preferred that the hybridization be performed under stringent
hybridization conditions.

Because the compositions, kits, and methods of the invention rely on detection of
a difference in expression levels of one or more markers of the invention, it is preferable
that the level of expression of the marker is significantly greater than the minimum
detection limit of the method used to assess expression in at least one of normal ovarian
cells and cancerous ovarian cells.

It is understood that by routine screening of additional patient samples using one
or more of the markers of the invention, it will be realized that certain of the markers are
over- or under-expressed in cancers of various types, including specific ovarian cancers,
as well as other cancers such as breast cancer, cervical cancer, etc. For example, it will
be confirmed that some of the markers of the invention are over- or under-expressed in
most (i.e. 50% or more) or substantially all (i.e. 80% or more) of ovarian cancers.
Furthermore, it will be confirmed that certain of the markers of the invention are
associated with ovarian cancer of various stages (i.e. stage I, II, III, and IV ovarian
cancers, as well as subclassifications IA, IB, IC, IIA, IIB, IIC, IIIA, IIIB, and IIIC, using
the FIGO Stage Grouping system for primary carcinoma of the ovary; 1987, Am. J.
Obstet. Gynecol. 156:236), of various histologic subtypes (e.g. serous, mucinous,
endometrioid, and clear cell subtypes, as well as subclassifications and alternate
classifications adenocarcinoma, papillary adenocarcinoma, papillary
cystadenocarcinoma, surface papillary carcinoma, malignant adenofibroma,
cystadenofibroma, adenocarcinoma, cystadenocarcinoma, adenoacanthoma,
endometrioid stromal sarcoma, mesodermal (Mullerian) mixed tumor, mesonephroid
tumor, malignant carcinoma, Brenner tumor, mixed epithelial tumor, and
undifferentiated carcinoma, using the WHO/FIGO system for classification of malignant
ovarian tumors; Scully, Atlas of Tumor Pathology, 3d series, Washington DC), and
various grades (i.e. grade I {well differentiated}, grade II {moderately well
differentiated}, and grade III {poorly differentiated from surrounding normal tissue}).
In addition, as a greater number of patient samples are assessed for expression of the
markers of the invention and the outcomes of the individual patients from whom the
samples were obtained are correlated, it will also be confirmed that altered expression of
certain of the markers of the invention is strongly correlated with malignant cancers
and that altered expression of other markers of the invention is strongly correlated with
benign tumors. The compositions, kits, and methods of the invention are thus useful for
characterizing one or more of the stage, grade, histological type, and benign/malignant
nature of ovarian cancer in patients. In addition, these compositions, kits, and methods
can be used to detect and differentiate epithelial, stromal, and germ cell ovarian cancers.

When the compositions, kits, and methods of the invention are used for
characterizing one or more of the stage, grade, histological type, and benign/malignant
nature of ovarian cancer in a patient, it is preferred that the marker or panel of markers
of the invention is selected such that a positive result is obtained in at least about 20%,
and preferably at least about 40%, 60%, or 80%, and more preferably in substantially all
patients afflicted with an ovarian cancer of the corresponding stage, grade, histological
type, or benign/malignant nature. Preferably, the marker or panel of markers of the
invention is selected such that a PPV of greater than about 10% is obtained for the
general population (more preferably coupled with an assay specificity greater than
99.5%).
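
By way of a non-limiting illustration, the relationship between assay sensitivity, assay specificity, disease prevalence, and positive predictive value (PPV) follows from Bayes' rule and can be sketched as below. The function name and the prevalence used in the usage note are hypothetical values chosen only to show the arithmetic; they are not taken from the present disclosure or from epidemiological data.

```python
def positive_predictive_value(
    sensitivity: float, specificity: float, prevalence: float
) -> float:
    """PPV = true positives / (true positives + false positives),
    with all three inputs expressed as fractions between 0 and 1."""
    for value in (sensitivity, specificity, prevalence):
        if not 0.0 <= value <= 1.0:
            raise ValueError("inputs must be fractions between 0 and 1")
    true_positives = sensitivity * prevalence
    false_positives = (1.0 - specificity) * (1.0 - prevalence)
    if true_positives + false_positives == 0.0:
        raise ValueError("PPV is undefined when no positive results are expected")
    return true_positives / (true_positives + false_positives)
```

With a hypothetical prevalence of 0.0004 (40 per 100,000), a sensitivity of 0.80, and a specificity of 0.995, the function returns a PPV of roughly 6%. When the disease is rare in the screened population, false positives dominate the positive results, which is why a very high assay specificity matters for reaching a PPV above about 10%.
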

When a plurality of markers of the invention are used in the compositions, kits,
and methods of the invention, the level of expression of each marker in a patient sample
can be compared with the normal level of expression of each of the plurality of markers
in non-cancerous samples of the same type, either in a single reaction mixture (i.e. using
reagents, such as different fluorescent probes, for each marker) or in individual reaction
mixtures corresponding to one or more of the markers. In one embodiment, a
significantly enhanced level of expression of more than one of the plurality of markers
in the sample, relative to the corresponding normal levels, is an indication that the
patient is afflicted with ovarian cancer. In another embodiment, a significantly lower
level of expression in the sample of each of the plurality of markers, relative to the
corresponding normal levels, is an indication that the patient is afflicted with ovarian
cancer. In yet another embodiment, a significantly enhanced level of expression of one
or more markers and a significantly lower level of expression of one or more markers in a
sample, relative to the corresponding normal levels, is an indication that the patient is
afflicted with ovarian cancer. When a plurality of markers is used, it is preferred that 2,
3, 4, 5, 8, 10, 12, 15, 20, 30, or 50 or more individual markers be used, wherein fewer
markers are preferred.
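
By way of a non-limiting illustration, the three panel embodiments described above can be sketched as rules applied to per-marker calls. The function name panel_indicates_cancer, the embodiment labels, and the representation of each call as "higher", "lower", or None are assumptions made for this sketch.

```python
from typing import Dict, Optional


def panel_indicates_cancer(
    calls: Dict[str, Optional[str]], embodiment: str
) -> bool:
    """Apply one of the three panel rules to per-marker calls.

    `calls` maps a marker label to "higher", "lower", or None (no
    significant difference from the corresponding normal level).
    """
    n_higher = sum(call == "higher" for call in calls.values())
    n_lower = sum(call == "lower" for call in calls.values())
    if embodiment == "enhanced":
        # More than one marker is significantly enhanced.
        return n_higher > 1
    if embodiment == "lower":
        # Each marker of the plurality is significantly lower.
        return len(calls) > 0 and n_lower == len(calls)
    if embodiment == "mixed":
        # One or more markers enhanced and one or more markers lower.
        return n_higher >= 1 and n_lower >= 1
    raise ValueError("embodiment must be 'enhanced', 'lower', or 'mixed'")
```
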

In order to maximize the sensitivity of the compositions, kits, and methods of the
invention (i.e. by reducing interference attributable to cells of non-ovarian origin in a
patient sample), it is preferable that the marker of the invention used therein be a marker
which has a restricted tissue distribution, e.g., normally not expressed in a non-epithelial
tissue, and more preferably a marker which is normally not expressed in a non-ovarian
tissue.

Only a small number of markers are known to be associated with ovarian cancers
(e.g. AKT2, K-RAS, ERBB2, c-MYC, RB1, and TP53; Lynch, supra). These markers are
not, of course, included among the markers of the invention, although they may be used
together with one or more markers of the invention in a panel of markers, for example.
It is well known that certain types of genes, such as oncogenes, tumor suppressor genes,
growth factor-like genes, protease-like genes, and protein kinase-like genes are often
involved with development of cancers of various types. Thus, among the markers of the
invention, use of those which correspond to proteins which resemble known proteins
encoded by known oncogenes and tumor suppressor genes, and those which correspond
to proteins which resemble growth factors, proteases, and protein kinases are preferred.

Known oncogenes and tumor suppressor genes include, for example, abl, abr,
akt2, apc, bcl2α, bcl2β, bcl3, bcr, brca1, brca2, cbl, ccnd1, cdc42, cdk4, crk-II,
csf1r/fms, dbl, dcc, dpc4/smad4, e-cad, e2f1/rbap, egfr/erbb-1, elk1, elk3, eph, erg, ets1,
ets2, fer, fgr/src2, fli1/ergb2, fos, fps/fes, fra1, fra2, fyn, hck, hek, her2/erbb-2/neu,
her3/erbb-3, her4/erbb-4, hras1, hst2, hstf1, igfbp2, ink4a, ink4b, int2/fgf3, jun, junb,
jund, kip2, kit, kras2a, kras2b, lck, lyn, mas, max, mcc, mdm2, met, mlh1, mmp10, mos,
msh2, msh3, msh6, myb, myba, mybb, myc, mycl1, mycn, nf1, nf2, nme2, nras, p53,
pdgfb, phb, pim1, pms1, pms2, ptc, pten, raf1, rap1a, rb1, rel, ret, ros1, ski, src1, tal1,
tgfbr2, tgfb3, tgfbr3, thra1, thrb, tiam1, timp3, tjp1, tp53, trk, vav, vhl, vil2, waf1, wnt1,
wnt2, wt1, and yes1 (Hesketh, 1997, In: The Oncogene and Tumour Suppressor Gene
Facts Book, 2nd Ed., Academic Press; Fishel et al., 1994, Science 266:1403-1405).

Known growth factors include platelet-derived growth factor alpha,
platelet-derived growth factor beta (simian sarcoma viral {v-sis} oncogene homolog),
thrombopoietin (myeloproliferative leukemia virus oncogene ligand, megakaryocyte
growth and development factor), erythropoietin, B cell growth factor, macrophage
stimulating factor 1 (hepatocyte growth factor-like protein), hepatocyte growth factor
(hepapoietin A), insulin-like growth factor 1 (somatomedin C), hepatoma-derived
growth factor, amphiregulin (schwannoma-derived growth factor), bone morphogenetic
proteins 1, 2, 3, 3 beta, and 4, bone morphogenetic protein 7 (osteogenic protein 1), bone
morphogenetic protein 8 (osteogenic protein 2), connective tissue growth factor,
connective tissue activation peptide 3, epidermal growth factor (EGF),
teratocarcinoma-derived growth factor 1, endothelin, endothelin 2, endothelin 3, stromal cell-derived
factor 1, vascular endothelial growth factor (VEGF), VEGF-B, VEGF-C, placental
growth factor (vascular endothelial growth factor-related protein), transforming growth
factor alpha, transforming growth factor beta 1 and its precursors, transforming growth
factor beta 2 and its precursors, fibroblast growth factor 1 (acidic), fibroblast growth
factor 2 (basic), fibroblast growth factor 5 and its precursors, fibroblast growth factor 6
and its precursors, fibroblast growth factor 7 (keratinocyte growth factor), fibroblast
growth factor 8 (androgen-induced), fibroblast growth factor 9 (glia-activating factor),
pleiotrophin (heparin binding growth factor 8, neurite growth-promoting factor 1),
brain-derived neurotrophic factor, and recombinant glial growth factor 2.

Known proteases include interleukin-1 beta convertase and its precursors, Mch6
and its precursors, Mch2 isoform alpha, Mch4, Cpp32 isoform alpha, Lice2 gamma
cysteine protease, Ich-1S, Ich-1L, Ich-2 and its precursors, TY protease, matrix
metalloproteinase 1 (interstitial collagenase), matrix metalloproteinase 2 (gelatinase A,
72kD gelatinase, 72kD type IV collagenase), matrix metalloproteinase 7 (matrilysin),
matrix metalloproteinase 8 (neutrophil collagenase), matrix metalloproteinase 12
(macrophage elastase), matrix metalloproteinase 13 (collagenase 3), metallopeptidase 1,
cysteine-rich metalloprotease (disintegrin) and its precursors, subtilisin-like protease Pc8
and its precursors, chymotrypsin, snake venom-like protease, cathepsin 1, cathepsin D
(lysosomal aspartyl protease), stromelysin, aminopeptidase N, plasminogen, tissue
plasminogen activator, plasminogen activator inhibitor type II, and urokinase-type
plasminogen activator.

Known protein kinases include DAP kinase, serine/threonine protein kinases
NIK, PK428, Krs-2, SAK, and EMK, interferon-inducible double stranded RNA
dependent protein kinase, FAST kinase, AIM1, IPL1-like midbody-associated protein
kinase-1, NIMA-like protein kinase 1 (NLK1), the cyclin-dependent kinases (cdk1-10),
checkpoint kinase Chk1, Nek3 protein kinase, BMK1 beta kinase, Clk1, Clk2, Clk3,
extracellular signal-regulated kinases 1, 3, and 6, cdc28 protein kinase 1, cdc28 protein
kinase 2, pLK, Myt1, c-Jun N-terminal kinase 2, Cam kinase 1, the MAP kinases,
insulin-stimulated protein kinase 1, beta-adrenergic receptor kinase 2, ribosomal protein
S6 kinase, kinase suppressor of ras-1 (KSR1), putative serine/threonine protein kinase
Prk, PkB kinase, cAMP-dependent protein kinase, cGMP-dependent protein kinase, type
II cGMP-dependent protein kinase, protein kinases Dyrk2, Dyrk3, and Dyrk4,
Rho-associated coiled-coil containing protein kinase p160ROCK, protein tyrosine kinase
t-Ror1, Ste20-related kinases, cell adhesion kinase beta, protein kinase 3, stress-activated
protein kinase 4, protein kinase Zpk, serine kinase hPAK65, dual specificity
mitogen-activated protein kinases 1 and 2, casein kinase I gamma 2, p21-activated protein kinase
Pak1, lipid-activated protein kinase PRK2, focal adhesion kinase, dual-specificity
tyrosine-phosphorylation regulated kinase, myosin light chain kinase, serine kinases
SRPK2, TESK1, and VRK2, B lymphocyte serine/threonine protein kinase,
stress-activated protein kinases JNK1 and JNK2, phosphorylase kinase, protein tyrosine kinase
Tec, Jak2 kinase, protein kinase Ndr, MEK kinase 3, SHB adaptor protein (a Src
homology 2 protein), agammaglobulinaemia protein-tyrosine kinase (Atk), protein
kinase ATR, guanylate kinase 1, thrombopoietin receptor and its precursors, DAG
kinase epsilon, and kinases encoded by oncogenes or viral oncogenes such as v-fgr
(Gardner-Rasheed), v-abl (Abelson murine leukemia viral oncogene homolog 1), v-arg
(Abelson murine leukemia viral oncogene homolog, Abelson-related gene), v-fes and
v-fps (feline sarcoma viral oncogene and Fujinami avian sarcoma viral oncogene
homologs), proto-oncogene c-cot, oncogene pim-1, and oncogene mas1.

It is recognized that the compositions, kits, and methods of the invention will be
of particular utility to patients having an enhanced risk of developing ovarian cancer and
their medical advisors. Patients recognized as having an enhanced risk of developing
ovarian cancer include, for example, patients having a familial history of ovarian cancer,
patients identified as having a mutant oncogene (i.e. at least one allele), and patients of
advancing age (i.e. women older than about 50 or 60 years).

The level of expression of a marker in normal (i.e. non-cancerous) human
ovarian tissue can be assessed in a variety of ways. In one embodiment, this normal
level of expression is assessed by assessing the level of expression of the marker in a
portion of ovarian cells which appears to be non-cancerous and by comparing this
normal level of expression with the level of expression in a portion of the ovarian cells
which is suspected of being cancerous. For example, when laparoscopy or other
medical procedure reveals the presence of a lump on one portion of a patient's ovary,
but not on another portion of the same ovary or on the other ovary, the normal level of
expression of a marker may be assessed using one or both of the non-affected ovary and
a non-affected portion of the affected ovary, and this normal level of expression may be
compared with the level of expression of the same marker in an affected portion (i.e. the
lump) of the affected ovary. Alternately, and particularly as further information
becomes available as a result of routine performance of the methods described herein,
population-average values for normal expression of the markers of the invention may be
used. In other embodiments, the 'normal' level of expression of a marker may be
determined by assessing expression of the marker in a patient sample obtained from a
non-cancer-afflicted patient, from a patient sample obtained from a patient before the
suspected onset of ovarian cancer in the patient, from archived patient samples, and the
like.

The invention includes compositions, kits, and methods for assessing the
presence of ovarian cancer cells in a sample (e.g. an archived tissue sample or a sample
obtained from a patient). These compositions, kits, and methods are substantially the
same as those described above, except that, where necessary, the compositions, kits, and
methods are adapted for use with samples other than patient samples. For example,
when the sample to be used is a paraffinized, archived human tissue sample, it can be
necessary to adjust the ratio of compounds in the compositions of the invention, in the
kits of the invention, or the methods used to assess levels of marker expression in the
sample. Such methods are well known in the art and within the skill of the ordinary
artisan.

The invention includes a kit for assessing the presence of ovarian cancer cells
(e.g. in a sample such as a patient sample). The kit comprises a plurality of reagents,
each of which is capable of binding specifically with a nucleic acid or polypeptide
corresponding to a marker of the invention. Suitable reagents for binding with a
polypeptide corresponding to a marker of the invention include antibodies, antibody
derivatives, antibody fragments, and the like. Suitable reagents for binding with a
nucleic acid (e.g. a genomic DNA, an mRNA, a spliced mRNA, a cDNA, or the like)
include complementary nucleic acids. For example, the nucleic acid reagents may
include oligonucleotides (labeled or non-labeled) fixed to a substrate, labeled
oligonucleotides not bound with a substrate, pairs of PCR primers, molecular beacon
probes, and the like.

The kit of the invention may optionally comprise additional components useful
for performing the methods of the invention. By way of example, the kit may comprise
fluids (e.g. SSC buffer) suitable for annealing complementary nucleic acids or for
binding an antibody with a protein with which it specifically binds, one or more sample
compartments, an instructional material which describes performance of a method of the
invention, a sample of normal ovarian cells, a sample of ovarian cancer cells, and the
like.

The invention also includes a method of making an isolated hybridoma which 
produces an antibody useful for assessing whether a patient is afflicted with an ovarian
cancer. In this method, a protein corresponding to a marker of the invention is isolated
(e.g. by purification from a cell in which it is expressed or by transcription and
translation of a nucleic acid encoding the protein in vivo or in vitro using known
methods). A vertebrate, preferably a mammal such as a mouse, rat, rabbit, or sheep, is
immunized using the isolated protein. The vertebrate may optionally (and preferably) be
immunized at least one additional time with the isolated protein, so that the vertebrate
exhibits a robust immune response to the protein. Splenocytes are isolated from the
immunized vertebrate and fused with an immortalized cell line to form hybridomas,
using any of a variety of methods well known in the art. Hybridomas formed in this
manner are then screened using standard methods to identify one or more hybridomas
which produce an antibody which specifically binds with the protein. The invention 
also includes hybridomas made by this method and antibodies made using such 
hybridomas. 

The invention also includes a method of assessing the efficacy of a test
compound for inhibiting ovarian cancer cells. As described above, differences in the
level of expression of the markers of the invention correlate with the cancerous state of
ovarian cells. Although it is recognized that changes in the levels of expression of
certain of the markers of the invention likely result from the cancerous state of ovarian
cells, it is likewise recognized that changes in the levels of expression of other of the
markers of the invention induce, maintain, and promote the cancerous state of those
cells. Thus, compounds which inhibit an ovarian cancer in a patient will cause the level
of expression of one or more of the markers of the invention to change to a level nearer
the normal level of expression for that marker (i.e. the level of expression for the marker
in non-cancerous ovarian cells).

This method thus comprises comparing expression of a marker in a first ovarian
cell sample maintained in the presence of the test compound and expression of the
marker in a second ovarian cell sample maintained in the absence of the test
compound. A significant alteration in the level of expression of a marker listed in
Tables 1-2 is an indication that the test compound inhibits ovarian cancer. The ovarian
cell samples may, for example, be aliquots of a single sample of normal ovarian cells
obtained from a patient, pooled samples of normal ovarian cells obtained from a patient,
cells of a normal ovarian cell line, aliquots of a single sample of ovarian cancer cells
obtained from a patient, pooled samples of ovarian cancer cells obtained from a patient,
cells of an ovarian cancer cell line, or the like. In one embodiment, the samples are
ovarian cancer cells obtained from a patient and a plurality of compounds known to be 
effective for inhibiting various ovarian cancers are tested in order to identify the 
compound which is likely to best inhibit the ovarian cancer in the patient. 
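
By way of illustration only, the comparison described above can be sketched in Python. The marker names, the expression values, and the helper functions `log2_ratio` and `moves_toward_normal` are hypothetical and are not taken from Tables 1-2; the sketch assumes that expression levels have already been measured as positive numbers in arbitrary units.

```python
from math import log2

def log2_ratio(treated, untreated):
    """Log2 ratio of marker expression with vs. without the test compound."""
    return log2(treated / untreated)

def moves_toward_normal(treated, untreated, normal):
    """True when the treated level is nearer the normal level than the untreated level."""
    return abs(treated - normal) < abs(untreated - normal)

# Hypothetical expression levels in arbitrary units (illustrative only).
samples = {
    "marker_A": {"treated": 2.0, "untreated": 8.0, "normal": 1.0},
    "marker_B": {"treated": 5.0, "untreated": 5.0, "normal": 1.0},
}

# For each marker: (log2 ratio, whether expression moved toward the normal level).
summary = {
    name: (
        log2_ratio(v["treated"], v["untreated"]),
        moves_toward_normal(v["treated"], v["untreated"], v["normal"]),
    )
    for name, v in samples.items()
}
```

In this sketch, hypothetical marker_A moves toward its normal level in the presence of the compound, whereas marker_B is unchanged. What counts as a "significant" alteration is not modeled here and would be set by the assay used.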

This method may likewise be used to assess the efficacy of a therapy for
inhibiting ovarian cancer in a patient. In this method, the level of expression of one or
more markers of the invention in a pair of samples (one subjected to the therapy, the
other not subjected to the therapy) is assessed. As with the method of assessing the
efficacy of test compounds, if the therapy induces a significant alteration in the level of
expression of a marker listed in Tables 1-2 then the therapy is efficacious for inhibiting
ovarian cancer. As above, if samples from a selected patient are used in this method,
then alternative therapies can be assessed in vitro in order to select a therapy most likely
to be efficacious for inhibiting ovarian cancer in the patient.

As described herein, ovarian cancer in patients is associated with an alteration
in the level of expression of one or more markers listed in Tables 1-2. While, as
discussed above, some of these changes in expression level result from occurrence of the
ovarian cancer, others of these changes induce, maintain, and promote the cancerous
state of ovarian cancer cells. Thus, ovarian cancer characterized by an increase in the
level of expression of one or more markers listed in either or both of Tables 1 and 2 can be
inhibited by inhibiting expression of those markers.

Expression of a marker listed in Tables 1-2 can be inhibited in a number of ways 
generally known in the art. For example, an antisense oligonucleotide can be provided
to the ovarian cancer cells in order to inhibit transcription, translation, or both, of the
marker(s). Alternatively, a polynucleotide encoding an antibody, an antibody derivative,
or an antibody fragment, and operably linked with an appropriate promoter/regulator 
region, can be provided to the cell in order to generate intracellular antibodies which 
will inhibit the function or activity of the protein corresponding to the marker(s). Using
the methods described herein, a variety of molecules, particularly including molecules
sufficiently small that they are able to cross the cell membrane, can be screened in order 
to identify molecules which inhibit expression of the marker(s). The compound so 
identified can be provided to the patient in order to inhibit expression of the marker(s) in 
the ovarian cancer cells of the patient. 

Expression of a marker listed in Tables 1-2 can be enhanced in a number of ways
generally known in the art. For example, a polynucleotide encoding the marker and
operably linked with an appropriate promoter/regulator region can be provided to
ovarian cancer cells of the patient in order to induce enhanced expression of the protein
(and mRNA) corresponding to the marker therein. Alternatively, if the protein is
capable of crossing the cell membrane, inserting itself in the cell membrane, or is
normally a secreted protein, then expression of the protein can be enhanced by providing
the protein (e.g. directly or by way of the bloodstream or another ovary-associated fluid)
to ovarian cancer cells in the patient.

As described above, the cancerous state of human ovarian cells is correlated with 
changes in the levels of expression of the markers of the invention. The invention
includes a method for assessing the human ovarian cell carcinogenic potential of a test
compound. This method comprises maintaining separate aliquots of human ovarian 
cells in the presence and absence of the test compound. Expression of a marker of the 
invention in each of the aliquots is compared. A significant alteration in the level of 
expression of a marker listed in Tables 1-2 in the aliquot maintained in the presence of
the test compound (relative to the aliquot maintained in the absence of the test
compound) is an indication that the test compound possesses human ovarian cell 
carcinogenic potential. The relative carcinogenic potentials of various test compounds 
can be assessed by comparing the degree of enhancement or inhibition of the level of 
expression of the relevant markers, by comparing the number of markers for which the
level of expression is enhanced or inhibited, or by comparing both.
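
The ranking of test compounds described above, by number of altered markers, by degree of alteration, or by both, can be sketched as follows. This is a minimal illustration under stated assumptions: the compound names, marker names, expression values, and the two-fold threshold (`threshold=1.0` on a log2 scale) are all hypothetical, and the functions `altered_markers` and `rank_compounds` are introduced here only for the example.

```python
from math import log2

def altered_markers(treated, control, threshold=1.0):
    """Return {marker: log2 ratio} for markers whose |log2 ratio| meets the threshold."""
    changes = {}
    for marker, control_level in control.items():
        ratio = log2(treated[marker] / control_level)
        if abs(ratio) >= threshold:
            changes[marker] = ratio
    return changes

def rank_compounds(profiles, control, threshold=1.0):
    """Rank compounds by number of altered markers, then by total magnitude of change."""
    scored = []
    for compound, treated in profiles.items():
        changes = altered_markers(treated, control, threshold)
        total_magnitude = sum(abs(r) for r in changes.values())
        scored.append((compound, len(changes), total_magnitude))
    return sorted(scored, key=lambda row: (row[1], row[2]), reverse=True)

# Hypothetical expression levels (arbitrary units) in the absence of any compound.
control = {"m1": 1.0, "m2": 2.0, "m3": 4.0}

# Hypothetical expression levels in aliquots maintained with each test compound.
profiles = {
    "compound_X": {"m1": 4.0, "m2": 2.0, "m3": 1.0},
    "compound_Y": {"m1": 2.0, "m2": 2.0, "m3": 4.0},
}

ranking = rank_compounds(profiles, control)
```

Sorting first on the count and then on the summed magnitude is one possible way to combine the two comparisons; other weightings would be equally consistent with the description.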

Various aspects of the invention are described in further detail in the following 
subsections. 



I. Isolated Nucleic Acid Molecules

One aspect of the invention pertains to novel isolated nucleic acid molecules that 
correspond to a marker of the invention, including nucleic acids which encode a 
polypeptide corresponding to a marker of the invention or a portion of such a 
polypeptide. Isolated nucleic acids of the invention also include nucleic acid molecules 
sufficient for use as hybridization probes to identify nucleic acid molecules that 
correspond to a marker of the invention, including nucleic acids which encode a 
polypeptide corresponding to a marker of the invention, and fragments of such nucleic 
acid molecules, e.g., those suitable for use as PCR primers for the amplification or
mutation of nucleic acid molecules. As used herein, the term "nucleic acid molecule" is
intended to include DNA molecules (e.g., cDNA or genomic DNA) and RNA molecules
(e.g., mRNA) and analogs of the DNA or RNA generated using nucleotide analogs. The
nucleic acid molecule can be single-stranded or double-stranded, but preferably is 
double-stranded DNA.

An "isolated" nucleic acid molecule is one which is separated from other nucleic
acid molecules which are present in the natural source of the nucleic acid molecule.
Preferably, an "isolated" nucleic acid molecule is free of sequences (preferably
protein-encoding sequences) which naturally flank the nucleic acid (i.e., sequences located at the
5' and 3' ends of the nucleic acid) in the genomic DNA of the organism from which the
nucleic acid is derived. For example, in various embodiments, the isolated nucleic acid 
molecule can contain less than about 5 kB, 4 kB, 3 kB, 2 kB, 1 kB, 0.5 kB or 0.1 kB of 
nucleotide sequences which naturally flank the nucleic acid molecule in genomic DNA 
of the cell from which the nucleic acid is derived. Moreover, an "isolated" nucleic acid
molecule, such as a cDNA molecule, can be substantially free of other cellular material,
or culture medium when produced by recombinant techniques, or substantially free of 
chemical precursors or other chemicals when chemically synthesized. 

A nucleic acid molecule of the present invention can be isolated using standard 
molecular biology techniques and the sequence information in the database records
described herein. Using all or a portion of such nucleic acid sequences, nucleic acid
molecules of the invention can be isolated using standard hybridization and cloning
techniques (e.g., as described in Sambrook et al., ed., Molecular Cloning: A Laboratory
Manual, 2nd ed, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 
1989). 

A nucleic acid molecule of the invention can be amplified using cDNA, mRNA,
or genomic DNA as a template and appropriate oligonucleotide primers according to
standard PCR amplification techniques. The nucleic acid so amplified can be cloned 
into an appropriate vector and characterized by DNA sequence analysis. Furthermore, 
oligonucleotides corresponding to all or a portion of a nucleic acid molecule of the
invention can be prepared by standard synthetic techniques, e.g., using an automated
DNA synthesizer. 

In another preferred embodiment, an isolated nucleic acid molecule of the 
invention comprises a nucleic acid molecule which has a nucleotide sequence 
complementary to the nucleotide sequence of a nucleic acid corresponding to a marker 
of the invention or to the nucleotide sequence of a nucleic acid encoding a protein which
corresponds to a marker of the invention. A nucleic acid molecule which is
complementary to a given nucleotide sequence is one which is sufficiently
complementary to the given nucleotide sequence that it can hybridize to the given
nucleotide sequence thereby forming a stable duplex. 
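
As a simple illustration of complementarity, the following Python sketch computes the reverse complement of a DNA sequence and checks whether two strands are fully complementary. The function names and the example sequence are hypothetical. The sketch tests only perfect Watson-Crick complementarity, which is a stricter criterion than the "sufficiently complementary" standard described above, and it does not model duplex stability.

```python
# Translation table mapping each DNA base to its Watson-Crick partner.
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")

def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return seq.translate(COMPLEMENT)[::-1]

def is_fully_complementary(strand_a, strand_b):
    """True when strand_b (5' to 3') pairs with every base of strand_a (5' to 3')."""
    return reverse_complement(strand_a).upper() == strand_b.upper()

# Hypothetical six-base target and its complementary strand.
target = "ATGGCC"
complementary_strand = reverse_complement(target)
```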

Moreover, a nucleic acid molecule of the invention can comprise only a portion 
of a nucleic acid sequence, wherein the full length nucleic acid sequence comprises a 
marker of the invention or which encodes a polypeptide corresponding to a marker of
the invention. Such nucleic acids can be used, for example, as a probe or primer. The 
probe/primer typically is used as one or more substantially purified oligonucleotides. 
The oligonucleotide typically comprises a region of nucleotide sequence that hybridizes 
under stringent conditions to at least about 7, preferably about 15, more preferably about
25, 50, 75, 100, 125, 150, 175, 200, 250, 300, 350, or 400 or more consecutive
nucleotides of a nucleic acid of the invention. 

Probes based on the sequence of a nucleic acid molecule of the invention can be 
used to detect transcripts or genomic sequences corresponding to one or more markers 
of the invention. The probe comprises a label group attached thereto, e.g., a
radioisotope, a fluorescent compound, an enzyme, or an enzyme co-factor. Such probes
can be used as part of a diagnostic test kit for identifying cells or tissues which
mis-express the protein, such as by measuring levels of a nucleic acid molecule encoding the
protein in a sample of cells from a subject, e.g., detecting mRNA levels or determining 
whether a gene encoding the protein has been mutated or deleted. 

The invention further encompasses nucleic acid molecules that differ, due to
degeneracy of the genetic code, from the nucleotide sequence of nucleic acids encoding
a protein which corresponds to a marker of the invention, and thus encode the same 
protein. 
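
Degeneracy of the genetic code can be illustrated with a short Python sketch in which two different nucleotide sequences are translated into the same amino acid sequence. The codon table below is deliberately partial (only the codons needed for the example, taken from the standard genetic code), and the two coding sequences are hypothetical and do not correspond to any marker of the invention.

```python
# Partial standard codon table: only the codons used in this illustration.
CODON_TABLE = {
    "ATG": "M",                                      # methionine (start)
    "GCT": "A", "GCC": "A", "GCA": "A", "GCG": "A",  # alanine
    "AAA": "K", "AAG": "K",                          # lysine
    "TAA": "*", "TAG": "*", "TGA": "*",              # stop codons
}

def translate(dna):
    """Translate a DNA coding sequence codon by codon, ignoring a trailing partial codon."""
    usable_length = len(dna) - len(dna) % 3
    codons = [dna[i:i + 3] for i in range(0, usable_length, 3)]
    return "".join(CODON_TABLE[codon] for codon in codons)

# Two hypothetical coding sequences that differ at three codons.
sequence_1 = "ATGGCTAAATAA"
sequence_2 = "ATGGCGAAGTGA"

protein_1 = translate(sequence_1)
protein_2 = translate(sequence_2)
```

Although the two sequences differ at several nucleotide positions, both encode the same short peptide in this example.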



It will be appreciated by those skilled in the art that DNA sequence
polymorphisms that lead to changes in the amino acid sequence can exist within a
population (e.g., the human population). Such genetic polymorphisms can exist among
individuals within a population due to natural allelic variation. An allele is one of a
group of genes which occur alternatively at a given genetic locus. In addition, it will be
appreciated that DNA polymorphisms that affect RNA expression levels can also exist
that may affect the overall expression level of that gene (e.g., by affecting regulation or
degradation).




WO 01/070979 



PCT/US01/09126 






As used herein, the phrase "allelic variant" refers to a nucleotide sequence which 
occurs at a given locus or to a polypeptide encoded by the nucleotide sequence. 

As used herein, the terms "gene" and "recombinant gene" refer to nucleic acid 
molecules comprising an open reading frame encoding a polypeptide corresponding to a 
marker of the invention. Such natural allelic variations can typically result in 1-5%
variance in the nucleotide sequence of a given gene. Alternative alleles can be identified 
by sequencing the gene of interest in a number of different individuals. This can be 
readily carried out by using hybridization probes to identify the same genetic locus in a 
variety of individuals. Any and all such nucleotide variations and resulting amino acid
polymorphisms or variations that are the result of natural allelic variation and that do not
alter the functional activity are intended to be within the scope of the invention. 

In another embodiment, an isolated nucleic acid molecule of the invention is at 
least 7, 15, 20, 25, 30, 40, 60, 80, 100, 150, 200, 250, 300, 350, 400, 450, 550, 650, 700, 
800, 900, 1000, 1200, 1400, 1600, 1800, 2000, 2200, 2400, 2600, 2800, 3000, 3500,
4000, 4500, or more nucleotides in length and hybridizes under stringent conditions to a
nucleic acid corresponding to a marker of the invention or to a nucleic acid encoding a 
protein corresponding to a marker of the invention. As used herein, the term "hybridizes 
under stringent conditions" is intended to describe conditions for hybridization and 
washing under which nucleotide sequences at least 75% (80%, 85%, preferably 90%)
identical to each other typically remain hybridized to each other. Such stringent
conditions are known to those skilled in the art and can be found in sections 6.3.1-6.3.6
of Current Protocols in Molecular Biology, John Wiley & Sons, N.Y. (1989). A
preferred, non-limiting example of stringent hybridization conditions for annealing two
single-stranded DNAs, each of which is at least about 100 bases in length, and/or for
annealing a single-stranded DNA and a single-stranded RNA, each of which is at least
about 100 bases in length, is hybridization in 6X sodium chloride/sodium citrate (SSC)
at about 45°C, followed by one or more washes in 0.2X SSC, 0.1% SDS at 50-65°C.
Further preferred hybridization conditions are taught in Lockhart et al., Nature
Biotechnology, Volume 14, 1996 August: 1675-1680; Breslauer et al., Proc. Natl. Acad.
Sci. USA, Volume 83, 1986 June: 3746-3750; Van Ness et al., Nucleic Acids Research,
Volume 19, No. 19, 1991 September: 5143-5151; McGraw et al., BioTechniques,
Volume 8, No. 6, 1990: 674-678; and Milner et al., Nature Biotechnology, Volume 15,
1997 June: 537-541, all expressly incorporated by reference.
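
The percent identity figures recited above can be illustrated with a minimal Python sketch. It assumes that the two sequences have already been aligned without gaps and are of equal length; the function name `percent_identity` and the example sequences are hypothetical. The calculation says nothing about hybridization behavior itself, which also depends on temperature, salt concentration, and sequence composition.

```python
def percent_identity(seq_a, seq_b):
    """Percent of aligned positions at which two equal-length sequences match."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be pre-aligned to equal length")
    if not seq_a:
        raise ValueError("sequences must not be empty")
    matches = sum(a == b for a, b in zip(seq_a.upper(), seq_b.upper()))
    return 100.0 * matches / len(seq_a)

# Hypothetical pre-aligned sequences differing at one of eight positions.
identity = percent_identity("ACGTACGT", "ACGTACGA")

# Compare against the 75% figure recited in the text.
meets_75_percent = identity >= 75.0
```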

In addition to naturally-occurring allelic variants of a nucleic acid molecule of 
the invention that can exist in the population, the skilled artisan will further appreciate
that sequence changes can be introduced by mutation thereby leading to changes in the
amino acid sequence of the encoded protein, without altering the biological activity of 
the protein encoded thereby. For example, one can make nucleotide substitutions 
leading to amino acid substitutions at "non-essential" amino acid residues. A
"non-essential" amino acid residue is a residue that can be altered from the wild-type
sequence without altering the biological activity, whereas an "essential" amino acid
residue is required for biological activity. For example, amino acid residues that are not
conserved or only semi-conserved among homologs of various species may be
non-essential for activity and thus would be likely targets for alteration. Alternatively, amino
acid residues that are conserved among the homologs of various species (e.g., murine
and human) may be essential for activity and thus would not be likely targets for
alteration. 

Accordingly, another aspect of the invention pertains to nucleic acid molecules 
encoding a polypeptide of the invention that contain changes in amino acid residues that 
are not essential for activity. Such polypeptides differ in amino acid sequence from the 
naturally-occurring proteins which correspond to the markers of the invention, yet retain
biological activity. In one embodiment, such a protein has an amino acid sequence that
is at least about 40%, 50%, 60%, 70%, 80%, 90%, 95%, or 98% identical to the
amino acid sequence of one of the proteins which correspond to the markers of the 
invention. 

An isolated nucleic acid molecule encoding a variant protein can be created by
introducing one or more nucleotide substitutions, additions or deletions into the
nucleotide sequence of nucleic acids of the invention, such that one or more amino acid
residue substitutions, additions, or deletions are introduced into the encoded protein.
Mutations can be introduced by standard techniques, such as site-directed mutagenesis
and PCR-mediated mutagenesis. Preferably, conservative amino acid substitutions are
made at one or more predicted non-essential amino acid residues. A "conservative
amino acid substitution" is one in which the amino acid residue is replaced with an
amino acid residue having a similar side chain. Families of amino acid residues having
similar side chains have been defined in the art. These families include amino acids
with basic side chains (e.g., lysine, arginine, histidine), acidic side chains (e.g., aspartic
acid, glutamic acid), uncharged polar side chains (e.g., glycine, asparagine, glutamine,
serine, threonine, tyrosine, cysteine), non-polar side chains (e.g., alanine, valine, leucine,
isoleucine, proline, phenylalanine, methionine, tryptophan), beta-branched side chains
(e.g., threonine, valine, isoleucine) and aromatic side chains (e.g., tyrosine,
phenylalanine, tryptophan, histidine). Alternatively, mutations can be introduced
randomly along all or part of the coding sequence, such as by saturation mutagenesis,
and the resultant mutants can be screened for biological activity to identify mutants that
retain activity. Following mutagenesis, the encoded protein can be expressed 
recombinantly and the activity of the protein can be determined. 
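
The side chain families listed above can be encoded directly, as in the following Python sketch, to check whether a proposed substitution is conservative in the sense described. The family membership follows the examples given in the text, using one-letter amino acid codes; the function names `shared_families` and `is_conservative` are hypothetical. Because the text gives the families by way of example, the table should not be read as exhaustive.

```python
# Side chain families as exemplified in the text (one-letter amino acid codes).
FAMILIES = {
    "basic": set("KRH"),
    "acidic": set("DE"),
    "uncharged polar": set("GNQSTYC"),
    "non-polar": set("AVLIPFMW"),
    "beta-branched": set("TVI"),
    "aromatic": set("YFWH"),
}

def shared_families(residue_a, residue_b):
    """Names of the families (sorted) that contain both residues."""
    return sorted(
        name
        for name, members in FAMILIES.items()
        if residue_a in members and residue_b in members
    )

def is_conservative(original, replacement):
    """True when two different residues share at least one side chain family."""
    return original != replacement and bool(shared_families(original, replacement))

# Lysine to arginine stays within the basic family; lysine to aspartic acid does not.
lys_to_arg = is_conservative("K", "R")
lys_to_asp = is_conservative("K", "D")
```

A residue may belong to more than one family (for example, threonine is both uncharged polar and beta-branched), so the sketch treats a substitution as conservative if any one family is shared.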

The present invention encompasses antisense nucleic acid molecules, i.e., 
molecules which are complementary to a sense nucleic acid of the invention, e.g.,
complementary to the coding strand of a double-stranded cDNA molecule
corresponding to a marker of the invention or complementary to an mRNA sequence
corresponding to a marker of the invention. Accordingly, an antisense nucleic acid of
the invention can hydrogen bond to (i.e. anneal with) a sense nucleic acid of the
invention. The antisense nucleic acid can be complementary to an entire coding strand,
or to only a portion thereof, e.g., all or part of the protein coding region (or open reading
frame). An antisense nucleic acid molecule can also be antisense to all or part of a
non-coding region of the coding strand of a nucleotide sequence encoding a polypeptide of
the invention. The non-coding regions ("5' and 3' untranslated regions") are the 5' and 3'
sequences which flank the coding region and are not translated into amino acids.

An antisense oligonucleotide can be, for example, about 5, 10, 15, 20, 25, 30, 35,
40, 45, or 50 or more nucleotides in length. An antisense nucleic acid of the invention
can be constructed using chemical synthesis and enzymatic ligation reactions using
procedures known in the art. For example, an antisense nucleic acid (e.g., an antisense
oligonucleotide) can be chemically synthesized using naturally occurring nucleotides or
variously modified nucleotides designed to increase the biological stability of the
molecules or to increase the physical stability of the duplex formed between the
antisense and sense nucleic acids, e.g., phosphorothioate derivatives and acridine
substituted nucleotides can be used. Examples of modified nucleotides which can be
used to generate the antisense nucleic acid include 5-fluorouracil, 5-bromouracil,
5-chlorouracil, 5-iodouracil, hypoxanthine, xanthine, 4-acetylcytosine,
5-(carboxyhydroxylmethyl) uracil, 5-carboxymethylaminomethyl-2-thiouridine,
5-carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosylqueosine, inosine,
N6-isopentenyladenine, 1-methylguanine, 1-methylinosine, 2,2-dimethylguanine,
2-methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine,
7-methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil,
beta-D-mannosylqueosine, 5'-methoxycarboxymethyluracil, 5-methoxyuracil,
2-methylthio-N6-isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil,
queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil,
5-methyluracil, uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid (v),
5-methyl-2-thiouracil, 3-(3-amino-3-N-2-carboxypropyl) uracil, (acp3)w, and 2,6-diaminopurine.
Alternatively, the antisense nucleic acid can be produced biologically using an
expression vector into which a nucleic acid has been sub-cloned in an antisense
orientation (i.e., RNA transcribed from the inserted nucleic acid will be of an antisense
orientation to a target nucleic acid of interest, described further in the following
subsection).

The antisense nucleic acid molecules of the invention are typically administered
to a subject or generated in situ such that they hybridize with or bind to cellular mRNA
and/or genomic DNA encoding a polypeptide corresponding to a selected marker of the
invention to thereby inhibit expression of the marker, e.g., by inhibiting transcription
and/or translation. The hybridization can be by conventional nucleotide
complementarity to form a stable duplex, or, for example, in the case of an antisense
nucleic acid molecule which binds to DNA duplexes, through specific interactions in the
major groove of the double helix. Examples of routes of administration of antisense
nucleic acid molecules of the invention include direct injection at a tissue site or
infusion of the antisense nucleic acid into an ovary-associated body fluid. Alternatively,
antisense nucleic acid molecules can be modified to target selected cells and then
administered systemically. For example, for systemic administration, antisense
molecules can be modified such that they specifically bind to receptors or antigens
expressed on a selected cell surface, e.g., by linking the antisense nucleic acid molecules
to peptides or antibodies which bind to cell surface receptors or antigens. The antisense
nucleic acid molecules can also be delivered to cells using the vectors described herein.
To achieve sufficient intracellular concentrations of the antisense molecules, vector
constructs in which the antisense nucleic acid molecule is placed under the control of a
strong pol II or pol III promoter are preferred.

An antisense nucleic acid molecule of the invention can be an α-anomeric
nucleic acid molecule. An α-anomeric nucleic acid molecule forms specific
double-stranded hybrids with complementary RNA in which, contrary to the usual β-units, the
strands run parallel to each other (Gaultier et al., 1987, Nucleic Acids Res.
15:6625-6641). The antisense nucleic acid molecule can also comprise a
2'-O-methylribonucleotide (Inoue et al., 1987, Nucleic Acids Res. 15:6131-6148) or a
chimeric RNA-DNA analogue (Inoue et al., 1987, FEBS Lett. 215:327-330).

The invention also encompasses ribozymes. Ribozymes are catalytic RNA 
molecules with ribonuclease activity which are capable of cleaving a single-stranded
nucleic acid, such as an mRNA, to which they have a complementary region. Thus,
ribozymes (e.g., hammerhead ribozymes as described in Haseloff and Gerlach, 1988,
Nature 334:585-591) can be used to catalytically cleave mRNA transcripts to thereby
inhibit translation of the protein encoded by the mRNA. A ribozyme having specificity
for a nucleic acid molecule encoding a polypeptide corresponding to a marker of the
invention can be designed based upon the nucleotide sequence of a cDNA
corresponding to the marker. For example, a derivative of a Tetrahymena L-19 IVS
RNA can be constructed in which the nucleotide sequence of the active site is
complementary to the nucleotide sequence to be cleaved (see Cech et al. U.S. Patent No.
4,987,071; and Cech et al. U.S. Patent No. 5,116,742). Alternatively, an mRNA
encoding a polypeptide of the invention can be used to select a catalytic RNA having a
specific ribonuclease activity from a pool of RNA molecules (see, e.g., Bartel and
Szostak, 1993, Science 261:1411-1418).

The invention also encompasses nucleic acid molecules which form triple helical 
structures. For example, expression of a polypeptide of the invention can be inhibited
by targeting nucleotide sequences complementary to the regulatory region of the gene
encoding the polypeptide (e.g., the promoter and/or enhancer) to form triple helical
structures that prevent transcription of the gene in target cells. See generally Helene
(1991) Anticancer Drug Des. 6(6):569-84; Helene (1992) Ann. N.Y. Acad. Sci. 660:27-36;
and Maher (1992) Bioessays 14(12):807-15.

In various embodiments, the nucleic acid molecules of the invention can be 
modified at the base moiety, sugar moiety or phosphate backbone to improve, e.g., the 
stability, hybridization, or solubility of the molecule. For example, the deoxyribose 
phosphate backbone of the nucleic acids can be modified to generate peptide nucleic 
acids (see Hyrup et al., 1996, Bioorganic & Medicinal Chemistry 4(1): 5-23). As used
herein, the terms "peptide nucleic acids" or "PNAs" refer to nucleic acid mimics, e.g., 
DNA mimics, in which the deoxyribose phosphate backbone is replaced by a 
pseudopeptide backbone and only the four natural nucleobases are retained. The neutral 
backbone of PNAs has been shown to allow for specific hybridization to DNA and RNA 
under conditions of low ionic strength. The synthesis of PNA oligomers can be 
performed using standard solid phase peptide synthesis protocols as described in Hyrup
et al. (1996), supra; Perry-O'Keefe et al. (1996) Proc. Natl. Acad. Sci. USA 93:14670-675.

PNAs can be used in therapeutic and diagnostic applications. For example, 
PNAs can be used as antisense or antigene agents for sequence-specific modulation of 
gene expression by, e.g., inducing transcription or translation arrest or inhibiting 
replication. PNAs can also be used, e.g., in the analysis of single base pair mutations in 
a gene by, e.g., PNA directed PCR clamping; as artificial restriction enzymes when used
in combination with other enzymes, e.g., S1 nucleases (Hyrup (1996), supra); or as
probes or primers for DNA sequence and hybridization (Hyrup, 1996, supra;
Perry-O'Keefe et al., 1996, Proc. Natl. Acad. Sci. USA 93:14670-675).

In another embodiment, PNAs can be modified, e.g., to enhance their stability or 
cellular uptake, by attaching lipophilic or other helper groups to PNA, by the formation 
of PNA-DNA chimeras, or by the use of liposomes or other techniques of drug delivery 
known in the art. For example, PNA-DNA chimeras can be generated which can 
combine the advantageous properties of PNA and DNA. Such chimeras allow DNA 
recognition enzymes, e.g., RNase H and DNA polymerases, to interact with the DNA
portion while the PNA portion would provide high binding affinity and specificity. 
PNA-DNA chimeras can be linked using linkers of appropriate lengths selected in terms 
of base stacking, number of bonds between the nucleobases, and orientation (Hyrup, 
1996, supra). The synthesis of PNA-DNA chimeras can be performed as described in
Hyrup (1996), supra, and Finn et al. (1996) Nucleic Acids Res. 24(17):3357-63. For
example, a DNA chain can be synthesized on a solid support using standard
phosphoramidite coupling chemistry and modified nucleoside analogs. Compounds
such as 5'-(4-methoxytrityl)amino-5'-deoxy-thymidine phosphoramidite can be used as a
link between the PNA and the 5' end of DNA (Mag et al., 1989, Nucleic Acids Res.
17:5973-88). PNA monomers are then coupled in a step-wise manner to produce a
chimeric molecule with a 5' PNA segment and a 3' DNA segment (Finn et al., 1996,
Nucleic Acids Res. 24(17):3357-63). Alternatively, chimeric molecules can be
synthesized with a 5' DNA segment and a 3' PNA segment (Peterser et al., 1975,
Bioorganic Med. Chem. Lett. 5:1119-11124).

In other embodiments, the oligonucleotide can include other appended groups 
such as peptides (e.g., for targeting host cell receptors in vivo), or agents facilitating 
transport across the cell membrane (see, e.g., Letsinger et al., 1989, Proc. Natl. Acad.
Sci. USA 86:6553-6556; Lemaitre et al., 1987, Proc. Natl. Acad. Sci. USA 84:648-652;
PCT Publication No. WO 88/09810) or the blood-brain barrier (see, e.g., PCT
Publication No. WO 89/10134). In addition, oligonucleotides can be modified with
hybridization-triggered cleavage agents (see, e.g., Krol et al., 1988, Bio/Techniques
6:958-976) or intercalating agents (see, e.g., Zon, 1988, Pharm. Res. 5:539-549). To
this end, the oligonucleotide can be conjugated to another molecule, e.g., a peptide,
hybridization triggered cross-linking agent, transport agent, hybridization-triggered
cleavage agent, etc.

The invention also includes molecular beacon nucleic acids having at least one 
region which is complementary to a nucleic acid of the invention, such that the
molecular beacon is useful for quantitating the presence of the nucleic acid of the
invention in a sample. A "molecular beacon" nucleic acid is a nucleic acid comprising a
pair of complementary regions and having a fluorophore and a fluorescent quencher
associated therewith. The fluorophore and quencher are associated with different
portions of the nucleic acid in such an orientation that when the complementary regions
are annealed with one another, fluorescence of the fluorophore is quenched by the
quencher. When the complementary regions of the nucleic acid are not annealed with
one another, fluorescence of the fluorophore is quenched to a lesser degree. Molecular
beacon nucleic acids are described, for example, in U.S. Patent 5,876,930. 

II. Isolated Proteins and Antibodies 
One aspect of the invention pertains to isolated proteins which correspond to
individual markers of the invention, and biologically active portions thereof, as well as
polypeptide fragments suitable for use as immunogens to raise antibodies directed
against a polypeptide corresponding to a marker of the invention. In one embodiment,
the native polypeptide corresponding to a marker can be isolated from cells or tissue
sources by an appropriate purification scheme using standard protein purification
techniques. In another embodiment, polypeptides corresponding to a marker of the
invention are produced by recombinant DNA techniques. Alternative to recombinant
expression, a polypeptide corresponding to a marker of the invention can be synthesized
chemically using standard peptide synthesis techniques.

An "isolated" or "purified" protein or biologically active portion thereof is
substantially free of cellular material or other contaminating proteins from the cell or
tissue source from which the protein is derived, or substantially free of chemical
precursors or other chemicals when chemically synthesized. The language
"substantially free of cellular material" includes preparations of protein in which the
protein is separated from cellular components of the cells from which it is isolated or
recombinantly produced. Thus, protein that is substantially free of cellular material
includes preparations of protein having less than about 30%, 20%, 10%, or 5% (by dry
weight) of heterologous protein (also referred to herein as a "contaminating protein").
When the protein or biologically active portion thereof is recombinantly produced, it is
also preferably substantially free of culture medium, i.e., culture medium represents less
than about 20%, 10%, or 5% of the volume of the protein preparation. When the protein
is produced by chemical synthesis, it is preferably substantially free of chemical
precursors or other chemicals, i.e., it is separated from chemical precursors or other
chemicals which are involved in the synthesis of the protein. Accordingly, such
preparations of the protein have less than about 30%, 20%, 10%, or 5% (by dry weight) of
chemical precursors or compounds other than the polypeptide of interest.

Biologically active portions of a polypeptide corresponding to a marker of the
invention include polypeptides comprising amino acid sequences sufficiently identical to
or derived from the amino acid sequence of the protein corresponding to the marker,
which include fewer amino acids than the full length protein, and exhibit at least one
activity of the corresponding full-length protein. Typically, biologically active portions
comprise a domain or motif with at least one activity of the corresponding protein. A
biologically active portion of a protein of the invention can be a polypeptide which is,
for example, 10, 25, 50, 100 or more amino acids in length. Moreover, other
biologically active portions, in which other regions of the protein are deleted, can be
prepared by recombinant techniques and evaluated for one or more of the functional
activities of the native form of a polypeptide of the invention.

Preferred polypeptides are encoded by the nucleotide sequences of Tables 1-2.
Other useful proteins are substantially identical (e.g., at least about 40%, preferably
50%, 60%, 70%, 80%, 90%, 95%, or 99%) to one of these sequences and retain the
functional activity of the corresponding naturally-occurring protein yet
differ in amino acid sequence due to natural allelic variation or mutagenesis.

To determine the percent identity of two amino acid sequences or of two nucleic
acids, the sequences are aligned for optimal comparison purposes (e.g., gaps can be
introduced in the sequence of a first amino acid or nucleic acid sequence for optimal
alignment with a second amino acid or nucleic acid sequence). The amino acid residues or
nucleotides at corresponding amino acid positions or nucleotide positions are then
compared. When a position in the first sequence is occupied by the same amino acid
residue or nucleotide as the corresponding position in the second sequence, then the
molecules are identical at that position. The percent identity between the two sequences
is a function of the number of identical positions shared by the sequences (i.e., %
identity = (# of identical positions/total # of positions (e.g., overlapping positions)) x 100).
In one embodiment the two sequences are the same length.
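
The percent identity calculation described above can be illustrated with a short
sketch. It assumes the two sequences have already been aligned and padded to equal
length, that "-" marks a gap, and that "overlapping positions" means alignment columns
in which neither sequence has a gap. The function name and the gap symbol are
illustrative choices, not taken from any particular alignment package.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity of two pre-aligned sequences of equal length.

    Assumption: only columns where neither sequence has a gap count as
    overlapping positions; only exact matches count as identical.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")

    # Keep only the columns in which both sequences have a residue.
    overlapping = [
        (x, y)
        for x, y in zip(aligned_a.upper(), aligned_b.upper())
        if x != gap and y != gap
    ]
    if not overlapping:
        return 0.0

    identical = sum(1 for x, y in overlapping if x == y)
    return 100.0 * identical / len(overlapping)
```

For example, two ungapped four-residue sequences that differ at one position give
3/4 x 100 = 75% identity under this definition.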

The determination of percent identity between two sequences can be
accomplished using a mathematical algorithm. A preferred, non-limiting example of a
mathematical algorithm utilized for the comparison of two sequences is the algorithm of
Karlin and Altschul (1990) Proc. Natl. Acad. Sci. USA 87:2264-2268, modified as in
Karlin and Altschul (1993) Proc. Natl. Acad. Sci. USA 90:5873-5877. Such an
algorithm is incorporated into the NBLAST and XBLAST programs of Altschul, et al
(1990) J. Mol. Biol. 215:403-410. BLAST nucleotide searches can be performed with
the NBLAST program, score = 100, wordlength = 12 to obtain nucleotide sequences
homologous to a nucleic acid molecule of the invention. BLAST protein searches can
be performed with the XBLAST program, score = 50, wordlength = 3 to obtain amino
acid sequences homologous to a protein molecule of the invention. To obtain gapped
alignments for comparison purposes, Gapped BLAST can be utilized as described in
Altschul et al (1997) Nucleic Acids Res. 25:3389-3402. Alternatively, PSI-Blast can be
used to perform an iterated search which detects distant relationships between
molecules. When utilizing BLAST, Gapped BLAST, and PSI-Blast programs, the
default parameters of the respective programs (e.g., XBLAST and NBLAST) can be
used. See http://www.ncbi.nlm.nih.gov. Another preferred, non-limiting example of a
mathematical algorithm utilized for the comparison of sequences is the algorithm of
Myers and Miller, (1988) CABIOS 4:11-17. Such an algorithm is incorporated into the
ALIGN program (version 2.0) which is part of the GCG sequence alignment software
package. When utilizing the ALIGN program for comparing amino acid sequences, a
PAM120 weight residue table, a gap length penalty of 12, and a gap penalty of 4 can be
used. Yet another useful algorithm for identifying regions of local sequence similarity
and alignment is the FASTA algorithm as described in Pearson and Lipman (1988)
Proc. Natl. Acad. Sci. USA 85:2444-2448. When using the FASTA algorithm for
comparing nucleotide or amino acid sequences, a PAM120 weight residue table can, for
example, be used with a k-tuple value of 2.

The percent identity between two sequences can be determined using techniques 
similar to those described above, with or without allowing gaps. In calculating percent
identity, only exact matches are counted.

The invention also provides chimeric or fusion proteins corresponding to a 
marker of the invention. As used herein, a "chimeric protein" or "fusion protein"
comprises all or part (preferably a biologically active part) of a polypeptide
corresponding to a marker of the invention operably linked to a heterologous
polypeptide (i.e., a polypeptide other than the polypeptide corresponding to the marker).
Within the fusion protein, the term "operably linked" is intended to indicate that the
polypeptide of the invention and the heterologous polypeptide are fused in-frame to each
other. The heterologous polypeptide can be fused to the amino-terminus or the
carboxyl-terminus of the polypeptide of the invention.

One useful fusion protein is a GST fusion protein in which a polypeptide 
corresponding to a marker of the invention is fused to the carboxyl terminus of GST
sequences. Such fusion proteins can facilitate the purification of a recombinant
polypeptide of the invention.

In another embodiment, the fusion protein contains a heterologous signal 
sequence at its amino terminus. For example, the native signal sequence of a
polypeptide corresponding to a marker of the invention can be removed and replaced
with a signal sequence from another protein. For example, the gp67 secretory sequence
of the baculovirus envelope protein can be used as a heterologous signal sequence
(Ausubel et al., ed., Current Protocols in Molecular Biology, John Wiley & Sons, NY,
1992). Other examples of eukaryotic heterologous signal sequences include the
secretory sequences of melittin and human placental alkaline phosphatase (Stratagene;
La Jolla, California). In yet another example, useful prokaryotic heterologous signal
sequences include the phoA secretory signal (Sambrook et al., supra) and the protein A
secretory signal (Pharmacia Biotech; Piscataway, New Jersey).

In yet another embodiment, the fusion protein is an immunoglobulin fusion 
protein in which all or part of a polypeptide corresponding to a marker of the invention
is fused to sequences derived from a member of the immunoglobulin protein family.
The immunoglobulin fusion proteins of the invention can be incorporated into
pharmaceutical compositions and administered to a subject to inhibit an interaction
between a ligand (soluble or membrane-bound) and a protein on the surface of a cell
(receptor), to thereby suppress signal transduction in vivo. The immunoglobulin fusion
protein can be used to affect the bioavailability of a cognate ligand of a polypeptide of
the invention. Inhibition of ligand/receptor interaction can be useful therapeutically,
both for treating proliferative and differentiative disorders and for modulating (e.g.,
promoting or inhibiting) cell survival. Moreover, the immunoglobulin fusion proteins of
the invention can be used as immunogens to produce antibodies directed against a
polypeptide of the invention in a subject, to purify ligands and in screening assays to
identify molecules which inhibit the interaction of receptors with ligands.

Chimeric and fusion proteins of the invention can be produced by standard
recombinant DNA techniques. In another embodiment, the fusion gene can be
synthesized by conventional techniques including automated DNA synthesizers.
Alternatively, PCR amplification of gene fragments can be carried out using anchor
primers which give rise to complementary overhangs between two consecutive gene
fragments which can subsequently be annealed and re-amplified to generate a chimeric
gene sequence (see, e.g., Ausubel et al, supra). Moreover, many expression vectors are
commercially available that already encode a fusion moiety (e.g., a GST polypeptide).
A nucleic acid encoding a polypeptide of the invention can be cloned into such an
expression vector such that the fusion moiety is linked in-frame to the polypeptide of the
invention.

A signal sequence can be used to facilitate secretion and isolation of the secreted 
protein or other proteins of interest. Signal sequences are typically characterized by a 
core of hydrophobic amino acids which are generally cleaved from the mature protein
during secretion in one or more cleavage events. Such signal peptides contain
processing sites that allow cleavage of the signal sequence from the mature proteins as
they pass through the secretory pathway. Thus, the invention pertains to the described
polypeptides having a signal sequence, as well as to polypeptides from which the signal
sequence has been proteolytically cleaved (i.e., the cleavage products). In one
embodiment, a nucleic acid sequence encoding a signal sequence can be operably linked
in an expression vector to a protein of interest, such as a protein which is ordinarily not
secreted or is otherwise difficult to isolate. The signal sequence directs secretion of the
protein, such as from a eukaryotic host into which the expression vector is transformed,
and the signal sequence is subsequently or concurrently cleaved. The protein can then
be readily purified from the extracellular medium by art-recognized methods.
Alternatively, the signal sequence can be linked to the protein of interest using a
sequence which facilitates purification, such as with a GST domain.

The present invention also pertains to variants of the polypeptides corresponding 
to individual markers of the invention. Such variants have an altered amino acid
sequence which can function as either agonists (mimetics) or as antagonists. Variants
can be generated by mutagenesis, e.g., discrete point mutation or truncation. An agonist
can retain substantially the same, or a subset, of the biological activities of the naturally
occurring form of the protein. An antagonist of a protein can inhibit one or more of the
activities of the naturally occurring form of the protein by, for example, competitively
binding to a downstream or upstream member of a cellular signaling cascade which
includes the protein of interest. Thus, specific biological effects can be elicited by
treatment with a variant of limited function. Treatment of a subject with a variant
having a subset of the biological activities of the naturally occurring form of the protein
can have fewer side effects in a subject relative to treatment with the naturally occurring
form of the protein.

Variants of a protein of the invention which function as either agonists
(mimetics) or as antagonists can be identified by screening combinatorial libraries of
mutants, e.g., truncation mutants, of the protein of the invention for agonist or antagonist
activity. In one embodiment, a variegated library of variants is generated by
combinatorial mutagenesis at the nucleic acid level and is encoded by a variegated gene
library. A variegated library of variants can be produced by, for example, enzymatically
ligating a mixture of synthetic oligonucleotides into gene sequences such that a
degenerate set of potential protein sequences is expressible as individual polypeptides,
or alternatively, as a set of larger fusion proteins (e.g., for phage display). There are a
variety of methods which can be used to produce libraries of potential variants of the
polypeptides of the invention from a degenerate oligonucleotide sequence. Methods for
synthesizing degenerate oligonucleotides are known in the art (see, e.g., Narang, 1983,
Tetrahedron 39:3; Itakura et al, 1984, Annu. Rev. Biochem. 53:323; Itakura et al, 1984,
Science 198:1056; Ike et al, 1983 Nucleic Acid Res. 11:477).
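
To make concrete what a "degenerate oligonucleotide sequence" encodes, the following
sketch enumerates every discrete sequence represented by one degenerate oligonucleotide.
It assumes the degenerate positions are written with the standard IUPAC nucleotide
ambiguity codes; that notation, and the function name, are assumptions made for
illustration and are not specified in the text above.

```python
from itertools import product

# Standard IUPAC nucleotide ambiguity codes (assumed notation).
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT",
    "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG",
    "N": "ACGT",
}


def expand_degenerate(oligo: str) -> list[str]:
    """Return every discrete sequence encoded by a degenerate oligonucleotide."""
    try:
        choices = [IUPAC[base] for base in oligo.upper()]
    except KeyError as err:
        raise ValueError(f"unknown nucleotide code: {err.args[0]}") from None
    # Cartesian product over the allowed bases at each position.
    return ["".join(bases) for bases in product(*choices)]
```

The size of the resulting set is the product of the number of bases allowed at each
position, so a degenerate codon written NNK expands to 4 x 4 x 2 = 32 discrete codons.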

In addition, libraries of fragments of the coding sequence of a polypeptide 
corresponding to a marker of the invention can be used to generate a variegated
population of polypeptides for screening and subsequent selection of variants. For
example, a library of coding sequence fragments can be generated by treating a double
stranded PCR fragment of the coding sequence of interest with a nuclease under
conditions wherein nicking occurs only about once per molecule, denaturing the double
stranded DNA, renaturing the DNA to form double stranded DNA which can include
sense/antisense pairs from different nicked products, removing single stranded portions
from reformed duplexes by treatment with S1 nuclease, and ligating the resulting
fragment library into an expression vector. By this method, an expression library can be
derived which encodes amino terminal and internal fragments of various sizes of the
protein of interest.

Several techniques are known in the art for screening gene products of 
combinatorial libraries made by point mutations or truncation, and for screening cDNA
libraries for gene products having a selected property. The most widely used
techniques, which are amenable to high-throughput analysis, for screening large gene
libraries typically include cloning the gene library into replicable expression vectors,
transforming appropriate cells with the resulting library of vectors, and expressing the
combinatorial genes under conditions in which detection of a desired activity facilitates
isolation of the vector encoding the gene whose product was detected. Recursive
ensemble mutagenesis (REM), a technique which enhances the frequency of functional
mutants in the libraries, can be used in combination with the screening assays to identify
variants of a protein of the invention (Arkin and Youvan, 1992, Proc. Natl Acad. Sci.
USA 89:7811-7815; Delagrave et al, 1993, Protein Engineering 6(3):327-331).

An isolated polypeptide corresponding to a marker of the invention, or a
fragment thereof, can be used as an immunogen to generate antibodies using standard
techniques for polyclonal and monoclonal antibody preparation. The full-length
polypeptide or protein can be used or, alternatively, the invention provides antigenic
peptide fragments for use as immunogens. The antigenic peptide of a protein of the
invention comprises at least 8 (preferably 10, 15, 20, or 30 or more) amino acid residues
of the amino acid sequence of one of the polypeptides of the invention, and encompasses
an epitope of the protein such that an antibody raised against the peptide forms a specific
immune complex with a marker of the invention to which the protein corresponds.
Preferred epitopes encompassed by the antigenic peptide are regions that are located on
the surface of the protein, e.g., hydrophilic regions. Hydrophobicity sequence analysis,
hydrophilicity sequence analysis, or similar analyses can be used to identify hydrophilic
regions.
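
One way such a hydrophobicity analysis could be carried out is a sliding-window
average over a per-residue hydropathy scale. The sketch below assumes the Kyte-Doolittle
scale (on which more negative values are more hydrophilic) and a seven-residue window;
the text above does not name a scale or window size, so both are illustrative choices,
as are the function names.

```python
# Kyte-Doolittle hydropathy values; more negative means more hydrophilic.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def window_hydropathy(seq: str, window: int = 7) -> list[float]:
    """Mean hydropathy of each full window along the sequence."""
    if window < 1 or window > len(seq):
        raise ValueError("window must be between 1 and the sequence length")
    scores = [KYTE_DOOLITTLE[residue] for residue in seq.upper()]
    return [
        sum(scores[i:i + window]) / window
        for i in range(len(scores) - window + 1)
    ]


def most_hydrophilic_window(seq: str, window: int = 7) -> tuple[int, float]:
    """Return (0-based start index, mean hydropathy) of the most hydrophilic window."""
    means = window_hydropathy(seq, window)
    start = min(range(len(means)), key=means.__getitem__)
    return start, means[start]
```

A window with a strongly negative mean is a candidate surface-exposed region; whether
it makes a useful antigenic peptide would still have to be confirmed experimentally.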

An immunogen typically is used to prepare antibodies by immunizing a suitable 
(i.e., immunocompetent) subject such as a rabbit, goat, mouse, or other mammal or
vertebrate. An appropriate immunogenic preparation can contain, for example,
recombinantly-expressed or chemically-synthesized polypeptide. The preparation can
further include an adjuvant, such as Freund's complete or incomplete adjuvant, or a
similar immunostimulatory agent.

Accordingly, another aspect of the invention pertains to antibodies directed 
against a polypeptide of the invention. The terms "antibody" and "antibody substance"
as used interchangeably herein refer to immunoglobulin molecules and immunologically
active portions of immunoglobulin molecules, i.e., molecules that contain an antigen
binding site which specifically binds an antigen, such as a polypeptide of the invention,
e.g., an epitope of a polypeptide of the invention. A molecule which specifically binds
to a given polypeptide of the invention is a molecule which binds the polypeptide, but
does not substantially bind other molecules in a sample, e.g., a biological sample, which
naturally contains the polypeptide. Examples of immunologically active portions of
immunoglobulin molecules include F(ab) and F(ab')2 fragments which can be generated
by treating the antibody with an enzyme such as pepsin. The invention provides
polyclonal and monoclonal antibodies. The term "monoclonal antibody" or
"monoclonal antibody composition", as used herein, refers to a population of antibody
molecules that contain only one species of an antigen binding site capable of
immunoreacting with a particular epitope.

Polyclonal antibodies can be prepared as described above by immunizing a 
suitable subject with a polypeptide of the invention as an immunogen. Preferred
polyclonal antibody compositions are ones that have been selected for antibodies
directed against a polypeptide or polypeptides of the invention. Particularly preferred
polyclonal antibody preparations are ones that contain only antibodies directed against a
polypeptide or polypeptides of the invention. Particularly preferred immunogen
compositions are those that contain no other human proteins such as, for example,
immunogen compositions made using a non-human host cell for recombinant expression
of a polypeptide of the invention. In such a manner, the only human epitope or epitopes
recognized by the resulting antibody compositions raised against this immunogen will
be present as part of a polypeptide or polypeptides of the invention.

The antibody titer in the immunized subject can be monitored over time by
standard techniques, such as with an enzyme linked immunosorbent assay (ELISA)
using immobilized polypeptide. If desired, the antibody molecules can be harvested or
isolated from the subject (e.g., from the blood or serum of the subject) and further
purified by well-known techniques, such as protein A chromatography to obtain the IgG
fraction. Alternatively, antibodies specific for a protein or polypeptide of the invention
can be selected for (e.g., partially purified) or purified by, e.g., affinity chromatography.
For example, a recombinantly expressed and purified (or partially purified) protein of
the invention is produced as described herein, and covalently or non-covalently coupled
to a solid support such as, for example, a chromatography column. The column can then
be used to affinity purify antibodies specific for the proteins of the invention from a
sample containing antibodies directed against a large number of different epitopes,
thereby generating a substantially purified antibody composition, i.e., one that is
substantially free of contaminating antibodies. By a substantially purified antibody
composition is meant, in this context, that the antibody sample contains at most only
30% (by dry weight) of contaminating antibodies directed against epitopes other than
those of the desired protein or polypeptide of the invention, and preferably at most 20%,
yet more preferably at most 10%, and most preferably at most 5% (by dry weight) of the
sample is contaminating antibodies. A purified antibody composition means that at least
99% of the antibodies in the composition are directed against the desired protein or
polypeptide of the invention.

At an appropriate time after immunization, e.g., when the specific antibody titers 
are highest, antibody-producing cells can be obtained from the subject and used to
prepare monoclonal antibodies by standard techniques, such as the hybridoma technique
originally described by Kohler and Milstein (1975) Nature 256:495-497, the human B
cell hybridoma technique (see Kozbor et al, 1983, Immunol. Today 4:72), the EBV-
hybridoma technique (see Cole et al, pp. 77-96 In Monoclonal Antibodies and Cancer
Therapy, Alan R. Liss, Inc., 1985) or trioma techniques. The technology for producing
hybridomas is well known (see generally Current Protocols in Immunology, Coligan et
al ed., John Wiley & Sons, New York, 1994). Hybridoma cells producing a
monoclonal antibody of the invention are detected by screening the hybridoma culture
supernatants for antibodies that bind the polypeptide of interest, e.g., using a standard
ELISA assay.

Alternative to preparing monoclonal antibody-secreting hybridomas, a
monoclonal antibody directed against a polypeptide of the invention can be identified
and isolated by screening a recombinant combinatorial immunoglobulin library (e.g., an
antibody phage display library) with the polypeptide of interest. Kits for generating and
screening phage display libraries are commercially available (e.g., the Pharmacia
Recombinant Phage Antibody System, Catalog No. 27-9400-01; and the Stratagene
SurfZAP Phage Display Kit, Catalog No. 240612). Additionally, examples of methods
and reagents particularly amenable for use in generating and screening antibody display
library can be found in, for example, U.S. Patent No. 5,223,409; PCT Publication No.
WO 92/18619; PCT Publication No. WO 91/17271; PCT Publication No. WO
92/20791; PCT Publication No. WO 92/15679; PCT Publication No. WO 93/01288;
PCT Publication No. WO 92/01047; PCT Publication No. WO 92/09690; PCT
Publication No. WO 90/02809; Fuchs et al (1991) Bio/Technology 9:1370-1372; Hay et
al (1992) Hum. Antibod. Hybridomas 3:81-85; Huse et al (1989) Science 246:1275-
1281; Griffiths et al (1993) EMBO J. 12:725-734.

Additionally, recombinant antibodies, such as chimeric and humanized 
monoclonal antibodies, comprising both human and non-human portions, which can be
made using standard recombinant DNA techniques, are within the scope of the
invention. A chimeric antibody is a molecule in which different portions are derived
from different animal species, such as those having a variable region derived from a
murine mAb and a human immunoglobulin constant region. (See, e.g., Cabilly et al.,
U.S. Patent No. 4,816,567; and Boss et al., U.S. Patent No. 4,816,397, which are
incorporated herein by reference in their entirety.) Humanized antibodies are antibody
molecules from non-human species having one or more complementarity determining
regions (CDRs) from the non-human species and a framework region from a human
immunoglobulin molecule. (See, e.g., Queen, U.S. Patent No. 5,585,089, which is
incorporated herein by reference in its entirety.) Such chimeric and humanized
monoclonal antibodies can be produced by recombinant DNA techniques known in the
art, for example using methods described in PCT Publication No. WO 87/02671;
European Patent Application 184,187; European Patent Application 171,496; European
Patent Application 173,494; PCT Publication No. WO 86/01533; U.S. Patent No.
4,816,567; European Patent Application 125,023; Better et al. (1988) Science 240:1041-
1043; Liu et al (1987) Proc. Natl Acad. Sci. USA 84:3439-3443; Liu et al (1987) J.
Immunol 139:3521-3526; Sun et al (1987) Proc. Natl Acad. Sci. USA 84:214-218;
Nishimura et al. (1987) Cancer Res. 47:999-1005; Wood et al (1985) Nature 314:446-
449; and Shaw et al (1988) J. Natl Cancer Inst 80:1553-1559; Morrison (1985)
Science 229:1202-1207; Oi et al (1986) Bio/Techniques 4:214; U.S. Patent 5,225,539;
Jones et al (1986) Nature 321:552-525; Verhoeyan et al (1988) Science 239:1534; and
Beidler et al. (1988) J. Immunol 141:4053-4060.

Antibodies of the invention may be used as therapeutic agents in treating
cancers. In a preferred embodiment, completely human antibodies of the invention are
used for therapeutic treatment of human cancer patients, particularly those having an
ovarian cancer. Such antibodies can be produced, for example, using transgenic mice
which are incapable of expressing endogenous immunoglobulin heavy and light chain
genes, but which can express human heavy and light chain genes. The transgenic mice
are immunized in the normal fashion with a selected antigen, e.g., all or a portion of a
polypeptide corresponding to a marker of the invention. Monoclonal antibodies directed
against the antigen can be obtained using conventional hybridoma technology. The
human immunoglobulin transgenes harbored by the transgenic mice rearrange during B
cell differentiation, and subsequently undergo class switching and somatic mutation.
Thus, using such a technique, it is possible to produce therapeutically useful IgG, IgA
and IgE antibodies. For an overview of this technology for producing human antibodies,
see Lonberg and Huszar (1995) Int. Rev. Immunol 13:65-93. For a detailed discussion
of this technology for producing human antibodies and human monoclonal antibodies
and protocols for producing such antibodies, see, e.g., U.S. Patent 5,625,126; U.S.
Patent 5,633,425; U.S. Patent 5,569,825; U.S. Patent 5,661,016; and U.S. Patent
5,545,806. In addition, companies such as Abgenix, Inc. (Fremont, CA), can be
engaged to provide human antibodies directed against a selected antigen using
technology similar to that described above.

Completely human antibodies which recognize a selected epitope can be
generated using a technique referred to as "guided selection." In this approach a
selected non-human monoclonal antibody, e.g., a murine antibody, is used to guide the
selection of a completely human antibody recognizing the same epitope (Jespers et al,
1994, Bio/technology 12:899-903).

An antibody directed against a polypeptide corresponding to a marker of the
invention (e.g., a monoclonal antibody) can be used to isolate the polypeptide by
standard techniques, such as affinity chromatography or immunoprecipitation.
Moreover, such an antibody can be used to detect the marker (e.g., in a cellular lysate or
cell supernatant) in order to evaluate the level and pattern of expression of the marker.
The antibodies can also be used diagnostically to monitor protein levels in tissues or
body fluids (e.g., in an ovary-associated body fluid) as part of a clinical testing
procedure, e.g., to determine the efficacy of a given treatment regimen.
Detection can be facilitated by coupling the antibody to a detectable substance.
Examples of detectable substances include various enzymes, prosthetic groups,
fluorescent materials, luminescent materials, bioluminescent materials, and radioactive
materials. Examples of suitable enzymes include horseradish peroxidase, alkaline
phosphatase, β-galactosidase, or acetylcholinesterase; examples of suitable prosthetic
group complexes include streptavidin/biotin and avidin/biotin; examples of suitable
fluorescent materials include umbelliferone, fluorescein, fluorescein isothiocyanate,
rhodamine, dichlorotriazinylamine fluorescein, dansyl chloride or phycoerythrin; an
example of a luminescent material includes luminol; examples of bioluminescent
materials include luciferase, luciferin, and aequorin; and examples of suitable
radioactive material include 125I, 131I, 35S or 3H.

Further, an antibody (or fragment thereof) can be conjugated to a therapeutic 
moiety such as a cytotoxin, a therapeutic agent or a radioactive metal ion. A cytotoxin
or cytotoxic agent includes any agent that is detrimental to cells. Examples include
taxol, cytochalasin B, gramicidin D, ethidium bromide, emetine, mitomycin, etoposide,
tenoposide, vincristine, vinblastine, colchicin, doxorubicin, daunorubicin, dihydroxy
anthracin dione, mitoxantrone, mithramycin, actinomycin D, 1-dehydrotestosterone,
glucocorticoids, procaine, tetracaine, lidocaine, propranolol, and puromycin and analogs
or homologs thereof. Therapeutic agents include, but are not limited to, antimetabolites
(e.g., methotrexate, 6-mercaptopurine, 6-thioguanine, cytarabine, 5-fluorouracil,
decarbazine), alkylating agents (e.g., mechlorethamine, thioepa, chlorambucil,
melphalan, carmustine (BSNU) and lomustine (CCNU), cyclophosphamide, busulfan,
dibromomannitol, streptozotocin, mitomycin C, and cis-dichlorodiamine platinum (II)
(DDP) cisplatin), anthracyclines (e.g., daunorubicin (formerly daunomycin) and
doxorubicin), antibiotics (e.g., dactinomycin (formerly actinomycin), bleomycin,
mithramycin, and anthramycin (AMC)), and anti-mitotic agents (e.g., vincristine and
vinblastine).

The conjugates of the invention can be used for modifying a given biological
response; the drug moiety is not to be construed as limited to classical chemical
therapeutic agents. For example, the drug moiety may be a protein or polypeptide
possessing a desired biological activity. Such proteins may include, for example, a toxin
such as abrin, ricin A, pseudomonas exotoxin, or diphtheria toxin; a protein such as
tumor necrosis factor, alpha-interferon, beta-interferon, nerve growth factor, platelet
derived growth factor, tissue plasminogen activator; or, biological response modifiers
such as, for example, lymphokines, interleukin-1 ("IL-1"), interleukin-2 ("IL-2"),
interleukin-6 ("IL-6"), granulocyte macrophage colony stimulating factor ("GM-CSF"),
granulocyte colony stimulating factor ("G-CSF"), or other growth factors.

Techniques for conjugating such therapeutic moiety to antibodies are well
known, see, e.g., Arnon et al., "Monoclonal Antibodies For Immunotargeting Of Drugs
In Cancer Therapy", in Monoclonal Antibodies And Cancer Therapy, Reisfeld et al.
(eds.), pp. 243-56 (Alan R. Liss, Inc. 1985); Hellstrom et al., "Antibodies For Drug
Delivery", in Controlled Drug Delivery (2nd Ed.), Robinson et al. (eds.), pp. 623-53
(Marcel Dekker, Inc. 1987); Thorpe, "Antibody Carriers Of Cytotoxic Agents In Cancer
Therapy: A Review", in Monoclonal Antibodies '84: Biological And Clinical
Applications, Pinchera et al. (eds.), pp. 475-506 (1985); "Analysis, Results, And Future
Prospective Of The Therapeutic Use Of Radiolabeled Antibody In Cancer Therapy", in
Monoclonal Antibodies For Cancer Detection And Therapy, Baldwin et al. (eds.), pp.
303-16 (Academic Press 1985), and Thorpe et al., "The Preparation And Cytotoxic
Properties Of Antibody-Toxin Conjugates", Immunol. Rev., 62:119-58 (1982).

Alternatively, an antibody can be conjugated to a second antibody to form an 
antibody heteroconjugate as described by Segal in U.S. Patent No. 4,676,980. 

Accordingly, in one aspect, the invention provides substantially purified
antibodies or fragments thereof, and non-human antibodies or fragments thereof, which
antibodies or fragments specifically bind to a polypeptide comprising an amino acid
sequence selected from the group consisting of the amino acid sequences of the present
invention, an amino acid sequence encoded by the cDNA of the present invention, a
fragment of at least 15 amino acid residues of an amino acid sequence of the present
invention, an amino acid sequence which is at least 95% identical to the amino acid
sequence of the present invention (wherein the percent identity is determined using the
ALIGN program of the GCG software package with a PAM120 weight residue table, a
gap length penalty of 12, and a gap penalty of 4) and an amino acid sequence which is
encoded by a nucleic acid molecule which hybridizes to a nucleic acid molecule
consisting of the nucleic acid molecules of the present invention, or a complement
thereof, under conditions of hybridization of 6X SSC at 45°C and washing in 0.2X
SSC, 0.1% SDS at 65°C. In various embodiments, the substantially purified antibodies
of the invention, or fragments thereof, can be human, non-human, chimeric and/or
humanized antibodies.
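
The 95% identity criterion recited above can be illustrated with a minimal sketch. It is not the ALIGN program: it assumes the two sequences have already been aligned (for example, by ALIGN with the PAM120 table and the stated gap penalties) and that gaps are written as "-". The function names and the choice to count every aligned column (including single-sided gaps) in the denominator are assumptions made for illustration only; other conventions for the denominator exist.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over a pre-computed pairwise alignment.

    Assumption: both strings come from the same alignment, have equal
    length, and use '-' for gap positions.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    # Ignore columns where both sequences carry a gap.
    columns = [
        (a, b)
        for a, b in zip(aligned_a.upper(), aligned_b.upper())
        if not (a == "-" and b == "-")
    ]
    if not columns:
        raise ValueError("alignment contains no residues")
    # A column counts as identical only if both residues match and neither is a gap.
    matches = sum(1 for a, b in columns if a == b and a != "-")
    return 100.0 * matches / len(columns)


def meets_identity_threshold(aligned_a: str, aligned_b: str,
                             threshold: float = 95.0) -> bool:
    """True when the aligned pair is at least `threshold` percent identical."""
    return percent_identity(aligned_a, aligned_b) >= threshold
```

For instance, two aligned 20-residue sequences that differ at a single position are 95% identical under this convention and would satisfy the threshold.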

In another aspect, the invention provides non-human antibodies or fragments
thereof, which antibodies or fragments specifically bind to a polypeptide comprising an
amino acid sequence selected from the group consisting of: the amino acid sequence of
the present invention, an amino acid sequence encoded by the cDNA of the present
invention, a fragment of at least 15 amino acid residues of the amino acid sequence of
the present invention, an amino acid sequence which is at least 95% identical to the
amino acid sequence of the present invention (wherein the percent identity is determined
using the ALIGN program of the GCG software package with a PAM120 weight residue
table, a gap length penalty of 12, and a gap penalty of 4) and an amino acid sequence
which is encoded by a nucleic acid molecule which hybridizes to a nucleic acid
molecule consisting of the nucleic acid molecules of the present invention, or a
complement thereof, under conditions of hybridization of 6X SSC at 45°C and washing
in 0.2X SSC, 0.1% SDS at 65°C. Such non-human antibodies can be goat, mouse,
sheep, horse, chicken, rabbit, or rat antibodies. Alternatively, the non-human antibodies
of the invention can be chimeric and/or humanized antibodies. In addition, the
non-human antibodies of the invention can be polyclonal antibodies or monoclonal
antibodies.

In still a further aspect, the invention provides monoclonal antibodies or
fragments thereof, which antibodies or fragments specifically bind to a polypeptide
comprising an amino acid sequence selected from the group consisting of the amino acid
sequences of the present invention, an amino acid sequence encoded by the cDNA of the
present invention, a fragment of at least 15 amino acid residues of an amino acid
sequence of the present invention, an amino acid sequence which is at least 95%
identical to an amino acid sequence of the present invention (wherein the percent
identity is determined using the ALIGN program of the GCG software package with a
PAM120 weight residue table, a gap length penalty of 12, and a gap penalty of 4) and an
amino acid sequence which is encoded by a nucleic acid molecule which hybridizes to a
nucleic acid molecule consisting of the nucleic acid molecules of the present invention,
or a complement thereof, under conditions of hybridization of 6X SSC at 45°C and
washing in 0.2X SSC, 0.1% SDS at 65°C. The monoclonal antibodies can be human,
humanized, chimeric and/or non-human antibodies.

The substantially purified antibodies or fragments thereof may specifically bind 
to a signal peptide, a secreted sequence, an extracellular domain, a transmembrane or a 
cytoplasmic domain or cytoplasmic membrane of a polypeptide of the invention. In a 
particularly preferred embodiment, the substantially purified antibodies or fragments 
thereof, the non-human antibodies or fragments thereof, and/or the monoclonal 
antibodies or fragments thereof, of the invention specifically bind to a secreted sequence 
or an extracellular domain of the amino acid sequences of the present invention. 

Any of the antibodies of the invention can be conjugated to a therapeutic moiety 
or to a detectable substance. Non-limiting examples of detectable substances that can be 
conjugated to the antibodies of the invention are an enzyme, a prosthetic group, a 
fluorescent material, a luminescent material, a bioluminescent material, and a 
radioactive material. 

The invention also provides a kit containing an antibody of the invention
conjugated to a detectable substance, and instructions for use. Still another aspect of the 
invention is a pharmaceutical composition comprising an antibody of the invention and 
a pharmaceutically acceptable carrier. In preferred embodiments, the pharmaceutical 
composition contains an antibody of the invention, a therapeutic moiety, and a 
pharmaceutically acceptable carrier. 

Still another aspect of the invention is a method of making an antibody that
specifically recognizes a polypeptide of the present invention, the method comprising
immunizing a mammal with a polypeptide. The polypeptide used as an immunogen
comprises an amino acid sequence selected from the group consisting of the amino acid
sequence of the present invention, an amino acid sequence encoded by the cDNA of the
nucleic acid molecules of the present invention, a fragment of at least 15 amino acid
residues of the amino acid sequence of the present invention, an amino acid sequence
which is at least 95% identical to the amino acid sequence of the present invention
(wherein the percent identity is determined using the ALIGN program of the GCG
software package with a PAM120 weight residue table, a gap length penalty of 12, and a
gap penalty of 4) and an amino acid sequence which is encoded by a nucleic acid
molecule which hybridizes to a nucleic acid molecule consisting of the nucleic acid
molecules of the present invention, or a complement thereof, under conditions of
hybridization of 6X SSC at 45°C and washing in 0.2X SSC, 0.1% SDS at 65°C.

After immunization, a sample is collected from the mammal that contains an
antibody that specifically recognizes the polypeptide. Preferably, the polypeptide is
recombinantly produced using a non-human host cell. Optionally, the antibodies can be
further purified from the sample using techniques well known to those of skill in the art.
The method can further comprise producing a monoclonal antibody-producing cell from
the cells of the mammal. Optionally, antibodies are collected from the
antibody-producing cell.

III. Recombinant Expression Vectors and Host Cells

Another aspect of the invention pertains to vectors, preferably expression
vectors, containing a nucleic acid encoding a polypeptide corresponding to a marker of
the invention (or a portion of such a polypeptide). As used herein, the term "vector"
refers to a nucleic acid molecule capable of transporting another nucleic acid to which it
has been linked. One type of vector is a "plasmid", which refers to a circular double
stranded DNA loop into which additional DNA segments can be ligated. Another type
of vector is a viral vector, wherein additional DNA segments can be ligated into the viral
genome. Certain vectors are capable of autonomous replication in a host cell into which
they are introduced (e.g., bacterial vectors having a bacterial origin of replication and
episomal mammalian vectors). Other vectors (e.g., non-episomal mammalian vectors)
are integrated into the genome of a host cell upon introduction into the host cell, and
thereby are replicated along with the host genome. Moreover, certain vectors, namely
expression vectors, are capable of directing the expression of genes to which they are
operably linked. In general, expression vectors of utility in recombinant DNA
techniques are often in the form of plasmids (vectors). However, the invention is
intended to include such other forms of expression vectors, such as viral vectors (e.g.,
replication defective retroviruses, adenoviruses and adeno-associated viruses), which
serve equivalent functions.

The recombinant expression vectors of the invention comprise a nucleic acid of
the invention in a form suitable for expression of the nucleic acid in a host cell. This
means that the recombinant expression vectors include one or more regulatory
sequences, selected on the basis of the host cells to be used for expression, which are
operably linked to the nucleic acid sequence to be expressed. Within a recombinant
expression vector, "operably linked" is intended to mean that the nucleotide sequence of
interest is linked to the regulatory sequence(s) in a manner which allows for expression
of the nucleotide sequence (e.g., in an in vitro transcription/translation system or in a
host cell when the vector is introduced into the host cell). The term "regulatory
sequence" is intended to include promoters, enhancers and other expression control
elements (e.g., polyadenylation signals). Such regulatory sequences are described, for
example, in Goeddel, Methods in Enzymology: Gene Expression Technology vol. 185,
Academic Press, San Diego, CA (1991). Regulatory sequences include those which
direct constitutive expression of a nucleotide sequence in many types of host cell and
those which direct expression of the nucleotide sequence only in certain host cells (e.g.,
tissue-specific regulatory sequences). It will be appreciated by those skilled in the art
that the design of the expression vector can depend on such factors as the choice of the
host cell to be transformed, the level of expression of protein desired, and the like. The
expression vectors of the invention can be introduced into host cells to thereby produce
proteins or peptides, including fusion proteins or peptides, encoded by nucleic acids as
described herein.

The recombinant expression vectors of the invention can be designed for
expression of a polypeptide corresponding to a marker of the invention in prokaryotic
(e.g., E. coli) or eukaryotic cells (e.g., insect cells (using baculovirus expression
vectors), yeast cells or mammalian cells). Suitable host cells are discussed further in
Goeddel, supra. Alternatively, the recombinant expression vector can be transcribed
and translated in vitro, for example using T7 promoter regulatory sequences and T7
polymerase.




Expression of proteins in prokaryotes is most often carried out in E. coli with
vectors containing constitutive or inducible promoters directing the expression of either
fusion or non-fusion proteins. Fusion vectors add a number of amino acids to a protein
encoded therein, usually to the amino terminus of the recombinant protein. Such fusion
vectors typically serve three purposes: 1) to increase expression of recombinant protein;
2) to increase the solubility of the recombinant protein; and 3) to aid in the purification
of the recombinant protein by acting as a ligand in affinity purification. Often, in fusion
expression vectors, a proteolytic cleavage site is introduced at the junction of the fusion
moiety and the recombinant protein to enable separation of the recombinant protein
from the fusion moiety subsequent to purification of the fusion protein. Such enzymes,
and their cognate recognition sequences, include Factor Xa, thrombin and enterokinase.
Typical fusion expression vectors include pGEX (Pharmacia Biotech Inc; Smith and
Johnson, 1988, Gene 67:31-40), pMAL (New England Biolabs, Beverly, MA) and
pRIT5 (Pharmacia, Piscataway, NJ) which fuse glutathione S-transferase (GST),
maltose E binding protein, or protein A, respectively, to the target recombinant protein.
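
The arrangement of fusion moiety, cleavage site, and recombinant protein described above can be sketched as a simple string model. This is an illustration under stated assumptions, not a description of any particular vector: the Factor Xa recognition sequence is taken to be Ile-Glu-Gly-Arg (IEGR) with cleavage on the carboxyl side of the arginine, the function names are hypothetical, and the tag and target sequences in the usage note are arbitrary placeholders rather than real GST or marker sequences.

```python
# Assumption: Factor Xa recognition motif, cleaved after the final Arg.
FACTOR_XA_SITE = "IEGR"


def build_fusion(tag: str, target: str, site: str = FACTOR_XA_SITE) -> str:
    """Model a fusion protein: amino-terminal tag, cleavage site, target."""
    return tag + site + target


def cleave(fusion: str, site: str = FACTOR_XA_SITE) -> tuple[str, str]:
    """Split the fusion after the first occurrence of the recognition site.

    Returns (tag plus site, released target). A tag or target that itself
    contains the motif would be cut at the earliest occurrence, which is
    one reason a real design checks the sequences for internal sites.
    """
    index = fusion.find(site)
    if index < 0:
        raise ValueError("no cleavage site found in fusion protein")
    cut = index + len(site)
    return fusion[:cut], fusion[cut:]
```

With placeholder sequences, `cleave(build_fusion("MSPILGYW", "ACDEFGHK"))` separates the tag-plus-site portion from the target, mirroring the separation step that follows affinity purification.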

Examples of suitable inducible non-fusion E. coli expression vectors include
pTrc (Amann et al., 1988, Gene 69:301-315) and pET 11d (Studier et al., p. 60-89, In
Gene Expression Technology: Methods in Enzymology vol. 185, Academic Press, San
Diego, CA, 1991). Target gene expression from the pTrc vector relies on host RNA
polymerase transcription from a hybrid trp-lac fusion promoter. Target gene expression
from the pET 11d vector relies on transcription from a T7 gn10-lac fusion promoter
mediated by a co-expressed viral RNA polymerase (T7 gn1). This viral polymerase is
supplied by host strains BL21(DE3) or HMS174(DE3) from a resident prophage
harboring a T7 gn1 gene under the transcriptional control of the lacUV5 promoter.

One strategy to maximize recombinant protein expression in E. coli is to express
the protein in a host bacteria with an impaired capacity to proteolytically cleave the
recombinant protein (Gottesman, p. 119-128, In Gene Expression Technology: Methods
in Enzymology vol. 185, Academic Press, San Diego, CA, 1990). Another strategy is to
alter the nucleic acid sequence of the nucleic acid to be inserted into an expression
vector so that the individual codons for each amino acid are those preferentially utilized
in E. coli (Wada et al., 1992, Nucleic Acids Res. 20:2111-2118). Such alteration of
nucleic acid sequences of the invention can be carried out by standard DNA synthesis
techniques.
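
The codon-alteration strategy can be sketched as a back-translation that assigns one preferred codon to each amino acid. The table below is an illustrative assumption listing codons commonly reported as frequent in E. coli; it is not taken from Wada et al. and should be replaced with values from a current codon usage table for the actual host strain. The function name is hypothetical.

```python
# Illustrative assumption: one commonly cited high-frequency E. coli codon
# per amino acid (one-letter code). Not taken from the cited reference.
PREFERRED_CODON = {
    "A": "GCG", "R": "CGT", "N": "AAC", "D": "GAT", "C": "TGC",
    "Q": "CAG", "E": "GAA", "G": "GGC", "H": "CAT", "I": "ATT",
    "L": "CTG", "K": "AAA", "M": "ATG", "F": "TTT", "P": "CCG",
    "S": "AGC", "T": "ACC", "W": "TGG", "Y": "TAT", "V": "GTG",
}


def back_translate(protein: str) -> str:
    """Return a coding sequence using one preferred codon per residue."""
    codons = []
    for residue in protein.upper():
        if residue not in PREFERRED_CODON:
            raise ValueError(f"unsupported residue: {residue!r}")
        codons.append(PREFERRED_CODON[residue])
    return "".join(codons)
```

A design that always picks the single most frequent codon is the simplest choice; practical schemes often also weigh factors such as mRNA secondary structure and repeated sequence, which this sketch ignores.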

In another embodiment, the expression vector is a yeast expression vector.
Examples of vectors for expression in yeast S. cerevisiae include pYepSec1 (Baldari et
al., 1987, EMBO J. 6:229-234), pMFa (Kurjan and Herskowitz, 1982, Cell 30:933-
943), pJRY88 (Schultz et al., 1987, Gene 54:113-123), pYES2 (Invitrogen Corporation,
San Diego, CA), and pPicZ (Invitrogen Corp, San Diego, CA).

Alternatively, the expression vector is a baculovirus expression vector.
Baculovirus vectors available for expression of proteins in cultured insect cells (e.g., Sf
9 cells) include the pAc series (Smith et al., 1983, Mol. Cell Biol. 3:2156-2165) and the
pVL series (Lucklow and Summers, 1989, Virology 170:31-39).

In yet another embodiment, a nucleic acid of the invention is expressed in
mammalian cells using a mammalian expression vector. Examples of mammalian
expression vectors include pCDM8 (Seed, 1987, Nature 329:840) and pMT2PC
(Kaufman et al., 1987, EMBO J. 6:187-195). When used in mammalian cells, the
expression vector's control functions are often provided by viral regulatory elements.
For example, commonly used promoters are derived from polyoma, Adenovirus 2,
cytomegalovirus and Simian Virus 40. For other suitable expression systems for both
prokaryotic and eukaryotic cells see chapters 16 and 17 of Sambrook et al., supra.

In another embodiment, the recombinant mammalian expression vector is
capable of directing expression of the nucleic acid preferentially in a particular cell type
(e.g., tissue-specific regulatory elements are used to express the nucleic acid).
Tissue-specific regulatory elements are known in the art. Non-limiting examples of suitable
tissue-specific promoters include the albumin promoter (liver-specific; Pinkert et al.,
1987, Genes Dev. 1:268-277), lymphoid-specific promoters (Calame and Eaton, 1988,
Adv. Immunol. 43:235-275), in particular promoters of T cell receptors (Winoto and
Baltimore, 1989, EMBO J. 8:729-733) and immunoglobulins (Banerji et al., 1983, Cell
33:729-740; Queen and Baltimore, 1983, Cell 33:741-748), neuron-specific promoters
(e.g., the neurofilament promoter; Byrne and Ruddle, 1989, Proc. Natl. Acad. Sci. USA
86:5473-5477), pancreas-specific promoters (Edlund et al., 1985, Science 230:912-916),
and mammary gland-specific promoters (e.g., milk whey promoter; U.S. Patent No.
4,873,316 and European Application Publication No. 264,166).
Developmentally-regulated promoters are also encompassed, for example the murine hox promoters
(Kessel and Gruss, 1990, Science 249:374-379) and the α-fetoprotein promoter (Camper
and Tilghman, 1989, Genes Dev. 3:537-546).

The invention further provides a recombinant expression vector comprising a
DNA molecule of the invention cloned into the expression vector in an antisense
orientation. That is, the DNA molecule is operably linked to a regulatory sequence in a
manner which allows for expression (by transcription of the DNA molecule) of an RNA
molecule which is antisense to the mRNA encoding a polypeptide of the invention.
Regulatory sequences operably linked to a nucleic acid cloned in the antisense
orientation can be chosen which direct the continuous expression of the antisense RNA
molecule in a variety of cell types, for instance viral promoters and/or enhancers, or
regulatory sequences can be chosen which direct constitutive, tissue-specific or cell type
specific expression of antisense RNA. The antisense expression vector can be in the
form of a recombinant plasmid, phagemid, or attenuated virus in which antisense nucleic
acids are produced under the control of a high efficiency regulatory region, the activity
of which can be determined by the cell type into which the vector is introduced. For a
discussion of the regulation of gene expression using antisense genes see Weintraub et
al., 1986, Trends in Genetics, Vol. 1(1).

Another aspect of the invention pertains to host cells into which a recombinant
expression vector of the invention has been introduced. The terms "host cell" and
"recombinant host cell" are used interchangeably herein. It is understood that such
terms refer not only to the particular subject cell but to the progeny or potential progeny
of such a cell. Because certain modifications may occur in succeeding generations due
to either mutation or environmental influences, such progeny may not, in fact, be
identical to the parent cell, but are still included within the scope of the term as used
herein.

A host cell can be any prokaryotic (e.g., E. coli) or eukaryotic cell (e.g., insect 
cells, yeast or mammalian cells). 

Vector DNA can be introduced into prokaryotic or eukaryotic cells via
conventional transformation or transfection techniques. As used herein, the terms
"transformation" and "transfection" are intended to refer to a variety of art-recognized
techniques for introducing foreign nucleic acid into a host cell, including calcium
phosphate or calcium chloride co-precipitation, DEAE-dextran-mediated transfection,
lipofection, or electroporation. Suitable methods for transforming or transfecting host
cells can be found in Sambrook, et al. (supra), and other laboratory manuals.

For stable transfection of mammalian cells, it is known that, depending upon the
expression vector and transfection technique used, only a small fraction of cells may
integrate the foreign DNA into their genome. In order to identify and select these
integrants, a gene that encodes a selectable marker (e.g., for resistance to antibiotics) is
generally introduced into the host cells along with the gene of interest. Preferred
selectable markers include those which confer resistance to drugs, such as G418,
hygromycin and methotrexate. Cells stably transfected with the introduced nucleic acid
can be identified by drug selection (e.g., cells that have incorporated the selectable
marker gene will survive, while the other cells die).

A host cell of the invention, such as a prokaryotic or eukaryotic host cell in
culture, can be used to produce a polypeptide corresponding to a marker of the
invention. Accordingly, the invention further provides methods for producing a
polypeptide corresponding to a marker of the invention using the host cells of the
invention. In one embodiment, the method comprises culturing the host cell of the
invention (into which a recombinant expression vector encoding a polypeptide of the
invention has been introduced) in a suitable medium such that the marker is produced.
In another embodiment, the method further comprises isolating the marker polypeptide
from the medium or the host cell.

The host cells of the invention can also be used to produce nonhuman transgenic
animals. For example, in one embodiment, a host cell of the invention is a fertilized
oocyte or an embryonic stem cell into which sequences encoding a polypeptide
corresponding to a marker of the invention have been introduced. Such host cells can
then be used to create non-human transgenic animals in which exogenous sequences
encoding a marker protein of the invention have been introduced into their genome or
homologous recombinant animals in which endogenous gene(s) encoding a polypeptide
corresponding to a marker of the invention have been altered. Such animals
are useful for studying the function and/or activity of the polypeptide corresponding to
the marker and for identifying and/or evaluating modulators of polypeptide activity. As
used herein, a "transgenic animal" is a non-human animal, preferably a mammal, more
preferably a rodent such as a rat or mouse, in which one or more of the cells of the
animal includes a transgene. Other examples of transgenic animals include non-human
primates, sheep, dogs, cows, goats, chickens, amphibians, etc. A transgene is exogenous
DNA which is integrated into the genome of a cell from which a transgenic animal
develops and which remains in the genome of the mature animal, thereby directing the
expression of an encoded gene product in one or more cell types or tissues of the
transgenic animal. As used herein, an "homologous recombinant animal" is a
non-human animal, preferably a mammal, more preferably a mouse, in which an endogenous
gene has been altered by homologous recombination between the endogenous gene and
an exogenous DNA molecule introduced into a cell of the animal, e.g., an embryonic
cell of the animal, prior to development of the animal.

A transgenic animal of the invention can be created by introducing a nucleic acid
encoding a polypeptide corresponding to a marker of the invention into the male
pronuclei of a fertilized oocyte, e.g., by microinjection or retroviral infection, and allowing
the oocyte to develop in a pseudopregnant female foster animal. Intronic sequences and
polyadenylation signals can also be included in the transgene to increase the efficiency
of expression of the transgene. A tissue-specific regulatory sequence(s) can be operably
linked to the transgene to direct expression of the polypeptide of the invention to
particular cells. Methods for generating transgenic animals via embryo manipulation
and microinjection, particularly animals such as mice, have become conventional in the
art and are described, for example, in U.S. Patent Nos. 4,736,866 and 4,870,009, U.S.
Patent No. 4,873,191 and in Hogan, Manipulating the Mouse Embryo, Cold Spring
Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1986. Similar methods are used for
production of other transgenic animals. A transgenic founder animal can be identified
based upon the presence of the transgene in its genome and/or expression of mRNA
encoding the transgene in tissues or cells of the animals. A transgenic founder animal
can then be used to breed additional animals carrying the transgene. Moreover,
transgenic animals carrying the transgene can further be bred to other transgenic animals
carrying other transgenes.

To create an homologous recombinant animal, a vector is prepared which
contains at least a portion of a gene encoding a polypeptide corresponding to a marker of
the invention into which a deletion, addition or substitution has been introduced to
thereby alter, e.g., functionally disrupt, the gene. In a preferred embodiment, the vector
is designed such that, upon homologous recombination, the endogenous gene is
functionally disrupted (i.e., no longer encodes a functional protein; also referred to as a
"knock out" vector). Alternatively, the vector can be designed such that, upon
homologous recombination, the endogenous gene is mutated or otherwise altered but
still encodes functional protein (e.g., the upstream regulatory region can be altered to
thereby alter the expression of the endogenous protein). In the homologous
recombination vector, the altered portion of the gene is flanked at its 5' and 3' ends by
additional nucleic acid of the gene to allow for homologous recombination to occur
between the exogenous gene carried by the vector and an endogenous gene in an
embryonic stem cell. The additional flanking nucleic acid sequences are of sufficient
length for successful homologous recombination with the endogenous gene. Typically,
several kilobases of flanking DNA (both at the 5' and 3' ends) are included in the vector
(see, e.g., Thomas and Capecchi, 1987, Cell 51:503 for a description of homologous
recombination vectors). The vector is introduced into an embryonic stem cell line (e.g.,
by electroporation) and cells in which the introduced gene has homologously
recombined with the endogenous gene are selected (see, e.g., Li et al., 1992, Cell
69:915). The selected cells are then injected into a blastocyst of an animal (e.g., a
mouse) to form aggregation chimeras (see, e.g., Bradley, Teratocarcinomas and
Embryonic Stem Cells: A Practical Approach, Robertson, Ed., IRL, Oxford, 1987, pp.
113-152). A chimeric embryo can then be implanted into a suitable pseudopregnant
female foster animal and the embryo brought to term. Progeny harboring the
homologously recombined DNA in their germ cells can be used to breed animals in
which all cells of the animal contain the homologously recombined DNA by germline
transmission of the transgene. Methods for constructing homologous recombination
vectors and homologous recombinant animals are described further in Bradley (1991)
Current Opinion in Bio/Technology 2:823-829 and in PCT Publication NOS. WO
90/11354, WO 91/01140, WO 92/0968, and WO 93/04169.

In another embodiment, transgenic non-human animals can be produced which
contain selected systems which allow for regulated expression of the transgene. One
example of such a system is the cre/loxP recombinase system of bacteriophage P1. For
a description of the cre/loxP recombinase system, see, e.g., Lakso et al. (1992) Proc.
Natl. Acad. Sci. USA 89:6232-6236. Another example of a recombinase system is the
FLP recombinase system of Saccharomyces cerevisiae (O'Gorman et al., 1991, Science
251:1351-1355). If a cre/loxP recombinase system is used to regulate expression of the
transgene, animals containing transgenes encoding both the Cre recombinase and a
selected protein are required. Such animals can be provided through the construction of
"double" transgenic animals, e.g., by mating two transgenic animals, one containing a
transgene encoding a selected protein and the other containing a transgene encoding a
recombinase.

Clones of the non-human transgenic animals described herein can also be
produced according to the methods described in Wilmut et al. (1997) Nature 385:810-
813 and PCT Publication NOS. WO 97/07668 and WO 97/07669.

IV. Pharmaceutical Compositions 

The nucleic acid molecules, polypeptides, and antibodies (also referred to herein
as "active compounds") corresponding to a marker of the invention can be incorporated
into pharmaceutical compositions suitable for administration. Such compositions
typically comprise the nucleic acid molecule, protein, or antibody and a
pharmaceutically acceptable carrier. As used herein the language "pharmaceutically
acceptable carrier" is intended to include any and all solvents, dispersion media,
coatings, antibacterial and antifungal agents, isotonic and absorption delaying agents,
and the like, compatible with pharmaceutical administration. The use of such media and
agents for pharmaceutically active substances is well known in the art. Except insofar as
any conventional media or agent is incompatible with the active compound, use thereof
in the compositions is contemplated. Supplementary active compounds can also be
incorporated into the compositions.

The invention includes methods for preparing pharmaceutical compositions for 
modulating the expression or activity of a polypeptide or nucleic acid corresponding to a 
marker of the invention. Such methods comprise formulating a pharmaceutically 
acceptable carrier with an agent which modulates expression or activity of a polypeptide 
or nucleic acid corresponding to a marker of the invention. Such compositions can 
further include additional active agents. Thus, the invention further includes methods 
for preparing a pharmaceutical composition by formulating a pharmaceutically 
acceptable carrier with an agent which modulates expression or activity of a polypeptide 
or nucleic acid corresponding to a marker of the invention and one or more additional 
active compounds. 

The invention also provides methods (also referred to herein as 
"screening assays") for identifying modulators, i.e., candidate or test compounds or 
agents (e.g., peptides, peptidomimetics, peptoids, small molecules or other drugs) which 
(a) bind to the marker, or (b) have a modulatory (e.g., stimulatory or inhibitory) effect 
on the activity of the marker or, more specifically, (c) have a modulatory effect on the 
interactions of the marker with one or more of its natural substrates (e.g., peptide, 
protein, hormone, co-factor, or nucleic acid), or (d) have a modulatory effect on the 
expression of the marker. Such assays typically comprise a reaction between the marker 
and one or more assay components. The other components may be either the test 
compound itself, or a combination of test compound and a natural binding partner of the 
marker. 

The test compounds of the present invention may be obtained from any available 
source, including systematic libraries of natural and/or synthetic compounds. Test 
compounds may also be obtained by any of the numerous approaches in combinatorial 
library methods known in the art, including: biological libraries; peptoid libraries 
(libraries of molecules having the functionalities of peptides, but with a novel, 
non-peptide backbone which are resistant to enzymatic degradation but which nevertheless 
remain bioactive; see, e.g., Zuckermann et al., 1994, J. Med. Chem. 37:2678-85); 
spatially addressable parallel solid phase or solution phase libraries; synthetic library 
methods requiring deconvolution; the 'one-bead one-compound' library method; and 
synthetic library methods using affinity chromatography selection. The biological 
library and peptoid library approaches are limited to peptide libraries, while the other 
four approaches are applicable to peptide, non-peptide oligomer or small molecule 
libraries of compounds (Lam, 1997, Anticancer Drug Des. 12:145). 

Examples of methods for the synthesis of molecular libraries can be found in the 
art, for example in: DeWitt et al. (1993) Proc. Natl. Acad. Sci. U.S.A. 90:6909; Erb et 
al. (1994) Proc. Natl. Acad. Sci. USA 91:11422; Zuckermann et al. (1994) J. Med. 
Chem. 37:2678; Cho et al. (1993) Science 261:1303; Carrell et al. (1994) Angew. Chem. 
Int. Ed. Engl. 33:2059; Carell et al. (1994) Angew. Chem. Int. Ed. Engl. 33:2061; and in 
Gallop et al. (1994) J. Med. Chem. 37:1233. 

Libraries of compounds may be presented in solution (e.g., Houghten, 1992, 
Biotechniques 13:412-421), or on beads (Lam, 1991, Nature 354:82-84), chips (Fodor, 
1993, Nature 364:555-556), bacteria and/or spores (Ladner, USP 5,223,409), plasmids 
(Cull et al., 1992, Proc. Natl. Acad. Sci. USA 89:1865-1869) or on phage (Scott and Smith, 
1990, Science 249:386-390; Devlin, 1990, Science 249:404-406; Cwirla et al., 1990, 
Proc. Natl. Acad. Sci. 87:6378-6382; Felici, 1991, J. Mol. Biol. 222:301-310; Ladner, 
supra). 

In one embodiment, the invention provides assays for screening candidate or test 
compounds which are substrates of a marker or biologically active portion thereof. In 
another embodiment, the invention provides assays for screening candidate or test 
compounds which bind to a marker or biologically active portion thereof. Determining 
the ability of the test compound to directly bind to a marker can be accomplished, for 
example, by coupling the compound with a radioisotope or enzymatic label such that 
binding of the compound to the marker can be determined by detecting the labeled 
marker compound in a complex. For example, compounds (e.g., marker substrates) can 
be labeled with 125I, 35S, 14C, or 3H, either directly or indirectly, and the radioisotope 
detected by direct counting of radioemission or by scintillation counting. Alternatively, 
assay components can be enzymatically labeled with, for example, horseradish 
peroxidase, alkaline phosphatase, or luciferase, and the enzymatic label detected by 
determination of conversion of an appropriate substrate to product. 

In another embodiment, the invention provides assays for screening candidate or 
test compounds which modulate the activity of a marker or a biologically active portion 
thereof. In all likelihood, the marker can, in vivo, interact with one or more molecules, 
such as but not limited to, peptides, proteins, hormones, cofactors and nucleic acids. For 
the purposes of this discussion, such cellular and extracellular molecules are referred to 
herein as "binding partners" or marker "substrate". 

One necessary embodiment of the invention in order to facilitate such screening 
is the use of the marker to identify its natural in vivo binding partners. There are many 
ways to accomplish this which are known to one skilled in the art. One example is the 
use of the marker protein as "bait protein" in a two-hybrid assay or three-hybrid assay 
(see, e.g., U.S. Patent No. 5,283,317; Zervos et al., 1993, Cell 72:223-232; Madura et al., 
1993, J. Biol. Chem. 268:12046-12054; Bartel et al., 1993, Biotechniques 14:920-924; 
Iwabuchi et al., 1993, Oncogene 8:1693-1696; Brent WO94/10300) in order to identify 
other proteins which bind to or interact with the marker (binding partners) and, 
therefore, are possibly involved in the natural function of the marker. Such marker 
binding partners are also likely to be involved in the propagation of signals by the 
marker or downstream elements of a marker-mediated signaling pathway. Alternatively, 
such marker binding partners may also be found to be inhibitors of the marker. 

The two-hybrid system is based on the modular nature of most transcription 
factors, which consist of separable DNA-binding and activation domains. Briefly, the 
assay utilizes two different DNA constructs. In one construct, the gene that encodes a 
marker protein is fused to a gene encoding the DNA binding domain of a known 
transcription factor (e.g., GAL-4). In the other construct, a DNA sequence, from a 
library of DNA sequences, that encodes an unidentified protein ("prey" or "sample") is 
fused to a gene that codes for the activation domain of the known transcription factor. If 
the "bait" and the "prey" proteins are able to interact, in vivo, forming a 
marker-dependent complex, the DNA-binding and activation domains of the transcription factor 
are brought into close proximity. This proximity allows transcription of a reporter gene 
(e.g., LacZ) which is operably linked to a transcriptional regulatory site responsive to 
the transcription factor. Expression of the reporter gene can be readily detected and cell 
colonies containing the functional transcription factor can be isolated and used to obtain 
the cloned gene which encodes the protein which interacts with the marker protein. 

In a further embodiment, assays may be devised through the use of the invention 
for the purpose of identifying compounds which modulate (e.g., affect either positively 
or negatively) interactions between a marker and its substrates and/or binding partners. 
Such compounds can include, but are not limited to, molecules such as antibodies, 
peptides, hormones, oligonucleotides, nucleic acids, and analogs thereof. Such 
compounds may also be obtained from any available source, including systematic 
libraries of natural and/or synthetic compounds. The preferred assay components for use 
in this embodiment are an ovarian cancer marker identified herein, the known binding 
partner and/or substrate of same, and the test compound. Test compounds can be 
supplied from any source. 

The basic principle of the assay systems used to identify compounds that 
interfere with the interaction between the marker and its binding partner involves 
preparing a reaction mixture containing the marker and its binding partner under 
conditions and for a time sufficient to allow the two products to interact and bind, thus 
forming a complex. In order to test an agent for inhibitory activity, the reaction mixture 
is prepared in the presence and absence of the test compound. The test compound can 
be initially included in the reaction mixture, or can be added at a time subsequent to the 
addition of the marker and its binding partner. Control reaction mixtures are incubated 
without the test compound or with a placebo. The formation of any complexes between 
the marker and its binding partner is then detected. The formation of a complex in the 
control reaction, but less or no such formation in the reaction mixture containing the test 
compound, indicates that the compound interferes with the interaction of the marker and 
its binding partner. Conversely, the formation of more complex in the presence of 
compound than in the control reaction indicates that the compound may enhance 
interaction of the marker and its binding partner. 
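
By way of illustration only, the comparison between the control reaction and the reaction containing the test compound can be sketched in Python as follows. The function name, the background subtraction, and the 20% decision threshold are assumptions made for this sketch and are not specified by the assay description; any readout proportional to the amount of complex formed could be supplied as the signal.

```python
def classify_modulator(control_signal, test_signal, background=0.0, threshold_pct=20.0):
    """Compare complex formation with and without a test compound.

    control_signal / test_signal: readouts proportional to the amount of
    marker/binding-partner complex (hypothetical units, e.g. counts).
    threshold_pct: illustrative cut-off, an assumption of this sketch.
    Returns (percent change relative to control, label).
    """
    control = control_signal - background
    test = test_signal - background
    if control <= 0:
        raise ValueError("control reaction shows no complex above background")
    change_pct = 100.0 * (test - control) / control
    if change_pct <= -threshold_pct:
        label = "interferes"        # less complex than in the control
    elif change_pct >= threshold_pct:
        label = "enhances"          # more complex than in the control
    else:
        label = "no clear effect"
    return change_pct, label


if __name__ == "__main__":
    print(classify_modulator(1000, 250))   # less complex with compound
    print(classify_modulator(1000, 1500))  # more complex with compound
```

In practice the threshold would be chosen from replicate control reactions rather than fixed in advance.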

The assay for compounds that interfere with the interaction of the marker with its 
binding partner may be conducted in a heterogeneous or homogeneous format. 
Heterogeneous assays involve anchoring either the marker or its binding partner onto a 
solid phase and detecting complexes anchored to the solid phase at the end of the 
reaction. In homogeneous assays, the entire reaction is carried out in a liquid phase. In 
either approach, the order of addition of reactants can be varied to obtain different 
information about the compounds being tested. For example, test compounds that 
interfere with the interaction between the markers and the binding partners (e.g., by 
competition) can be identified by conducting the reaction in the presence of the test 
substance, i.e., by adding the test substance to the reaction mixture prior to or 
simultaneously with the marker and its interactive binding partner. Alternatively, test 
compounds that disrupt preformed complexes, e.g., compounds with higher binding 
constants that displace one of the components from the complex, can be tested by adding 
the test compound to the reaction mixture after complexes have been formed. The 
various formats are briefly described below. 

In a heterogeneous assay system, either the marker or its binding partner is 
anchored onto a solid surface or matrix, while the other corresponding non-anchored 
component may be labeled, either directly or indirectly. In practice, microtitre plates are 
often utilized for this approach. The anchored species can be immobilized by a number 
of methods, either non-covalent or covalent, that are typically well known to one who 
practices the art. Non-covalent attachment can often be accomplished simply by coating 
the solid surface with a solution of the marker or its binding partner and drying. 
Alternatively, an immobilized antibody specific for the assay component to be anchored 
can be used for this purpose. Such surfaces can often be prepared in advance and stored. 

In related embodiments, a fusion protein can be provided which adds a domain 
that allows one or both of the assay components to be anchored to a matrix. For 
example, glutathione-S-transferase/marker fusion proteins or 
glutathione-S-transferase/binding partner can be adsorbed onto glutathione sepharose beads (Sigma 
Chemical, St. Louis, MO) or glutathione derivatized microtiter plates, which are then 
combined with the test compound or the test compound and either the non-adsorbed 
marker or its binding partner, and the mixture incubated under conditions conducive to 
complex formation (e.g., physiological conditions). Following incubation, the beads or 
microtiter plate wells are washed to remove any unbound assay components, and the 
immobilized complex is assessed either directly or indirectly, for example, as described 
above. Alternatively, the complexes can be dissociated from the matrix, and the level of 
marker binding or activity determined using standard techniques. 

Other techniques for immobilizing proteins on matrices can also be used in the 
screening assays of the invention. For example, either a marker or a marker binding 
partner can be immobilized utilizing conjugation of biotin and streptavidin. Biotinylated 
marker protein or target molecules can be prepared from biotin-NHS 
(N-hydroxy-succinimide) using techniques known in the art (e.g., biotinylation kit, Pierce Chemicals, 
Rockford, IL), and immobilized in the wells of streptavidin-coated 96 well plates (Pierce 
Chemical). In certain embodiments, the protein-immobilized surfaces can be prepared in 
advance and stored. 

In order to conduct the assay, the corresponding partner of the immobilized assay 
component is exposed to the coated surface with or without the test compound. After 
the reaction is complete, unreacted assay components are removed (e.g., by washing) 
and any complexes formed will remain immobilized on the solid surface. The detection 
of complexes anchored on the solid surface can be accomplished in a number of ways. 
Where the non-immobilized component is pre-labeled, the detection of label 
immobilized on the surface indicates that complexes were formed. Where the 
non-immobilized component is not pre-labeled, an indirect label can be used to detect 
complexes anchored on the surface; e.g., using a labeled antibody specific for the 
initially non-immobilized species (the antibody, in turn, can be directly labeled or 
indirectly labeled with, e.g., a labeled anti-Ig antibody). Depending upon the order of 
addition of reaction components, test compounds which modulate (inhibit or enhance) 
complex formation or which disrupt preformed complexes can be detected. 

In an alternate embodiment of the invention, a homogeneous assay may be used. 
This is typically a reaction, analogous to those mentioned above, which is conducted in 
a liquid phase in the presence or absence of the test compound. The formed complexes 
are then separated from unreacted components, and the amount of complex formed is 
determined. As mentioned for heterogeneous assay systems, the order of addition of 
reactants to the liquid phase can yield information about which test compounds 
modulate (inhibit or enhance) complex formation and which disrupt preformed 
complexes. 

In such a homogeneous assay, the reaction products may be separated from 
unreacted assay components by any of a number of standard techniques, including but 
not limited to: differential centrifugation, chromatography, electrophoresis and 
immunoprecipitation. In differential centrifugation, complexes of molecules may be 
separated from uncomplexed molecules through a series of centrifugal steps, due to the 
different sedimentation equilibria of complexes based on their different sizes and 
densities (see, for example, Rivas, G., and Minton, A.P., Trends Biochem Sci 1993 
Aug;18(8):284-7). Standard chromatographic techniques may also be utilized to separate 
complexed molecules from uncomplexed ones. For example, gel filtration 
chromatography separates molecules based on size, and through the utilization of an 
appropriate gel filtration resin in a column format, for example, the relatively larger 
complex may be separated from the relatively smaller uncomplexed components. 
Similarly, the relatively different charge properties of the complex as compared to the 
uncomplexed molecules may be exploited to differentially separate the complex from 
the remaining individual reactants, for example through the use of ion-exchange 
chromatography resins. Such resins and chromatographic techniques are well known to 
one skilled in the art (see, e.g., Heegaard, 1998, J. Mol. Recognit. 11:141-148; Hage and 
Tweed, 1997, J. Chromatogr. B. Biomed. Sci. Appl., 699:499-525). Gel electrophoresis 
may also be employed to separate complexed molecules from unbound species (see, 
e.g., Ausubel et al. (eds.), In: Current Protocols in Molecular Biology, J. Wiley & Sons, 
New York. 1999). In this technique, protein or nucleic acid complexes are separated 
based on size or charge, for example. In order to maintain the binding interaction during 
the electrophoretic process, nondenaturing gels in the absence of reducing agent are 
typically preferred, but conditions appropriate to the particular interactants will be well 
known to one skilled in the art. Immunoprecipitation is another common technique 
utilized for the isolation of a protein-protein complex from solution (see, e.g., Ausubel et 
al. (eds.), In: Current Protocols in Molecular Biology, J. Wiley & Sons, New York. 
1999). In this technique, all proteins binding to an antibody specific to one of the 
binding molecules are precipitated from solution by conjugating the antibody to a 
polymer bead that may be readily collected by centrifugation. The bound assay 
components are released from the beads (through a specific proteolysis event or other 
technique well known in the art which will not disturb the protein-protein interaction in 
the complex), and a second immunoprecipitation step is performed, this time utilizing 
antibodies specific for the correspondingly different interacting assay component. In this 
manner, only formed complexes should remain attached to the beads. Variations in 
complex formation in both the presence and the absence of a test compound can be 
compared, thus offering information about the ability of the compound to modulate 
interactions between the marker and its binding partner. 

Also within the scope of the present invention are methods for direct detection of 
interactions between the marker and its natural binding partner and/or a test compound 
in a homogeneous or heterogeneous assay system without further sample manipulation. 
For example, the technique of fluorescence energy transfer may be utilized (see, e.g., 
Lakowicz et al., U.S. Patent No. 5,631,169; Stavrianopoulos et al., U.S. Patent No. 
4,868,103). Generally, this technique involves the addition of a fluorophore label on a 
first 'donor' molecule (e.g., marker or test compound) such that its emitted fluorescent 
energy will be absorbed by a fluorescent label on a second, 'acceptor' molecule (e.g., 
marker or test compound), which in turn is able to fluoresce due to the absorbed energy. 
Alternately, the 'donor' protein molecule may simply utilize the natural fluorescent 
energy of tryptophan residues. Labels are chosen that emit different wavelengths of 
light, such that the 'acceptor' molecule label may be differentiated from that of the 
'donor'. Since the efficiency of energy transfer between the labels is related to the 
distance separating the molecules, spatial relationships between the molecules can be 
assessed. In a situation in which binding occurs between the molecules, the fluorescent 
emission of the 'acceptor' molecule label in the assay should be maximal. An FET 
binding event can be conveniently measured through standard fluorometric detection 
means well known in the art (e.g., using a fluorimeter). A test substance which either 
enhances or hinders participation of one of the species in the preformed complex will 
result in the generation of a signal variant to that of background. In this way, test 
substances that modulate interactions between a marker and its binding partner can be 
identified in controlled assays. 
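
The dependence of transfer efficiency on donor-acceptor distance noted above can be illustrated with a short Python sketch. The text states only that efficiency is related to distance; the sketch assumes the commonly used Förster relation, E = 1 / (1 + (r/R0)^6), where R0 is the distance at which transfer is 50% efficient. The function names and the 5 nm value of R0 are illustrative assumptions and are not taken from the cited patents.

```python
def fret_efficiency(distance_nm, forster_radius_nm):
    """Transfer efficiency under the Forster relation (assumed model)."""
    if distance_nm < 0 or forster_radius_nm <= 0:
        raise ValueError("distance must be >= 0 and Forster radius > 0")
    return 1.0 / (1.0 + (distance_nm / forster_radius_nm) ** 6)


def distance_from_efficiency(efficiency, forster_radius_nm):
    """Invert the same relation to estimate donor-acceptor separation."""
    if not 0.0 < efficiency <= 1.0:
        raise ValueError("efficiency must be in (0, 1]")
    return forster_radius_nm * (1.0 / efficiency - 1.0) ** (1.0 / 6.0)


if __name__ == "__main__":
    r0 = 5.0  # hypothetical Forster radius in nanometres
    for r in (2.5, 5.0, 10.0):
        print(f"r = {r:4.1f} nm  ->  E = {fret_efficiency(r, r0):.3f}")
```

Because efficiency falls off with the sixth power of distance under this model, a bound donor-acceptor pair gives a much stronger acceptor signal than an unbound pair, which is consistent with the maximal acceptor emission described above for the bound state.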

In another embodiment, modulators of marker expression are identified in a 
method wherein a cell is contacted with a candidate compound and the expression of 
mRNA or protein, corresponding to a marker in the cell, is determined. The level of 
expression of mRNA or protein in the presence of the candidate compound is compared 
to the level of expression of mRNA or protein in the absence of the candidate 
compound. The candidate compound can then be identified as a modulator of marker 
expression based on this comparison. For example, when expression of marker mRNA 
or protein is greater (statistically significantly greater) in the presence of the candidate 
compound than in its absence, the candidate compound is identified as a stimulator of 
marker mRNA or protein expression. Conversely, when expression of marker mRNA 
or protein is less (statistically significantly less) in the presence of the candidate 
compound than in its absence, the candidate compound is identified as an inhibitor of 
marker mRNA or protein expression. The level of marker mRNA or protein expression 
in the cells can be determined by methods described herein for detecting marker mRNA 
or protein. 
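
The comparison described above can be sketched in Python as follows. No particular statistical test is prescribed here; the sketch assumes an exact two-sided permutation test on the difference of group means and a significance level of 0.05, and the function names and expression values are hypothetical.

```python
from itertools import combinations
from statistics import mean


def permutation_p_value(treated, untreated):
    """Exact two-sided permutation test on the difference of group means."""
    pooled = list(treated) + list(untreated)
    n = len(treated)
    m = len(pooled) - n
    total = sum(pooled)
    observed = abs(mean(treated) - mean(untreated))
    extreme = 0
    count = 0
    # Enumerate every way of relabelling n of the pooled values as "treated".
    for idx in combinations(range(len(pooled)), n):
        group_sum = sum(pooled[i] for i in idx)
        diff = abs(group_sum / n - (total - group_sum) / m)
        if diff >= observed - 1e-9:  # small tolerance for float rounding
            extreme += 1
        count += 1
    return extreme / count


def classify_expression_modulator(treated, untreated, alpha=0.05):
    """Label a candidate compound from marker expression levels."""
    p = permutation_p_value(treated, untreated)
    if p >= alpha:
        return "no significant effect", p
    if mean(treated) > mean(untreated):
        return "stimulator", p
    return "inhibitor", p


if __name__ == "__main__":
    with_compound = [9.1, 9.4, 8.8, 9.6]      # hypothetical expression levels
    without_compound = [4.9, 5.2, 5.0, 5.3]
    print(classify_expression_modulator(with_compound, without_compound))
```

With very small groups an exact permutation test cannot reach conventional significance levels (three replicates per group give a smallest possible two-sided p-value of 0.1), so at least four replicates per condition are needed for this sketch to call any effect at alpha = 0.05.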

In another aspect, the invention pertains to a combination of two or more of the 
assays described herein. For example, a modulating agent can be identified using a 
cell-based or a cell free assay, and the ability of the agent to modulate the activity of a 
marker protein can be further confirmed in vivo, e.g., in a whole animal model for 
cellular transformation and/or tumorigenesis. 

This invention further pertains to novel agents identified by the above-described 
screening assays. Accordingly, it is within the scope of this invention to further use an 
agent identified as described herein in an appropriate animal model. For example, an 
agent identified as described herein (e.g., a marker modulating agent, an antisense 
marker nucleic acid molecule, a marker-specific antibody, or a marker-binding 
partner) can be used in an animal model to determine the efficacy, toxicity, or side 
effects of treatment with such an agent. Alternatively, an agent identified as described 
herein can be used in an animal model to determine the mechanism of action of such an 
agent. Furthermore, this invention pertains to uses of novel agents identified by the 
above-described screening assays for treatments as described herein. 

It is understood that appropriate doses of small molecule agents and protein or 
polypeptide agents depend upon a number of factors within the knowledge of the 
ordinarily skilled physician, veterinarian, or researcher. The dose(s) of these agents will 
vary, for example, depending upon the identity, size, and condition of the subject or 
sample being treated, further depending upon the route by which the composition is to 
be administered, if applicable, and the effect which the practitioner desires the agent to 
have upon the nucleic acid or polypeptide of the invention. Exemplary doses of a small 
molecule include milligram or microgram amounts per kilogram of subject or sample 
weight (e.g., about 1 microgram per kilogram to about 500 milligrams per kilogram, 
about 100 micrograms per kilogram to about 5 milligrams per kilogram, or about 1 
microgram per kilogram to about 50 micrograms per kilogram). Exemplary doses of a 
protein or polypeptide include gram, milligram or microgram amounts per kilogram of 
subject or sample weight (e.g., about 1 microgram per kilogram to about 5 grams per 
kilogram, about 100 micrograms per kilogram to about 500 milligrams per kilogram, or 
about 1 milligram per kilogram to about 50 milligrams per kilogram). It is furthermore 
understood that appropriate doses of one of these agents depend upon the potency of the 
agent with respect to the expression or activity to be modulated. Such appropriate doses 
can be determined using the assays described herein. When one or more of these agents 
is to be administered to an animal (e.g., a human) in order to modulate expression or 
activity of a polypeptide or nucleic acid of the invention, a physician, veterinarian, or 
researcher can, for example, prescribe a relatively low dose at first, subsequently 
increasing the dose until an appropriate response is obtained. In addition, it is 
understood that the specific dose level for any particular animal subject will depend 
upon a variety of factors including the activity of the specific agent employed, the age, 
body weight, general health, gender, and diet of the subject, the time of administration, 
the route of administration, the rate of excretion, any drug combination, and the degree 
of expression or activity to be modulated. 
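
As an arithmetic illustration only, the per-kilogram dose ranges recited above can be converted to a total amount for a subject of a given weight as sketched below in Python. The function names, the 60 kg weight, and the use of micrograms as the common unit are assumptions of this sketch; it is not a dosing recommendation, and the factors listed above would govern any actual dose.

```python
# Conversion factors to micrograms; integers keep the arithmetic exact.
UNIT_TO_UG = {"ug": 1, "mg": 1_000, "g": 1_000_000}


def total_dose_ug(dose_per_kg, unit, weight_kg):
    """Total amount in micrograms for a per-kilogram dose and a body weight."""
    return dose_per_kg * UNIT_TO_UG[unit] * weight_kg


def is_within_range(dose_per_kg, unit, low, high):
    """Check a per-kilogram dose against a (value, unit) range, inclusive."""
    dose = dose_per_kg * UNIT_TO_UG[unit]
    low_ug = low[0] * UNIT_TO_UG[low[1]]
    high_ug = high[0] * UNIT_TO_UG[high[1]]
    return low_ug <= dose <= high_ug


if __name__ == "__main__":
    weight_kg = 60  # hypothetical subject weight
    # Exemplary small-molecule range: 100 micrograms/kg to 5 milligrams/kg.
    print(total_dose_ug(100, "ug", weight_kg), "ug at the low end")
    print(total_dose_ug(5, "mg", weight_kg), "ug at the high end")
    print(is_within_range(2, "mg", (100, "ug"), (5, "mg")))
```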

A pharmaceutical composition of the invention is formulated to be compatible 
with its intended route of administration. Examples of routes of administration include 
parenteral, e.g., intravenous, intradermal, subcutaneous, oral (e.g., inhalation), 
transdermal (topical), transmucosal, and rectal administration. Solutions or suspensions 
used for parenteral, intradermal, or subcutaneous application can include the following 
components: a sterile diluent such as water for injection, saline solution, fixed oils, 
polyethylene glycols, glycerine, propylene glycol or other synthetic solvents; 
antibacterial agents such as benzyl alcohol or methyl parabens; antioxidants such as 
ascorbic acid or sodium bisulfite; chelating agents such as ethylenediamine-tetraacetic 
acid; buffers such as acetates, citrates or phosphates and agents for the adjustment of 
tonicity such as sodium chloride or dextrose. pH can be adjusted with acids or bases, 
such as hydrochloric acid or sodium hydroxide. The parenteral preparation can be 
enclosed in ampules, disposable syringes or multiple dose vials made of glass or plastic. 

Pharmaceutical compositions suitable for injectable use include sterile aqueous 
solutions (where water soluble) or dispersions and sterile powders for the 
extemporaneous preparation of sterile injectable solutions or dispersions. For 
intravenous administration, suitable carriers include physiological saline, bacteriostatic 
water, Cremophor EL (BASF; Parsippany, NJ) or phosphate buffered saline (PBS). In 
all cases, the composition must be sterile and should be fluid to the extent that easy 
syringability exists. It must be stable under the conditions of manufacture and storage 
and must be preserved against the contaminating action of microorganisms such as 
bacteria and fungi. The carrier can be a solvent or dispersion medium containing, for 
example, water, ethanol, polyol (for example, glycerol, propylene glycol, and liquid 
polyethylene glycol, and the like), and suitable mixtures thereof. The proper fluidity can 
be maintained, for example, by the use of a coating such as lecithin, by the maintenance 
of the required particle size in the case of dispersion and by the use of surfactants. 
Prevention of the action of microorganisms can be achieved by various antibacterial and 
antifungal agents, for example, parabens, chlorobutanol, phenol, ascorbic acid, 
thimerosal, and the like. In many cases, it will be preferable to include isotonic agents, 
for example, sugars, polyalcohols such as mannitol, sorbitol, or sodium chloride in the 
composition. Prolonged absorption of the injectable compositions can be brought about 
by including in the composition an agent which delays absorption, for example, 
aluminum monostearate and gelatin. 

Sterile injectable solutions can be prepared by incorporating the active 
compound (e.g., a polypeptide or antibody) in the required amount in an appropriate 
solvent with one or a combination of ingredients enumerated above, as required, 
followed by filtered sterilization. Generally, dispersions are prepared by incorporating 
the active compound into a sterile vehicle which contains a basic dispersion medium, 
and then incorporating the required other ingredients from those enumerated above. In 
the case of sterile powders for the preparation of sterile injectable solutions, the 
preferred methods of preparation are vacuum drying and freeze-drying, which yield a 
powder of the active ingredient plus any additional desired ingredient from a previously 
sterile-filtered solution thereof. 

Oral compositions generally include an inert diluent or an edible carrier. They 
can be enclosed in gelatin capsules or compressed into tablets. For the purpose of oral 
therapeutic administration, the active compound can be incorporated with excipients and 
used in the form of tablets, troches, or capsules. Oral compositions can also be prepared 
using a fluid carrier for use as a mouthwash, wherein the compound in the fluid carrier is 
applied orally and swished and expectorated or swallowed. 

Pharmaceutically compatible binding agents and/or adjuvant materials can be 
included as part of the composition. The tablets, pills, capsules, troches, and the like can 
contain any of the following ingredients, or compounds of a similar nature: a binder 
such as microcrystalline cellulose, gum tragacanth or gelatin; an excipient such as starch 
or lactose; a disintegrating agent such as alginic acid, Primogel, or corn starch; a 
lubricant such as magnesium stearate or Sterotes; a glidant such as colloidal silicon 
dioxide; a sweetening agent such as sucrose or saccharin; or a flavoring agent such as 
peppermint, methyl salicylate, or orange flavoring. 

For administration by inhalation, the compounds are delivered in the form of an 
aerosol spray from a pressurized container or dispenser which contains a suitable 
propellant, e.g., a gas such as carbon dioxide, or a nebulizer. 

Systemic administration can also be by transmucosal or transdermal means. For 
transmucosal or transdermal administration, penetrants appropriate to the barrier to be 
permeated are used in the formulation. Such penetrants are generally known in the art, 
and include, for example, for transmucosal administration, detergents, bile salts, and 
fusidic acid derivatives. Transmucosal administration can be accomplished through the 
use of nasal sprays or suppositories. For transdermal administration, the active 
compounds are formulated into ointments, salves, gels, or creams as generally known in 
the art. 

The compounds can also be prepared in the form of suppositories (e.g., with 
conventional suppository bases such as cocoa butter and other glycerides) or retention 
enemas for rectal delivery. 

In one embodiment, the active compounds are prepared with carriers that will 
protect the compound against rapid elimination from the body, such as a controlled 
release formulation, including implants and microencapsulated delivery systems. 
Biodegradable, biocompatible polymers can be used, such as ethylene vinyl acetate, 
polyanhydrides, polyglycolic acid, collagen, polyorthoesters, and polylactic acid. 
Methods for preparation of such formulations will be apparent to those skilled in the art. 
The materials can also be obtained commercially from Alza Corporation and Nova 
Pharmaceuticals, Inc. Liposomal suspensions (including liposomes having monoclonal 
antibodies incorporated therein or thereon) can also be used as pharmaceutically
acceptable carriers. These can be prepared according to methods known to those skilled
in the art, for example, as described in U.S. Patent No. 4,522,811.

It is especially advantageous to formulate oral or parenteral compositions in 
dosage unit form for ease of administration and uniformity of dosage. Dosage unit form 
as used herein refers to physically discrete units suited as unitary dosages for the subject 
to be treated; each unit containing a predetermined quantity of active compound 
calculated to produce the desired therapeutic effect in association with the required 
pharmaceutical carrier. The specifications for the dosage unit forms of the invention are
dictated by and directly dependent on the unique characteristics of the active compound
and the particular therapeutic effect to be achieved, and the limitations inherent in the art
of compounding such an active compound for the treatment of individuals.

For antibodies, the preferred dosage is 0.1 mg/kg to 100 mg/kg of body weight 
(generally 10 mg/kg to 20 mg/kg). If the antibody is to act in the brain, a dosage of 50
mg/kg to 100 mg/kg is usually appropriate. Generally, partially human antibodies and
fully human antibodies have a longer half-life within the human body than other
antibodies. Accordingly, lower dosages and less frequent administration are often
possible. Modifications such as lipidation can be used to stabilize antibodies and to
enhance uptake and tissue penetration (e.g., into the ovarian epithelium). A method for
lipidation of antibodies is described by Cruikshank et al. (1997) J. Acquired Immune
Deficiency Syndromes and Human Retrovirology 14:193.
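
The weight-based ranges above reduce to simple arithmetic. The following is a minimal
sketch only, assuming a body weight in kilograms and the "generally 10 mg/kg to 20 mg/kg"
range as defaults; the function name and its parameters are hypothetical and are not part
of the specification.

```python
def antibody_dose_range_mg(body_weight_kg, low_mg_per_kg=10.0, high_mg_per_kg=20.0):
    """Return the (low, high) total dose in mg for a weight-based dosage range.

    The default bounds mirror the 'generally 10 mg/kg to 20 mg/kg' range in the
    text; any other bounds are supplied by the caller (illustrative only).
    """
    if body_weight_kg <= 0:
        raise ValueError("body weight must be positive")
    if not 0 < low_mg_per_kg <= high_mg_per_kg:
        raise ValueError("dosage bounds must satisfy 0 < low <= high")
    return body_weight_kg * low_mg_per_kg, body_weight_kg * high_mg_per_kg


if __name__ == "__main__":
    # General range for a hypothetical 60 kg subject.
    print(antibody_dose_range_mg(60))
    # Range stated for an antibody that is to act in the brain (50 to 100 mg/kg).
    print(antibody_dose_range_mg(60, 50.0, 100.0))
```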

The nucleic acid molecules corresponding to a marker of the invention can be 
inserted into vectors and used as gene therapy vectors. Gene therapy vectors can be 
delivered to a subject by, for example, intravenous injection, local administration (U.S.
Patent 5,328,470), or by stereotactic injection (see, e.g., Chen et al., 1994, Proc. Natl.
Acad. Sci. USA 91:3054-3057). The pharmaceutical preparation of the gene therapy
vector can include the gene therapy vector in an acceptable diluent, or can comprise a
slow release matrix in which the gene delivery vehicle is embedded. Alternatively,
where the complete gene delivery vector can be produced intact from recombinant cells,
e.g., retroviral vectors, the pharmaceutical preparation can include one or more cells
which produce the gene delivery system. 

The pharmaceutical compositions can be included in a container, pack, or 
dispenser together with instructions for administration. 

V. Predictive Medicine

The present invention pertains to the field of predictive medicine in which
diagnostic assays, prognostic assays, pharmacogenomics, and monitoring of clinical trials
are used for prognostic (predictive) purposes to thereby treat an individual
prophylactically. Accordingly, one aspect of the present invention relates to diagnostic
assays for determining the level of expression of polypeptides or nucleic acids
corresponding to one or more markers of the invention, in order to determine whether
an individual is at risk of developing ovarian cancer. Such assays can be used for
prognostic or predictive purposes to thereby prophylactically treat an individual prior to
the onset of the cancer.

Yet another aspect of the invention pertains to monitoring the influence of agents 
(e.g., drugs or other compounds administered either to inhibit ovarian cancer or to treat
or prevent any other disorder (i.e., in order to understand any ovarian carcinogenic
effects that such treatment may have)) on the expression or activity of a marker of the
invention in clinical trials. These and other agents are described in further detail in the 
following sections. 

A. Diagnostic Assays

An exemplary method for detecting the presence or absence of a polypeptide or
nucleic acid corresponding to a marker of the invention in a biological sample involves
obtaining a biological sample (e.g., an ovary-associated body fluid) from a test subject
and contacting the biological sample with a compound or an agent capable of detecting
the polypeptide or nucleic acid (e.g., mRNA, genomic DNA, or cDNA). The detection
methods of the invention can thus be used to detect mRNA, protein, cDNA, or genomic
DNA, for example, in a biological sample in vitro as well as in vivo. For example, in
vitro techniques for detection of mRNA include Northern hybridizations and in situ
hybridizations. In vitro techniques for detection of a polypeptide corresponding to a
marker of the invention include enzyme linked immunosorbent assays (ELISAs),
Western blots, immunoprecipitations and immunofluorescence. In vitro techniques for
detection of genomic DNA include Southern hybridizations. Furthermore, in vivo
techniques for detection of a polypeptide corresponding to a marker of the invention
include introducing into a subject a labeled antibody directed against the polypeptide.
For example, the antibody can be labeled with a radioactive marker whose presence and
location in a subject can be detected by standard imaging techniques.

A general principle of such diagnostic and prognostic assays involves preparing 
a sample or reaction mixture that may contain a marker, and a probe, under appropriate 
conditions and for a time sufficient to allow the marker and probe to interact and bind,
thus forming a complex that can be removed and/or detected in the reaction mixture.
These assays can be conducted in a variety of ways. 

For example, one method to conduct such an assay would involve anchoring the
marker or probe onto a solid phase support, also referred to as a substrate, and detecting
target marker/probe complexes anchored on the solid phase at the end of the reaction.
In one embodiment of such a method, a sample from a subject, which is to be assayed
for presence and/or concentration of marker, can be anchored onto a carrier or solid
phase support. In another embodiment, the reverse situation is possible, in which the 
probe can be anchored to a solid phase and a sample from a subject can be allowed to 
react as an unanchored component of the assay. 

There are many established methods for anchoring assay components to a solid
phase. These include, without limitation, marker or probe molecules which are
immobilized through conjugation of biotin and streptavidin. Such biotinylated assay
components can be prepared from biotin-NHS (N-hydroxy-succinimide) using
techniques known in the art (e.g., biotinylation kit, Pierce Chemical, Rockford, IL), and
immobilized in the wells of streptavidin-coated 96 well plates (Pierce Chemical). In
certain embodiments, the surfaces with immobilized assay components can be prepared
in advance and stored.

Other suitable carriers or solid phase supports for such assays include any 
material capable of binding the class of molecule to which the marker or probe belongs. 
Well-known supports or carriers include, but are not limited to, glass, polystyrene, 
nylon, polypropylene, polyethylene, dextran, amylases, natural and modified
celluloses, polyacrylamides, gabbros, and magnetite. 

In order to conduct assays with the above mentioned approaches, the
non-immobilized component is added to the solid phase upon which the second component
is anchored. After the reaction is complete, uncomplexed components may be removed
(e.g., by washing) under conditions such that any complexes formed will remain
immobilized upon the solid phase. The detection of marker/probe complexes anchored
to the solid phase can be accomplished by a number of methods outlined herein.

In a preferred embodiment, the probe, when it is the unanchored assay 
component, can be labeled for the purpose of detection and readout of the assay, either 
directly or indirectly, with detectable labels discussed herein and which are well-known
to one skilled in the art. 

It is also possible to directly detect marker/probe complex formation without
further manipulation or labeling of either component (marker or probe), for example by
utilizing the technique of fluorescence energy transfer (see, for example, Lakowicz et
al., U.S. Patent No. 5,631,169; Stavrianopoulos, et al., U.S. Patent No. 4,868,103). A
fluorophore label on the first, 'donor' molecule is selected such that, upon excitation
with incident light of appropriate wavelength, its emitted fluorescent energy will be
absorbed by a fluorescent label on a second 'acceptor' molecule, which in turn is able to
fluoresce due to the absorbed energy. Alternatively, the 'donor' protein molecule may
simply utilize the natural fluorescent energy of tryptophan residues. Labels are chosen
that emit different wavelengths of light, such that the 'acceptor' molecule label may be
differentiated from that of the 'donor'. Since the efficiency of energy transfer between
the labels is related to the distance separating the molecules, spatial relationships
between the molecules can be assessed. In a situation in which binding occurs between
the molecules, the fluorescent emission of the 'acceptor' molecule label in the assay
should be maximal. An FET binding event can be conveniently measured through
standard fluorometric detection means well known in the art (e.g., using a fluorimeter).
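
The dependence of transfer efficiency on donor/acceptor separation can be illustrated
with the standard Förster relation, E = 1 / (1 + (r/R0)^6), which the text does not state
explicitly and which is introduced here as an assumption. In the sketch below, the
Förster radius R0 (the separation at which efficiency is 50%) is a hypothetical input
that depends on the particular label pair, and the function names are illustrative.

```python
def fret_efficiency(distance_nm, forster_radius_nm):
    """Energy transfer efficiency at a given donor/acceptor separation.

    Uses the Foerster relation E = 1 / (1 + (r / R0)**6); R0 is specific to
    the chosen donor/acceptor label pair (assumed known by the caller).
    """
    if distance_nm < 0 or forster_radius_nm <= 0:
        raise ValueError("distance must be >= 0 and Forster radius must be > 0")
    return 1.0 / (1.0 + (distance_nm / forster_radius_nm) ** 6)


def distance_from_efficiency(efficiency, forster_radius_nm):
    """Invert the Foerster relation to estimate separation from efficiency."""
    if not 0.0 < efficiency < 1.0:
        raise ValueError("efficiency must lie strictly between 0 and 1")
    return forster_radius_nm * (1.0 / efficiency - 1.0) ** (1.0 / 6.0)


if __name__ == "__main__":
    r0 = 5.0  # hypothetical Forster radius in nm
    for r in (2.5, 5.0, 10.0):
        print(f"r = {r:4.1f} nm -> E = {fret_efficiency(r, r0):.3f}")
```

Because efficiency falls off with the sixth power of distance, acceptor emission is
high only when the labeled molecules are in close proximity, which is consistent with
acceptor emission being maximal when marker and probe are bound.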

In another embodiment, determination of the ability of a probe to recognize a
marker can be accomplished without labeling either assay component (probe or marker)
by utilizing a technology such as real-time Biomolecular Interaction Analysis (BIA)
(see, e.g., Sjolander, S. and Urbaniczky, C., 1991, Anal. Chem. 63:2338-2345 and
Szabo et al., 1995, Curr. Opin. Struct. Biol. 5:699-705). As used herein, "BIA" or
"surface plasmon resonance" is a technology for studying biospecific interactions in real
time, without labeling any of the interactants (e.g., BIAcore). Changes in the mass at the
binding surface (indicative of a binding event) result in alterations of the refractive index
of light near the surface (the optical phenomenon of surface plasmon resonance (SPR)),
resulting in a detectable signal which can be used as an indication of real-time reactions
between biological molecules.

Alternatively, in another embodiment, analogous diagnostic and prognostic
assays can be conducted with marker and probe as solutes in a liquid phase. In such an
assay, the complexed marker and probe are separated from uncomplexed components by
any of a number of standard techniques, including but not limited to: differential
centrifugation, chromatography, electrophoresis and immunoprecipitation. In
differential centrifugation, marker/probe complexes may be separated from
uncomplexed assay components through a series of centrifugal steps, due to the different
sedimentation equilibria of complexes based on their different sizes and densities (see,
for example, Rivas, G., and Minton, A.P., 1993, Trends Biochem Sci. 18(8):284-7).
Standard chromatographic techniques may also be utilized to separate complexed
molecules from uncomplexed ones. For example, gel filtration chromatography
separates molecules based on size, and through the utilization of an appropriate gel
filtration resin in a column format, for example, the relatively larger complex may be
separated from the relatively smaller uncomplexed components. Similarly, the
relatively different charge properties of the marker/probe complex as compared to the
uncomplexed components may be exploited to differentiate the complex from
uncomplexed components, for example through the utilization of ion-exchange
chromatography resins. Such resins and chromatographic techniques are well known to
one skilled in the art (see, e.g., Heegaard, N.H., 1998, J. Mol. Recognit. Winter
11(1-6):141-8; Hage, D.S., and Tweed, S.A., J. Chromatogr. B Biomed. Sci. Appl. 1997 Oct
10;699(1-2):499-525). Gel electrophoresis may also be employed to separate
complexed assay components from unbound components (see, e.g., Ausubel et al., ed.,
Current Protocols in Molecular Biology, John Wiley & Sons, New York, 1987-1999).
In this technique, protein or nucleic acid complexes are separated based on size or
charge, for example. In order to maintain the binding interaction during the
electrophoretic process, non-denaturing gel matrix materials and conditions in the
absence of reducing agent are typically preferred. Appropriate conditions for the
particular assay and components thereof will be well known to one skilled in the art.

In a particular embodiment, the level of mRNA corresponding to the marker can
be determined both by in situ and by in vitro formats in a biological sample using
methods known in the art. The term "biological sample" is intended to include tissues,
cells, biological fluids and isolates thereof, isolated from a subject, as well as tissues,
cells and fluids present within a subject. Many expression detection methods use
isolated RNA. For in vitro methods, any RNA isolation technique that does not select
against the isolation of mRNA can be utilized for the purification of RNA from ovarian
cells (see, e.g., Ausubel et al., ed., Current Protocols in Molecular Biology, John Wiley
& Sons, New York 1987-1999). Additionally, large numbers of tissue samples can
readily be processed using techniques well known to those of skill in the art, such as, for
example, the single-step RNA isolation process of Chomczynski (1989, U.S. Patent No.
4,843,155).

The isolated mRNA can be used in hybridization or amplification assays that
include, but are not limited to, Southern or Northern analyses, polymerase chain reaction
analyses and probe arrays. One preferred diagnostic method for the detection of mRNA
levels involves contacting the isolated mRNA with a nucleic acid molecule (probe) that
can hybridize to the mRNA encoded by the gene being detected. The nucleic acid probe
can be, for example, a full-length cDNA, or a portion thereof, such as an oligonucleotide
of at least 7, 15, 30, 50, 100, 250 or 500 nucleotides in length and sufficient to
specifically hybridize under stringent conditions to an mRNA or genomic DNA encoding
a marker of the present invention. Other suitable probes for use in the diagnostic assays
of the invention are described herein. Hybridization of an mRNA with the probe
indicates that the marker in question is being expressed.

In one format, the mRNA is immobilized on a solid surface and contacted with a
probe, for example by running the isolated mRNA on an agarose gel and transferring the
mRNA from the gel to a membrane, such as nitrocellulose. In an alternative format, the
probe(s) are immobilized on a solid surface and the mRNA is contacted with the
probe(s), for example, in an Affymetrix gene chip array. A skilled artisan can readily
adapt known mRNA detection methods for use in detecting the level of mRNA encoded
by the markers of the present invention.

An alternative method for determining the level of mRNA corresponding to a
marker of the present invention in a sample involves the process of nucleic acid
amplification, e.g., by rtPCR (the experimental embodiment set forth in Mullis, 1987,
U.S. Patent No. 4,683,202), ligase chain reaction (Barany, 1991, Proc. Natl. Acad. Sci.
USA, 88:189-193), self sustained sequence replication (Guatelli et al., 1990, Proc. Natl.
Acad. Sci. USA 87:1874-1878), transcriptional amplification system (Kwoh et al., 1989,
Proc. Natl. Acad. Sci. USA 86:1173-1177), Q-Beta Replicase (Lizardi et al., 1988,
Bio/Technology 6:1197), rolling circle replication (Lizardi et al., U.S. Patent No.
5,854,033) or any other nucleic acid amplification method, followed by the detection of
the amplified molecules using techniques well known to those of skill in the art. These
detection schemes are especially useful for the detection of nucleic acid molecules if
such molecules are present in very low numbers. As used herein, amplification primers
are defined as being a pair of nucleic acid molecules that can anneal to 5' or 3' regions
of a gene (plus and minus strands, respectively, or vice-versa) and contain a short region
in between. In general, amplification primers are from about 10 to 30 nucleotides in
length and flank a region from about 50 to 200 nucleotides in length. Under appropriate
conditions and with appropriate reagents, such primers permit the amplification of a
nucleic acid molecule comprising the nucleotide sequence flanked by the primers.
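
The primer definition above can be sketched as a simple check. This is an
illustrative sketch only, assuming that the template is given as the plus strand in
5' to 3' orientation, that the forward primer matches the plus strand exactly, and that
the reverse primer is the exact reverse complement of a downstream plus-strand site.
The function names are hypothetical, and practical primer design considers further
factors (e.g., melting temperature and mismatches) that are not modeled here.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def flanked_region(template, forward, reverse):
    """Return the plus-strand sequence lying between the two primer sites.

    Returns None when either primer site cannot be located on the template.
    """
    template = template.upper()
    fwd_start = template.find(forward.upper())
    if fwd_start < 0:
        return None
    fwd_end = fwd_start + len(forward)
    # The reverse primer anneals to the plus strand, so its site on the plus
    # strand reads as the reverse complement of the primer.
    rev_start = template.find(reverse_complement(reverse), fwd_end)
    if rev_start < 0:
        return None
    return template[fwd_end:rev_start]


def primers_in_typical_range(template, forward, reverse,
                             primer_len=(10, 30), flank_len=(50, 200)):
    """Check the 'about 10 to 30' and 'about 50 to 200' nucleotide ranges."""
    region = flanked_region(template, forward, reverse)
    if region is None:
        return False
    lengths_ok = all(primer_len[0] <= len(p) <= primer_len[1]
                     for p in (forward, reverse))
    return lengths_ok and flank_len[0] <= len(region) <= flank_len[1]
```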

For in situ methods, mRNA does not need to be isolated from the ovarian cells
prior to detection. In such methods, a cell or tissue sample is prepared/processed using
known histological methods. The sample is then immobilized on a support, typically a
glass slide, and then contacted with a probe that can hybridize to mRNA that encodes
the marker.

As an alternative to making determinations based on the absolute expression
level of the marker, determinations may be based on the normalized expression level of
the marker. Expression levels are normalized by correcting the absolute expression
level of a marker by comparing its expression to the expression of a gene that is not a
marker, e.g., a housekeeping gene that is constitutively expressed. Suitable genes for
normalization include housekeeping genes such as the actin gene, or epithelial
cell-specific genes. This normalization allows the comparison of the expression level in one
sample, e.g., a patient sample, to another sample, e.g., a non-ovarian cancer sample, or
between samples from different sources.
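
One common way to carry out such a correction is to express the marker level as a ratio
to the level of the reference gene measured in the same sample. The text does not
prescribe a particular formula, so the ratio used in this sketch is an assumption, and
the function name and the numeric values are hypothetical.

```python
def normalized_expression(marker_level, reference_level):
    """Normalize a marker's absolute expression to a reference gene.

    Assumes a simple ratio (marker / reference), with both levels measured in
    the same sample and in the same units; the reference would be, e.g., a
    constitutively expressed housekeeping gene such as actin.
    """
    if marker_level < 0:
        raise ValueError("marker level must be non-negative")
    if reference_level <= 0:
        raise ValueError("reference level must be positive")
    return marker_level / reference_level


if __name__ == "__main__":
    # Hypothetical readings: the patient sample yielded more signal overall,
    # but after normalization both samples can be compared on the same scale.
    patient = normalized_expression(marker_level=200.0, reference_level=100.0)
    control = normalized_expression(marker_level=30.0, reference_level=60.0)
    print(patient, control)
```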

Alternatively, the expression level can be provided as a relative expression level.
To determine a relative expression level of a marker, the level of expression of the
marker is determined for 10 or more samples of normal versus cancer cell isolates,
preferably 50 or more samples, prior to the determination of the expression level for the
sample in question. The mean expression level of each of the genes assayed in the
larger number of samples is determined and this is used as a baseline expression level
for the marker. The expression level of the marker determined for the test sample
(absolute level of expression) is then divided by the mean expression value obtained for
that marker. This provides a relative expression level.
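
The calculation just described can be sketched as follows. The minimum of 10 baseline
samples follows the text; the function names and the example values are hypothetical
and are shown only to make the arithmetic concrete.

```python
from statistics import mean


def baseline_expression(baseline_levels, min_samples=10):
    """Mean expression of a marker across the baseline samples."""
    if len(baseline_levels) < min_samples:
        raise ValueError(f"at least {min_samples} baseline samples are required")
    return mean(baseline_levels)


def relative_expression(test_level, baseline_levels, min_samples=10):
    """Divide the test sample's absolute level by the baseline mean."""
    baseline = baseline_expression(baseline_levels, min_samples)
    if baseline <= 0:
        raise ValueError("baseline mean must be positive")
    return test_level / baseline


if __name__ == "__main__":
    # Ten hypothetical baseline measurements for one marker.
    baseline = [8.0, 9.5, 10.0, 10.5, 11.0, 9.0, 10.0, 12.0, 10.0, 10.0]
    print(relative_expression(25.0, baseline))
```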

Preferably, the samples used in the baseline determination will be from ovarian
cancer or from non-ovarian cancer cells of ovarian tissue. The choice of the cell source
is dependent on the use of the relative expression level. Using expression found in
normal tissues as a mean expression score aids in validating whether the marker assayed
is ovarian specific (versus normal cells). In addition, as more data is accumulated, the
mean expression value can be revised, providing improved relative expression values
based on accumulated data. Expression data from ovarian cells provides a means for
grading the severity of the ovarian cancer state.

In another embodiment of the present invention, a polypeptide corresponding to
a marker is detected. A preferred agent for detecting a polypeptide of the invention is an
antibody capable of binding to a polypeptide corresponding to a marker of the invention,
preferably an antibody with a detectable label. Antibodies can be polyclonal, or more
preferably, monoclonal. An intact antibody, or a fragment thereof (e.g., Fab or F(ab')2)
can be used. The term "labeled", with regard to the probe or antibody, is intended to
encompass direct labeling of the probe or antibody by coupling (i.e., physically linking)
a detectable substance to the probe or antibody, as well as indirect labeling of the probe
or antibody by reactivity with another reagent that is directly labeled. Examples of
indirect labeling include detection of a primary antibody using a fluorescently labeled
secondary antibody and end-labeling of a DNA probe with biotin such that it can be
detected with fluorescently labeled streptavidin.

Proteins from ovarian cells can be isolated using techniques that are well known 
to those of skill in the art. The protein isolation methods employed can, for example, be 
such as those described in Harlow and Lane (Harlow and Lane, 1988, Antibodies: A 
Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New 
York).

A variety of formats can be employed to determine whether a sample contains a 
protein that binds to a given antibody. Examples of such formats include, but are not 
limited to, enzyme immunoassay (EIA), radioimmunoassay (RIA), Western blot 
analysis and enzyme linked immunosorbent assay (ELISA). A skilled artisan can
readily adapt known protein/antibody detection methods for use in determining whether
ovarian cells express a marker of the present invention. 

In one format, antibodies, or antibody fragments, can be used in methods such as
Western blots or immunofluorescence techniques to detect the expressed proteins. In
such uses, it is generally preferable to immobilize either the antibody or proteins on a
solid support. Suitable solid phase supports or carriers include any support capable of
binding an antigen or an antibody. Well-known supports or carriers include glass,
polystyrene, polypropylene, polyethylene, dextran, nylon, amylases, natural and
modified celluloses, polyacrylamides, gabbros, and magnetite.

One skilled in the art will know many other suitable carriers for binding antibody
or antigen, and will be able to adapt such support for use with the present invention. For
example, protein isolated from ovarian cells can be separated by polyacrylamide gel
electrophoresis and immobilized onto a solid phase support such as nitrocellulose. The
support can then be washed with suitable buffers followed by treatment with the
detectably labeled antibody. The solid phase support can then be washed with the buffer
a second time to remove unbound antibody. The amount of bound label on the solid
support can then be detected by conventional means.

The invention also encompasses kits for detecting the presence of a polypeptide
or nucleic acid corresponding to a marker of the invention in a biological sample (e.g., an
ovary-associated body fluid such as a urine sample). Such kits can be used to determine
if a subject is suffering from or is at increased risk of developing ovarian cancer. For
example, the kit can comprise a labeled compound or agent capable of detecting a
polypeptide or an mRNA encoding a polypeptide corresponding to a marker of the
invention in a biological sample and means for determining the amount of the
polypeptide or mRNA in the sample (e.g., an antibody which binds the polypeptide or an
oligonucleotide probe which binds to DNA or mRNA encoding the polypeptide). Kits
can also include instructions for interpreting the results obtained using the kit.

For antibody-based kits, the kit can comprise, for example: (1) a first antibody 
(e.g., attached to a solid support) which binds to a polypeptide corresponding to a 
marker of the invention; and, optionally, (2) a second, different antibody which binds to 
either the polypeptide or the first antibody and is conjugated to a detectable label. 

For oligonucleotide-based kits, the kit can comprise, for example: (1) an
oligonucleotide, e.g., a detectably labeled oligonucleotide, which hybridizes to a nucleic
acid sequence encoding a polypeptide corresponding to a marker of the invention or (2)
a pair of primers useful for amplifying a nucleic acid molecule corresponding to a
marker of the invention. The kit can also comprise, e.g., a buffering agent, a
preservative, or a protein stabilizing agent. The kit can further comprise components
necessary for detecting the detectable label (e.g., an enzyme or a substrate). The kit can
also contain a control sample or a series of control samples which can be assayed and
compared to the test sample. Each component of the kit can be enclosed within an
individual container and all of the various containers can be within a single package,
along with instructions for interpreting the results of the assays performed using the kit.

B. Pharmacogenomics

Agents or modulators which have a stimulatory or inhibitory effect on expression
of a marker of the invention can be administered to individuals to treat (prophylactically
or therapeutically) ovarian cancer in the patient. In conjunction with such treatment, the
pharmacogenomics (i.e., the study of the relationship between an individual's genotype
and that individual's response to a foreign compound or drug) of the individual may be
considered. Differences in metabolism of therapeutics can lead to severe toxicity or
therapeutic failure by altering the relation between dose and blood concentration of the
pharmacologically active drug. Thus, the pharmacogenomics of the individual permits
the selection of effective agents (e.g., drugs) for prophylactic or therapeutic treatments
based on a consideration of the individual's genotype. Such pharmacogenomics can
further be used to determine appropriate dosages and therapeutic regimens.
Accordingly, the level of expression of a marker of the invention in an individual can be
determined to thereby select appropriate agent(s) for therapeutic or prophylactic
treatment of the individual.

Pharmacogenomics deals with clinically significant variations in the response to
drugs due to altered drug disposition and abnormal action in affected persons. See, e.g.,
Linder (1997) Clin. Chem. 43(2):254-266. In general, two types of pharmacogenetic
conditions can be differentiated. Genetic conditions transmitted as a single factor
altering the way drugs act on the body are referred to as "altered drug action". Genetic
conditions transmitted as a single factor altering the way the body acts on drugs are
referred to as "altered drug metabolism". These pharmacogenetic conditions can occur
either as rare defects or as polymorphisms. For example, glucose-6-phosphate
dehydrogenase (G6PD) deficiency is a common inherited enzymopathy in which the
main clinical complication is hemolysis after ingestion of oxidant drugs (anti-malarials,
sulfonamides, analgesics, nitrofurans) and consumption of fava beans.

As an illustrative embodiment, the activity of drug metabolizing enzymes is a
major determinant of both the intensity and duration of drug action. The discovery of
genetic polymorphisms of drug metabolizing enzymes (e.g., N-acetyltransferase 2 (NAT
2) and cytochrome P450 enzymes CYP2D6 and CYP2C19) has provided an explanation
as to why some patients do not obtain the expected drug effects or show exaggerated
drug response and serious toxicity after taking the standard and safe dose of a drug.
These polymorphisms are expressed in two phenotypes in the population, the extensive
metabolizer (EM) and poor metabolizer (PM). The prevalence of PM is different among
different populations. For example, the gene coding for CYP2D6 is highly polymorphic
and several mutations have been identified in PM, which all lead to the absence of
functional CYP2D6. Poor metabolizers of CYP2D6 and CYP2C19 quite frequently
experience exaggerated drug response and side effects when they receive standard
doses. If a metabolite is the active therapeutic moiety, a PM will show no therapeutic
response, as demonstrated for the analgesic effect of codeine mediated by its
CYP2D6-formed metabolite morphine. At the other extreme are the so-called ultra-rapid
metabolizers who do not respond to standard doses. Recently, the molecular basis of
ultra-rapid metabolism has been identified to be due to CYP2D6 gene amplification.

Thus, the level of expression of a marker of the invention in an individual can be
determined to thereby select appropriate agent(s) for therapeutic or prophylactic
treatment of the individual. In addition, pharmacogenetic studies can be used to apply
genotyping of polymorphic alleles encoding drug-metabolizing enzymes to the
identification of an individual's drug responsiveness phenotype. This knowledge, when
applied to dosing or drug selection, can avoid adverse reactions or therapeutic failure
and thus enhance therapeutic or prophylactic efficiency when treating a subject with a
modulator of expression of a marker of the invention.






C. Monitoring Clinical Trials 

Monitoring the influence of agents (e.g., drug compounds) on the level of
expression of a marker of the invention can be applied not only in basic drug screening,
but also in clinical trials. For example, the effectiveness of an agent to affect marker
expression can be monitored in clinical trials of subjects receiving treatment for ovarian
cancer. In a preferred embodiment, the present invention provides a method for
monitoring the effectiveness of treatment of a subject with an agent (e.g., an agonist,
antagonist, peptidomimetic, protein, peptide, nucleic acid, small molecule, or other drug
candidate) comprising the steps of (i) obtaining a pre-administration sample from a
subject prior to administration of the agent; (ii) detecting the level of expression of one
or more selected markers of the invention in the pre-administration sample; (iii)
obtaining one or more post-administration samples from the subject; (iv) detecting the
level of expression of the marker(s) in the post-administration samples; (v) comparing
the level of expression of the marker(s) in the pre-administration sample with the level
of expression of the marker(s) in the post-administration sample or samples; and (vi)
altering the administration of the agent to the subject accordingly. For example,
increased administration of the agent can be desirable to increase expression of the
marker(s) to higher levels than detected, i.e., to increase the effectiveness of the agent.
Alternatively, decreased administration of the agent can be desirable to decrease
expression of the marker(s) to lower levels than detected, i.e., to decrease the
effectiveness of the agent.
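
The comparison in step (v) can be sketched as a fold change between the
pre-administration level and the mean of the post-administration levels. This is an
illustrative sketch only: the use of a mean, the tolerance band for calling a level
"unchanged", and the function name are all assumptions, and the decision in step (vi)
is deliberately left outside the code.

```python
from statistics import mean


def compare_marker_expression(pre_level, post_levels, tolerance=0.1):
    """Compare pre- and post-administration expression of one marker.

    Returns (fold_change, trend), where fold_change is the mean
    post-administration level divided by the pre-administration level and
    trend is 'increased', 'decreased', or 'unchanged'. The tolerance band
    around 1.0 is a hypothetical parameter, not taken from the text.
    """
    if pre_level <= 0:
        raise ValueError("pre-administration level must be positive")
    if not post_levels:
        raise ValueError("at least one post-administration level is required")
    fold_change = mean(post_levels) / pre_level
    if fold_change > 1.0 + tolerance:
        trend = "increased"
    elif fold_change < 1.0 - tolerance:
        trend = "decreased"
    else:
        trend = "unchanged"
    return fold_change, trend


if __name__ == "__main__":
    # Hypothetical levels for one marker in one subject.
    print(compare_marker_expression(100.0, [150.0, 170.0]))
```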

D. Electronic Apparatus Readable Media and Arrays

Electronic apparatus readable media comprising a marker of the present
invention are also provided. As used herein, "electronic apparatus readable media" refers
to any suitable medium for storing, holding or containing data or information that can be
read and accessed directly by an electronic apparatus. Such media can include, but are
not limited to: magnetic storage media, such as floppy discs, hard disc storage medium,
and magnetic tape; optical storage media such as compact disc; electronic storage media
such as RAM, ROM, EPROM, EEPROM and the like; general hard disks and hybrids of
these categories such as magnetic/optical storage media. The medium is adapted or
configured for having recorded thereon a marker of the present invention.

As used herein, the term "electronic apparatus" is intended to include any
suitable computing or processing apparatus or other device configured or adapted for
storing data or information. Examples of electronic apparatus suitable for use with the
present invention include stand-alone computing apparatus; networks, including a local
area network (LAN), a wide area network (WAN), the Internet, an intranet, and an
extranet; electronic appliances such as personal digital assistants (PDAs), cellular
phones, pagers and the like; and local and distributed processing systems.

As used herein, "recorded" refers to a process for storing or encoding
information on the electronic apparatus readable medium. Those skilled in the art can
readily adopt any of the presently known methods for recording information on known
media to generate manufactures comprising the markers of the present invention.

A variety of software programs and formats can be used to store the marker
information of the present invention on the electronic apparatus readable medium. For
example, the nucleic acid sequence corresponding to the markers can be represented in a
word processing text file, formatted in commercially-available software such as
WordPerfect and Microsoft Word, or represented in the form of an ASCII file, stored in
a database application, such as DB2, Sybase, Oracle, or the like, as well as in other
forms. Any number of data processor structuring formats (e.g., text file or database) may
be employed in order to obtain or create a medium having recorded thereon the
markers of the present invention.
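
By way of illustration only, the two storage formats mentioned above (an ASCII text file and a database) might be sketched as follows. The record identifiers and sequences are invented placeholders and are not sequences from the Tables; the FASTA-like file layout is an assumption; and SQLite is used here merely as a readily available stand-in for the database applications named above.

```python
import sqlite3

# Placeholder records: identifiers and sequences are invented for illustration
# and are NOT sequences from Tables 1-2.
RECORDS = [("SEQ_ID_EXAMPLE_1", "ATGGCCTTAGGC"),
           ("SEQ_ID_EXAMPLE_2", "GGCTTAACCGTA")]


def write_ascii(records, path):
    """Record sequences in a plain ASCII text file (FASTA-like layout)."""
    with open(path, "w", encoding="ascii") as handle:
        for seq_id, sequence in records:
            handle.write(f">{seq_id}\n{sequence}\n")


def read_ascii(path):
    """Read back the (identifier, sequence) pairs written by write_ascii."""
    with open(path, encoding="ascii") as handle:
        lines = [line.strip() for line in handle if line.strip()]
    # Header lines and sequence lines alternate in this simple layout.
    return [(header.lstrip(">"), sequence)
            for header, sequence in zip(lines[0::2], lines[1::2])]


def store_in_database(records, connection):
    """Record the same sequences in a relational database table."""
    connection.execute(
        "CREATE TABLE IF NOT EXISTS markers "
        "(seq_id TEXT PRIMARY KEY, sequence TEXT NOT NULL)"
    )
    connection.executemany(
        "INSERT OR REPLACE INTO markers VALUES (?, ?)", records
    )
    connection.commit()
```

Either form leaves the marker information recorded on the medium in a manner that can later be read back by an electronic apparatus.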

By providing the markers of the invention in readable form, one can routinely
access the marker sequence information for a variety of purposes. For example, one
skilled in the art can use the nucleotide or amino acid sequences of the present invention
in readable form to compare a target sequence or target structural motif with the
sequence information stored within the data storage means. Search means are used to
identify fragments or regions of the sequences of the invention which match a particular
target sequence or target motif.
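
One possible form of such a search is sketched below, assuming the stored sequences are held in memory as a mapping from identifier to sequence string. The function names `find_matches` and `find_motif` are hypothetical; exact substring matching and regular-expression matching are simple stand-ins for whatever search means is actually employed.

```python
import re


def find_matches(stored, target):
    """Return (seq_id, start) for every exact occurrence of target,
    including overlapping occurrences, in the stored sequences."""
    if not target:
        raise ValueError("target sequence must not be empty")
    hits = []
    for seq_id, sequence in stored.items():
        start = sequence.find(target)
        while start != -1:
            hits.append((seq_id, start))
            start = sequence.find(target, start + 1)
    return hits


def find_motif(stored, pattern):
    """Return (seq_id, start, matched_text) for each non-overlapping
    match of a motif expressed as a regular expression."""
    regex = re.compile(pattern)
    return [(seq_id, match.start(), match.group())
            for seq_id, sequence in stored.items()
            for match in regex.finditer(sequence)]
```

A practical search means would more likely rely on an alignment tool that tolerates mismatches; the sketch shows only the idea of locating matching regions within stored sequence information.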

The present invention therefore provides a medium for holding instructions for
performing a method for determining whether a subject has ovarian cancer or a
pre-disposition to ovarian cancer, wherein the method comprises the steps of determining
the presence or absence of a marker and based on the presence or absence of the marker,
determining whether the subject has ovarian cancer or a pre-disposition to ovarian
cancer and/or recommending a particular treatment for ovarian cancer or pre-ovarian
cancer condition.

The present invention further provides, in an electronic system and/or in a
network, a method for determining whether a subject has ovarian cancer or a
pre-disposition to ovarian cancer associated with a marker wherein the method comprises
the steps of determining the presence or absence of the marker, and based on the
presence or absence of the marker, determining whether the subject has ovarian cancer
or a pre-disposition to ovarian cancer, and/or recommending a particular treatment for
the ovarian cancer or pre-ovarian cancer condition. The method may further comprise
the step of receiving phenotypic information associated with the subject and/or acquiring
from a network phenotypic information associated with the subject.

The present invention also provides, in a network, a method for determining
whether a subject has ovarian cancer or a pre-disposition to ovarian cancer associated
with a marker, said method comprising the steps of receiving information associated
with the marker, receiving phenotypic information associated with the subject, acquiring
information from the network corresponding to the marker and/or ovarian cancer, and
based on one or more of the phenotypic information, the marker, and the acquired
information, determining whether the subject has ovarian cancer or a pre-disposition to
ovarian cancer. The method may further comprise the step of recommending a
particular treatment for the ovarian cancer or pre-ovarian cancer condition.

The present invention also provides a business method for determining whether a
subject has ovarian cancer or a pre-disposition to ovarian cancer, said method
comprising the steps of receiving information associated with the marker, receiving
phenotypic information associated with the subject, acquiring information from the
network corresponding to the marker and/or ovarian cancer, and based on one or more
of the phenotypic information, the marker, and the acquired information, determining
whether the subject has ovarian cancer or a pre-disposition to ovarian cancer. The
method may further comprise the step of recommending a particular treatment for the
ovarian cancer or pre-ovarian cancer condition.

The invention also includes an array comprising a marker of the present
invention. The array can be used to assay expression of one or more genes in the array.
In one embodiment, the array can be used to assay gene expression in a tissue to
ascertain tissue specificity of genes in the array. In this manner, up to about 7600 genes
can be simultaneously assayed for expression. This allows a profile to be developed
showing a battery of genes specifically expressed in one or more tissues.

In addition to such qualitative determination, the invention allows the
quantitation of gene expression. Thus, not only tissue specificity, but also the level of
expression of a battery of genes in the tissue is ascertainable. Thus, genes can be
grouped on the basis of their tissue expression per se and level of expression in that
tissue. This is useful, for example, in ascertaining the relationship of gene expression
between or among tissues. Thus, one tissue can be perturbed and the effect on gene
expression in a second tissue can be determined. In this context, the effect of one cell
type on another cell type in response to a biological stimulus can be determined. Such a
determination is useful, for example, to know the effect of cell-cell interaction at the
level of gene expression. If an agent is administered therapeutically to treat one cell
type but has an undesirable effect on another cell type, the invention provides an assay
to determine the molecular basis of the undesirable effect and thus provides the
opportunity to co-administer a counteracting agent or otherwise treat the undesired
effect. Similarly, even within a single cell type, undesirable biological effects can be
determined at the molecular level. Thus, the effects of an agent on expression of genes
other than the target gene can be ascertained and counteracted.
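
The qualitative (tissue specificity) and quantitative (level of expression) uses of array data described above can be illustrated with a minimal sketch. It assumes array readings have already been reduced to one numeric level per gene per tissue; the function names and the detection threshold are hypothetical and are not part of the disclosed array.

```python
def expression_profile(levels, detection_threshold):
    """For each gene, keep the tissues in which it is detected together
    with the measured level in each such tissue.

    levels maps gene -> {tissue: measured level}. The threshold is a
    hypothetical cut-off chosen by the user, not a value from the text.
    """
    return {
        gene: {tissue: value for tissue, value in by_tissue.items()
               if value >= detection_threshold}
        for gene, by_tissue in levels.items()
    }


def tissue_specific(profile, all_tissues):
    """Genes detected in at least one assayed tissue but not in all."""
    total = len(set(all_tissues))
    return {gene: sorted(detected)
            for gene, detected in profile.items()
            if 0 < len(detected) < total}
```

The first function retains the quantitative levels, so that genes can be grouped both by the tissues in which they are expressed and by their level of expression in those tissues.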

In another embodiment, the array can be used to monitor the time course of
expression of one or more genes in the array. This can occur in various biological
contexts, as disclosed herein, for example development of ovarian cancer, progression of
ovarian cancer, and processes, such as cellular transformation, associated with ovarian
cancer.

The array is also useful for ascertaining the effect of the expression of a gene on
the expression of other genes in the same cell or in different cells. This provides, for
example, for a selection of alternate molecular targets for therapeutic intervention if the
ultimate or downstream target cannot be regulated.

The array is also useful for ascertaining differential expression patterns of one or
more genes in normal and abnormal cells. This provides a battery of genes that could
serve as a molecular target for diagnosis or therapeutic intervention.
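
A differential expression comparison of the kind just described might be sketched as follows. The function name and data layout are hypothetical; the default factor of 2 merely mirrors the "factor of at least about 2" language used in the claims and is not a recommended cut-off.

```python
def differentially_expressed(normal, abnormal, min_factor=2.0):
    """Return genes whose expression differs between normal and abnormal
    cells by at least min_factor, with the direction of the change.

    normal and abnormal map gene -> non-negative expression level.
    """
    result = {}
    for gene in sorted(normal.keys() & abnormal.keys()):
        n, a = normal[gene], abnormal[gene]
        if n < 0 or a < 0:
            raise ValueError(f"negative expression level for {gene}")
        low, high = min(n, a), max(n, a)
        if high == 0:
            continue  # not expressed in either sample
        # A gene absent from one sample differs by an unbounded factor.
        factor = float("inf") if low == 0 else high / low
        if factor >= min_factor:
            result[gene] = ("up" if a > n else "down", factor)
    return result
```

Genes returned by such a comparison would be candidates for the battery of molecular targets referred to above, subject to whatever statistical assessment the user applies.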
E. Surrogate Markers

The markers of the invention may serve as surrogate markers for one or more
disorders or disease states or for conditions leading up to disease states, and in
particular, ovarian cancer. As used herein, a "surrogate marker" is an objective
biochemical marker which correlates with the absence or presence of a disease or
disorder, or with the progression of a disease or disorder (e.g., with the presence or
absence of a tumor). The presence or quantity of such markers is independent of the
disease. Therefore, these markers may serve to indicate whether a particular course of
treatment is effective in lessening a disease state or disorder. Surrogate markers are of
particular use when the presence or extent of a disease state or disorder is difficult to
assess through standard methodologies (e.g., early stage tumors), or when an assessment
of disease progression is desired before a potentially dangerous clinical endpoint is
reached (e.g., an assessment of cardiovascular disease may be made using cholesterol
levels as a surrogate marker, and an analysis of HIV infection may be made using HIV
RNA levels as a surrogate marker, well in advance of the undesirable clinical outcomes
of myocardial infarction or fully-developed AIDS). Examples of the use of surrogate
markers in the art include: Koomen et al. (2000) J. Mass. Spectrom. 35: 258-264; and
James (1994) AIDS Treatment News Archive 209.

The markers of the invention are also useful as pharmacodynamic markers. As
used herein, a "pharmacodynamic marker" is an objective biochemical marker which
correlates specifically with drug effects. The presence or quantity of a
pharmacodynamic marker is not related to the disease state or disorder for which the
drug is being administered; therefore, the presence or quantity of the marker is
indicative of the presence or activity of the drug in a subject. For example, a
pharmacodynamic marker may be indicative of the concentration of the drug in a
biological tissue, in that the marker is either expressed or transcribed or not expressed or
transcribed in that tissue in relationship to the level of the drug. In this fashion, the
distribution or uptake of the drug may be monitored by the pharmacodynamic marker.
Similarly, the presence or quantity of the pharmacodynamic marker may be related to
the presence or quantity of the metabolic product of a drug, such that the presence or
quantity of the marker is indicative of the relative breakdown rate of the drug in vivo.
Pharmacodynamic markers are of particular use in increasing the sensitivity of detection
of drug effects, particularly when the drug is administered in low doses. Since even a
small amount of a drug may be sufficient to activate multiple rounds of marker
transcription or expression, the amplified marker may be in a quantity which is more
readily detectable than the drug itself. Also, the marker may be more easily detected
due to the nature of the marker itself; for example, using the methods described herein,
antibodies may be employed in an immune-based detection system for a protein marker,
or marker-specific radiolabeled probes may be used to detect a mRNA marker.
Furthermore, the use of a pharmacodynamic marker may offer mechanism-based
prediction of risk due to drug treatment beyond the range of possible direct
observations. Examples of the use of pharmacodynamic markers in the art include:
Matsuda et al., US 6,033,862; Hattis et al. (1991) Env. Health Perspect. 90: 229-238;
Schentag (1999) Am. J. Health-Syst. Pharm. 56 Suppl. 3: S21-S24; and Nicolau (1999)
Am. J. Health-Syst. Pharm. 56 Suppl. 3: S16-S20.

The markers of the invention are also useful as pharmacogenomic markers. As
used herein, a "pharmacogenomic marker" is an objective biochemical marker which
correlates with a specific clinical drug response or susceptibility in a subject (see, e.g.,
McLeod et al. (1999) Eur. J. Cancer 35(12): 1650-1652). The presence or quantity of
the pharmacogenomic marker is related to the predicted response of the subject to a
specific drug or class of drugs prior to administration of the drug. By assessing the
presence or quantity of one or more pharmacogenomic markers in a subject, a drug
therapy which is most appropriate for the subject, or which is predicted to have a greater
degree of success, may be selected. For example, based on the presence or quantity of
RNA or protein for specific tumor markers in a subject, a drug or course of treatment
may be selected that is optimized for the treatment of the specific tumor likely to be
present in the subject. Similarly, the presence or absence of a specific sequence
mutation in marker DNA may correlate with drug response. The use of
pharmacogenomic markers therefore permits the application of the most appropriate
treatment for each subject without having to administer the therapy.

VI. Experimental Protocol

A. Subtracted Libraries 

Subtracted libraries are generated using a PCR based method that allows the
isolation of clones expressed at higher levels in one population of mRNA (tester)
compared to another population (driver). Both tester and driver mRNA populations are
converted into cDNA by reverse transcription, and then PCR amplified using the
SMART PCR kit from Clontech. Tester and driver cDNAs are then hybridized using
the PCR-Select cDNA subtraction kit from Clontech. This technique results in both
subtraction and normalization, which is an equalization of copy number of
low-abundance and high-abundance sequences. After generation of the subtractive libraries,
a group of 96 or more clones from each library is tested to confirm differential
expression by reverse Southern hybridization.

To create the subtracted libraries, a first group of regular cDNA libraries was
constructed. Library johOa was constructed from a pool of 5 normal ovarian epithelial
cell cultures. Library johOb was constructed from a pool of 5 ascites short cultured
samples from ovarian cancer patients. Library johOc was constructed from a pool of 6
serous late stage (III/IV) tumor samples. Three subtracted libraries were generated from
tumor samples. Library johOd was a subtracted ascites library, where the tester was
johOb, and the driver was johOa. The johOe and the johOf library were both subtracted
stage III/IV serous tumor libraries. The tester for both of these libraries was johOc, and
the driver was a pooled RNA from normal tissues. The tissues used for this driver pool
were: kidney, small intestine, prostate, lung, heart, muscle, spleen, pancreas, liver, and
lymphocyte. Library cMhOg was the same as the johOe and johOf libraries, with the
exception that normal ovary was added to the driver. cMhOh, i, j, and k are all stage I/II
subtracted libraries made from pooled tumor RNAs of different histological types
(h=serous, i=endometrioid, j=clear cell, k=mucinous). The driver was the same for these
4 libraries. It consisted of normal ovarian epithelial RNA and PBML RNA. Of the
markers listed in Table 1, SEQ ID NOS: 1-129, 916-1029, 1566-1571 and 1607-1865
were identified in library johOa. Markers identified in johOb include SEQ ID NOS:
130-177, 1030-1081, 1572-1574, and 1866-1974. Markers identified in johOc include
SEQ ID NOS: 178-269, 1082-1120, 1575-1577, and 1975-2060. Markers identified in
johOd include SEQ ID NOS: 270-370, 1121-1304, 1578-1592, and 2061-2244. Markers
identified in johOe include SEQ ID NOS: 371-611, 1305-1416, 1593-1596 and
2245-2487. Markers identified in johOf include SEQ ID NOS: 612-915, 1417-1565,
1597-1606, and 2488-2871. Of the markers listed in Table 1A, SEQ ID NOS: 2872-2976,
3817-3898, 4438-4443 and 4474-4675 were identified in library cMhOg. Markers
identified in cMhOh include SEQ ID NOS: 2977-3376, 3899-4072, 4444-4455, and
4676-5303. Markers identified in cMhOi include SEQ ID NOS: 3377-3495, 4073-4158,
4456-4460, and 5304-5637. Markers identified in cMhOj include SEQ ID NOS:
3496-3742, 4195-4390, 4461-4468, and 5638-6197. Markers identified in cMhOk include
SEQ ID NOS: 3743-3816, 4391-4437, 4469-4473 and 6198-6398.
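
For convenience, the correspondence between SEQ ID NOS and libraries recited above can be expressed as a simple lookup. The sketch below transcribes the ranges exactly as listed in the preceding paragraph; the names `LIBRARY_RANGES` and `library_for` are hypothetical and serve only as an illustration.

```python
# Inclusive SEQ ID NO ranges per library, transcribed from the listing above.
LIBRARY_RANGES = {
    "johOa": [(1, 129), (916, 1029), (1566, 1571), (1607, 1865)],
    "johOb": [(130, 177), (1030, 1081), (1572, 1574), (1866, 1974)],
    "johOc": [(178, 269), (1082, 1120), (1575, 1577), (1975, 2060)],
    "johOd": [(270, 370), (1121, 1304), (1578, 1592), (2061, 2244)],
    "johOe": [(371, 611), (1305, 1416), (1593, 1596), (2245, 2487)],
    "johOf": [(612, 915), (1417, 1565), (1597, 1606), (2488, 2871)],
    "cMhOg": [(2872, 2976), (3817, 3898), (4438, 4443), (4474, 4675)],
    "cMhOh": [(2977, 3376), (3899, 4072), (4444, 4455), (4676, 5303)],
    "cMhOi": [(3377, 3495), (4073, 4158), (4456, 4460), (5304, 5637)],
    "cMhOj": [(3496, 3742), (4195, 4390), (4461, 4468), (5638, 6197)],
    "cMhOk": [(3743, 3816), (4391, 4437), (4469, 4473), (6198, 6398)],
}


def library_for(seq_id):
    """Return the library in which the given SEQ ID NO was identified,
    or None if the number falls outside every listed range."""
    for library, ranges in LIBRARY_RANGES.items():
        if any(low <= seq_id <= high for low, high in ranges):
            return library
    return None
```

As transcribed here, SEQ ID NOS: 4159-4194 do not fall within any listed range, so the lookup returns `None` for those numbers.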

VII. Summary Of The Data Provided In The Tables 

Tables 1, 1A, 2 and 3 are being filed concurrently herewith on a compact disc in
lieu of paper copies. The compact disc submitted is formatted from an IBM-PC and is
compatible with MS-Windows. The disc contains the following four (4) files:
Table1.text, containing 1,223kb, Table1A.text, containing 1,582kb, Table2.text,
containing 10,600kb, and Table3.text, containing 568kb. The material on the compact
disc, namely Tables 1, 1A, 2 and 3, is expressly incorporated by reference.

Tables 1 and 1A show 6398 novel nucleotide sequences. These 6398 novel 
sequences were determined to be novel through various BLAST searches of available 
databases. Of these novel markers, SEQ ID NOS: 1566 - 1606 and 4438-4473 are 
preferred, SEQ ID NOS: 916-1565 and 3817-4437 are more preferred, and SEQ ID 
NOS: 1 - 915 and 2872-3816 are most preferred. 

The sequences of Tables 1 and 1A were re-interpreted and vector sequences
removed and those sequences are set forth in Table 2.

Table 3 correlates the SEQ ID NOS. from Tables 1 and 1A with those of Table 2.

The contents of all references, patents, published patent applications, and 
databases cited throughout this application are hereby incorporated by reference. 



Other Embodiments 

Those skilled in the art will recognize, or be able to ascertain using no more than
routine experimentation, many equivalents to the specific embodiments of the invention
described herein. Such equivalents are intended to be encompassed by the following
claims.

What is claimed is:

Claims

1. An isolated nucleic acid molecule comprising a nucleotide sequence of
Tables 1-2, or a complement thereof.

2. A vector which contains the nucleic acid molecule of claim 1.

3. A host cell which contains the nucleic acid molecule of claim 1.

4. An isolated polypeptide which is encoded by a nucleic acid molecule
comprising a nucleotide sequence of Tables 1-2.

5. An antibody which selectively binds to a polypeptide of claim 4.

6. A method for producing a polypeptide comprising culturing the host cell
of claim 3 under conditions in which the nucleic acid molecule is expressed.

7. A method for detecting the presence of a polypeptide of claim 4 in a
sample comprising:

a) contacting the sample with a compound which selectively binds to
the polypeptide; and

b) determining whether the compound binds to the polypeptide in
the sample to thereby detect the presence of a polypeptide of claim 4 in the
sample.

8. A kit comprising a compound which selectively binds to the polypeptide
of claim 4.

9. A method for detecting the presence of a nucleic acid molecule of claim 1
in a sample comprising:

a) contacting the sample with a nucleic acid probe or primer which
selectively hybridizes to the nucleic acid molecule; and

b) determining whether the nucleic acid probe or primer binds to a
nucleic acid molecule in the sample to thereby detect the presence of a nucleic
acid molecule of claim 1 in the sample.

10. The method of claim 9, wherein the sample comprises mRNA molecules
and is contacted with a nucleic acid probe.

11. The method of claim 9, wherein the sample is isolated from ovarian
tissue.

12. The method of claim 9, wherein the sample is a tumor sample.

13. A kit comprising a compound which selectively hybridizes to a nucleic
acid molecule of claim 1.

14. A method of assessing whether a patient is afflicted with ovarian cancer,
the method comprising comparing:

a) the level of expression of a marker in a patient sample, wherein
the marker is selected from the group consisting of the markers listed in Tables
1-2, and

b) the normal level of expression of the marker in a control
non-ovarian cancer sample,

wherein a significant difference between the level of expression of the marker in
the patient sample and the normal level is an indication that the patient is afflicted with
ovarian cancer.

15. The method of claim 14, wherein the marker corresponds to a secreted
protein.

16. The method of claim 14, wherein the marker corresponds to a transcribed
polynucleotide or portion thereof, wherein the polynucleotide comprises the marker.

17. The method of claim 14, wherein the sample comprises cells obtained
from the patient.

18. The method of claim 17, wherein the sample is an ovarian tissue sample.

19. The method of claim 14, wherein the sample is an ovary-associated body
fluid.

20. The method of claim 14, wherein the level of expression of the marker in
the sample is assessed by detecting the presence in the sample of a protein or protein
fragment corresponding to the marker.

21. The method of claim 20, wherein the presence of the protein or protein
fragment is detected using a reagent which specifically binds with the protein or protein
fragment.

22. The method of claim 21, wherein the reagent is selected from the group
consisting of an antibody, an antibody derivative, and an antibody fragment.

23. The method of claim 14, wherein the level of expression of the marker in
the sample is assessed by detecting the presence in the sample of a transcribed
polynucleotide or portion thereof, wherein the transcribed polynucleotide comprises the
marker.

24. The method of claim 23, wherein the transcribed polynucleotide is an
mRNA.

25. The method of claim 23, wherein the transcribed polynucleotide is a
cDNA.

26. The method of claim 23, wherein the step of detecting further comprises
amplifying the transcribed polynucleotide.

27. The method of claim 14, wherein the level of expression of the marker in
the sample is assessed by detecting the presence in the sample of a transcribed
polynucleotide which anneals with the marker or anneals with a portion of a
polynucleotide wherein the polynucleotide comprises the marker, under stringent
hybridization conditions.

28. The method of claim 14, wherein the level of expression of the marker in
the sample differs from the normal level of expression of the marker in a patient not
afflicted with ovarian cancer by a factor of at least about 2.

29. The method of claim 14, wherein the level of expression of the marker in
the sample differs from the normal level of expression of the marker in a patient not
afflicted with ovarian cancer by a factor of at least about 5.



30. The method of claim 14, comprising comparing:

a) the level of expression in the sample of each of a plurality of
markers independently selected from the markers listed in Tables 1-2, and

b) the normal level of expression of each of the plurality of markers
in samples of the same type obtained from control humans not afflicted with
ovarian cancer,

wherein the level of expression of more than one of the markers is significantly
altered, relative to the corresponding normal levels of expression of the markers, is an
indication that the patient is afflicted with ovarian cancer.

31. The method of claim 30, wherein the level of expression of each of the
markers is significantly altered, relative to the corresponding normal levels of
expression of the markers, is an indication that the patient is afflicted with ovarian
cancer.

32. The method of claim 30, wherein the plurality comprises at least three of
the markers.

33. The method of claim 30, wherein the plurality comprises at least five of
the markers.

34. A method for monitoring the progression of ovarian cancer in a patient,
the method comprising:

a) detecting in a patient sample at a first point in time, the expression
of a marker, wherein the marker is selected from the group consisting of the
markers listed in Tables 1-2;

b) repeating step a) at a subsequent point in time; and

c) comparing the level of expression detected in steps a) and b), and
therefrom monitoring the progression of ovarian cancer.

35. The method of claim 34, wherein the marker corresponds to a secreted
protein.

36. The method of claim 34, wherein the marker corresponds to a transcribed
polynucleotide or portion thereof, wherein the polynucleotide comprises the marker.

37. The method of claim 34, wherein the sample comprises cells obtained
from the patient.

38. The method of claim 37, wherein the patient sample is an ovarian tissue
sample.

39. The method of claim 34, wherein between the first point in time and the
subsequent point in time, the patient has undergone surgery to remove ovarian tissue.

40. A method of assessing the efficacy of a test compound for inhibiting
ovarian cancer in a patient, the method comprising comparing:

a) expression of a marker in a first sample obtained from the patient
and exposed to the test compound, wherein the marker is selected from the group
consisting of the markers listed in Tables 1-2, and

b) expression of the marker in a second sample obtained from the
patient, wherein the sample is not exposed to the test compound,

wherein a significantly lower level of expression of the marker in the first
sample, relative to the second sample, is an indication that the test compound is
efficacious for inhibiting ovarian cancer in the patient.

41. The method of claim 40, wherein the first and second samples are
portions of a single sample obtained from the patient.

42. The method of claim 40, wherein the first and second samples are
portions of pooled samples obtained from the patient.

43. A method of assessing the efficacy of a therapy for inhibiting ovarian
cancer in a patient, the method comprising comparing:

a) expression of a marker in the first sample obtained from the
patient prior to providing at least a portion of the therapy to the patient, wherein
the marker is selected from the group consisting of the markers listed in Tables
1-2, and

b) expression of the marker in a second sample obtained from the
patient following provision of the portion of the therapy,

wherein a significantly lower level of expression of the marker in the second
sample, relative to the first sample, is an indication that the therapy is efficacious for
inhibiting ovarian cancer in the patient.

44. A method of selecting a composition for inhibiting ovarian cancer in a
patient, the method comprising:

a) obtaining a sample comprising cancer cells from the patient;

b) separately exposing aliquots of the sample in the presence of a
plurality of test compositions;

c) comparing expression of a marker in each of the aliquots, wherein
the marker is selected from the group consisting of the markers listed in Tables
1-2; and

d) selecting one of the test compositions which alters the level of
expression of the marker in the aliquot containing that test composition, relative
to other test compositions.


45. A method of inhibiting ovarian cancer in a patient, the method
comprising:

a) obtaining a sample comprising cancer cells from the patient;

b) separately maintaining aliquots of the sample in the presence of a
plurality of test compositions;

c) comparing expression of a marker in each of the aliquots, wherein
the marker is selected from the group consisting of the markers listed in Tables
1-2; and

d) administering to the patient at least one of the test compositions
which alters the level of expression of the marker in the aliquot containing that
test composition, relative to other test compositions.

46. A kit for assessing whether a patient is afflicted with ovarian cancer, the
kit comprising reagents for assessing expression of a marker selected from the group
consisting of the markers listed in Tables 1-2.

47. A kit for assessing the presence of ovarian cancer cells, the kit
comprising a nucleic acid probe wherein the probe specifically binds with a transcribed
polynucleotide corresponding to a marker selected from the group consisting of the
markers listed in Tables 1-2.

48. A kit for assessing the suitability of each of a plurality of compounds for
inhibiting ovarian cancer in a patient, the kit comprising:

a) the plurality of compounds; and

b) a reagent for assessing expression of a marker selected from the
group consisting of the markers listed in Tables 1-2.

49. A method of making an isolated hybridoma which produces an antibody
useful for assessing whether a patient is afflicted with ovarian cancer, the method
comprising:

isolating a protein or protein fragment corresponding to a marker selected
from the group consisting of the markers listed in Tables 1-2;

immunizing a mammal using the isolated protein or protein fragment;

isolating splenocytes from the immunized mammal;

fusing the isolated splenocytes with an immortalized cell line to form
hybridomas; and

screening individual hybridomas for production of an antibody which
specifically binds with the protein or protein fragment to isolate the hybridoma.

50. An antibody produced by a hybridoma made by the method of claim 42.

51. A kit for assessing the presence of human ovarian cancer cells, the kit
comprising an antibody, wherein the antibody specifically binds with a protein or
protein fragment corresponding to a marker selected from the group consisting of the
markers listed in Tables 1-2.



52. A method of assessing the ovarian cell carcinogenic potential of a test 
compound, the method comprising: 

a) maintaining separate aliquots of ovarian cells in the presence and 
absence of the test compound; and 

b) comparing expression of a marker in each of the aliquots, wherein
the marker is selected from the group consisting of the markers listed in Tables
1-2,

wherein a significantly altered level of expression of the marker in the aliquot
maintained in the presence of the test compound, relative to the aliquot maintained in the
absence of the test compound, is an indication that the test compound possesses human
ovarian cell carcinogenic potential.

53. A kit for assessing the ovarian cell carcinogenic potential of a test 
compound, the kit comprising ovarian cells and a reagent for assessing expression of a 
marker, wherein the marker is selected from the group consisting of the markers listed in 
Tables 1-2. 

54. A method of inhibiting ovarian cancer in a patient at risk for developing 
ovarian cancer, the method comprising inhibiting expression of a gene corresponding to 
a marker selected from the markers listed in Tables 1-2. 

55. A method of treating a patient afflicted with ovarian cancer, the method 
comprising providing to cells of the patient an antisense oligonucleotide complementary 
to a polynucleotide corresponding to a marker selected from the markers listed in Tables 



56. A method of inhibiting ovarian cancer in a patient at risk for developing 
ovarian cancer, the method comprising decreasing expression of a gene corresponding to 
a marker selected from the markers listed in Tables 1-2. 

57. A method for determining whether ovarian cancer has metastasized in a 
patient, the method comprising comparing: 

a) the level of expression of a marker in a patient sample, wherein 
the marker is selected from the group consisting of the markers listed in Tables 
1-2, and 

b) the normal level or non-metastatic level of expression of the 
marker in a control sample,
wherein a significant difference between the level of expression in the patient 
sample and the normal level or non-metastatic level is an indication that the ovarian 
cancer has metastasized. 

58. The method of claim 57, wherein the marker corresponds to a secreted 
protein. 

59. The method of claim 57, wherein the marker corresponds to a transcribed 
polynucleotide or portion thereof, wherein the polynucleotide comprises the marker. 

60. The method of claim 57, wherein the sample comprises cells obtained 
from the patient. 

61. The method of claim 60, wherein the patient sample is an ovarian tissue 
sample. 

62. A method for assessing the aggressiveness or indolence of ovarian cancer 
comprising comparing: 

a) the level of expression of a marker in a sample, wherein at least 
one marker is selected from the markers of Tables 1-2, and 

b) the normal level of expression of the marker in a control sample, 
wherein a significant difference between the level of expression in the sample 
and the normal level is an indication that the cancer is aggressive or indolent. 

63. The method of claim 62, wherein the marker corresponds to a secreted 
protein. 

64. The method of claim 62, wherein the marker corresponds to a transcribed 
polynucleotide or portion thereof, wherein the polynucleotide comprises the marker. 

65. The method of claim 62, wherein the sample comprises cells obtained 
from the patient. 

66. The method of claim 65, wherein the patient sample is an ovarian tissue 
sample. 

Sequence 1 

ACGCGTCCGGGAGACACAAAGCAGGAAGCCTCCGGGAGACCAGAGCTGGGTGCAGACATA 

CACACACACATACACACAGACACACAGAGTCACACACACTCACACACACTCTCTCTCTCT 

CTCCCTCTGTCTTTCTCTCTCTCTCTCTCTCTCTGTCCTTTCCTCCTGGACAGATCCACA 

GTTATACACAGAAACAAACACACACGCACGTAGAGAAGTGATTCACAAACACTTAAAGAC 

ATAAATCACAGGTGCAAAGCCATACCTGGGCTCAAAAACCTCCAAGAGAACGCAGCCTCA 

GACCCACCCAGGGGCCAGGGGCCAGGCTGTTTGCGAGACCAGCCCAGGCGGGACCCAGGC 

TTGCACGGGCAGGTACACGACATTCTTGGGCATACGCAGCCCGCCTGGCCGGAGCTGTGG 

GAGTCCTCAGCCCCAAGACCCAGCAGGCGTCTGAGGCCTGCCCACTAAGGAGGAGGAGTC 

ATTGCTGCCATCATTTATCATNCCCTCCCCAGCCACAGTCTGAGGAGCCCNTGTNCACCC 

TTCCCA 

Sequence 2 

CGTCCGGAAAGCTGGTGGGAATGCTAAGTTCCGAGAGTTCCTGGAGTCTCAGGAGGATTA 
CGATCCTTGCTGGTCCTTGCAGGAGAAGTACAACAGCAGAGCCGCGGCCCTCTTTAGGGA 
TAAGGTGGTCGCTCTGGCCGAAGGCAGAGAGTGGTCTTTGGAGTCATCACCTGCCCAGAA 
CTGGACCCTACCTNAGCCCANGACGCTGCCGTCCATGGTGCACCGAGTCTCTGGCCAGCC 
GCAGAGTGTGACCGCCTCCTNGGACAAGGCTTTTGAAGACTGGCTGAATGATGACCTCGG 
CTCCTATCAAGGGGCCCAGGGGAATCGCTACGTGGGGTTTGGGAACACGCCACCGCCTCA 
NAAGAAAGAAGATGACTTTCTCAACAACGCCATGTCCTCCCTGTACTCGGGCTGGA 
Sequence 3 

NCCACGCGTCCGGGACGCGGGCGCCAGGTGCACAGCCCCAGTCCGCTGCGGGCGGGCGTC 

GACATCTGCCGCGTGAGCGGGAGCTGGAGCTACACCGCTTTCGTGACCCGTGGAGGCCGC 

TTGGAGCTGTCGGGCTCAGCCAGCGGCGCGGCGGGCCGCTGCAAGGACGCGTGGGCCTCG 

GAGGGGCTCCTCGCGGTGCTGCGCGCCGGGCCGGGGCCGGAGGCGTTACTGCAGGTCTGG 

GCGGCCGAATCGGCGCTGCGTGGGGAGCCATTGTGGGCCCAGAATGGTGGTGCCCGAGGC 

CGAAGGGGGAAGGACGATCCGGCCGGGTGAAGGCCCAAGCTTGGGGAGGCTACCCCTTGC 

TTGCCCTGCGGCCCCGTGCCCCTACGTGAGCCCCCGCGGGCCGCCCTTCTACCGGCCTTT 

TGGCTTCCGGGAGCTTGCGGGGCAACGCCAACTTGGGAGCTGGGGCCGCCAGCAACCCCG 

TTGCTTGTTGGACGCTGTGCCCAAAGGTGGTTTCTGGGGGGCGGGG 

Sequence 4 

AGTCNCCACGCGTCCGGGAATTGANGCCGCGGGGCGGGCGGCGGCGGCGGCTGGGCGGCG 

GCCGGGACCCAGCGGGCCAGGTGGGGACGGCGCGGAGCGGGTGCGGGAGATGCCGTGCGG 

GACTGGGGCCACCTGAGCCGCCCGCCTCGTCCCCGCCTTCTGTGGGAAGGATGTGCGCGC 

GGATGGCCGGTCGCACAACAGCGGCCCCTCGGGGGCCCTACGGCCCCTGGCTCTGCCTCC 

TGGTGGCCCTCGCCCTGGGACCGTCGTGAGAGTGGACTGTGGCCAGGCTCCCCTGGACCC 

TGTCTACCTGCCGGCAGCCCTGGAGCTCCTAGACGCCCCTGAACACT 

Sequence 5 

TCNCCACGCGTCCGACTGTATGTATTCTGGATACAGGGGATACTGGGCTCGCTATGTGTG 

TGGAGCCATCCCTTCCTTGCCCCAGCCCCACCTCCCTCTCMACCCTCTCTGGCTCTTTC 

TGAGCTTCCTTTCCTGCTCCCCAGCTTGCCCAGTGCTCAGTGCCCCACTTGGCTCTTTTG 

CTACTTCGGGTCAGGTGGAGCCTCTTGGGAATGTGAAGTGCCTTACAGAAAGATTGCACT 

TCAAGAGGAGAGGCTGCAGGGAGCCATCCTAAACCCAGAGGCCTGGAGCTTACCCGTGTC 

ACTTTACTTTTGTACACAAGGGGGTCTCCTTAGTGCCCTCGAGAAGGGATTCTTGGGCeC 

TGAGCTTCTACTCCTGAGGCCACCTTCTGTGCAAGCCCCAAGCTCCCTCAACTCTAGGCT 

TGGAGTCCTCAGTGGGGAAAAGCCCTGNTTTGGGG 

Sequence 6 

CGTCCGCGTCCTCGCTGCGGAAAGTTGGGGCAACCTGTTGCTAGTCTGGTCGTTGGTGAC 
AGCGAGGCTTCCGCGCTCGCTGCTGGTGAGCAGCCCCGGCGTGCCCCGCGGGCTGGAAGA 
GGCGGCGGCGTGATGCGGCCCGTGGACGCGCCCCGCGCCGGCCCGCGAGAACCTGGCCTC 
CCTGGAGCGCGAGCGCGCCCGGGCGCACTGGCGGGCCCGCAGGAAGCTGCTGGAGATCCA 
GAGCCTGCTCGACGCCATCAAGAGTGAGGTGGAGGCAGAGGAGCGGGGCGCCCGGGCCCC 

AGCACCCCGCCCCGCGTGCGGAGGCTGAGGAGNCGGGTGGCTCGGCTGTGCGCCCGAAGC 

AGAGAGGAAGGCTGCGGAAGCGGCGCGGATGGGGCAGGCGGGATCGTGGGAGCTGCACNN 

ACCGGATCGCCGGCTTGCGAGTGCTGCTGAGCCGGCGAGGCCCNCGGGTCTGGAAGCGGA 

ANCGCGCGGG 

Sequence 7 

NTTCGGGAGTCGACCACGCGTCCGCGCGCTGGAGGAGTGGAGCAAGCACCCGGCCCGGCC 

CTGGGGGCTGACAGTCGGCAAAGTTTGGCCCGAAGAGGAAGTGGTCTCAAACCCCGGCAG 

GTGGCGACCAGGCCAGACCAGGGGCGCTCGCTGCCTGCGGGCGGGCTGTAGGCGAGGGCG 

CGCCCCAGTGCCGAGACCCGGGGCTTCAGGAGCCGGCCCCGGGAGAGAAGAGTGCGGCGG 

CGGACGGAGAAAACAACTCCAAAGTTGGCGAAAGGCACCGCCCCTACTCCCGGGCTTGCC 

GCCGCCTCCCCGCCCCCAGCCCTGGCATCCAGAGTACGGGTCGAGCCCGGGCCATGGAGC 

CCCCCTGGGGAAGGCGGCACCAGGGGAGCCTTGGGCGCCCNGGGCTTCGGCCGCGACCCC 

ATTTGGGGTAGACCACAAGAAAGCTTCGGGACCCTTTCGGCACCTTTGGACAGCCAAGAA 

TGGCTGNTGGGCACCCTTTCTTCT 

Sequence 8 

CCCCGCGTCCGGAGCACGCAAAGGGAATAAATTGTAATTAGGTGGTGGGTGGCTAAAAAT 

GACAATGCAAAGGTGTTGGATTAAAAAAAAAATCTGGTAGTAGAGGGAAATTATGGAGGA 

TTTTTAAAAAGGTTAATGATAATATCCATCTACTTATGTAACTTTTTTTGGAGATACCTC 

ATAATAGTGTAGAGTGCATTGGAGAGGAAAAGTAGGAGTTGTAAAGACCATTTTGGATAA 

ACTTTGAAGCAAGGGATAATGGCCTCAACCAAGGTAGTGGTGCTGAAGATTGTTTACATA 

MTAAGCAGATACAMTAGAAGGGATTTTTCAAGTGGCATTGTMCTGCACTTTTCAAAG 

GTTATTTGCCAAAAATCAAATTAACGGTATCTTCAAAAATCATGTTTGATGGATGTATCA 

TCAAGGGCTTTCTTAAAATTTTGTTGAAAGCCAAGGAA 

Sequence 9 

CGCCTTCCCGGGAAGTTTGGAGGGCCCCGNAGGGGAAGCCCCCGCGNCTTCNGGGGGCCN 

GNCGGGCTTGGAAGGCAANCCCCACCCCAAGTTTCCCCGCCNANGGANTNCAATGAANCT 

TGACCGGGGCCCCCCGGMCCCCNCGCTNGNCTTNTTNGGGGGTGGTTCCTTGGGTTCCG 

GTGGGGGGGAACCCCMGTGCTTTTTCAAGGCCCCGGCCGGGCGCGGGGCCCCGGAAAGG 

GCCTTTCAAGTTCTTNCCTTTCCCCGGNTTGAAAGAAGGNAAAAGGCCGGGAANGGAAAC 

CNGGGNAAAACCCGCNGGGNNGGGGCGGCCTTCNNCCGCCNGGGCGCCCCTTGCCNGGGG 

GGGGGNAAAGGGGNCAAAGTTTTCCCNGGGGCCCCCGGGGCCCCGGCNGNCCCTTTNAAN 

TCAAGGGGGCCGGGNNCGGNCTTTCCCCAANNCGGCCAAGTTCCTTCAAAGGGGCCCCCC 

CGGGNTTTGGGCCCGGNCCGGGNCNGNMCCTTGGGGAAGGAAMAATTCAAAAAGTTTT 

GTTGGCCCGGTTTCCGGGTNGGAATTGCCCCCCNAAAATTTGNAAGCCGGGGGGGGGGNN 

NCCTTGGGGGCCCCTTCTTTGCCCCCNTTTAAAGGGAANGGGCNAAAACCTTTNCCCAAC 

CGNCCAANNGCCCCCGTNAAMAANGGGCCGGCCTTNNTTTTGNCCGGGGCCCCNAANAA 

GGGCCTTTTCGNTTTTTTCGGGTTTTTTCCCCCGGGCCGGG 

TTTTTGGCTTTTAAAGGGGGG 

Sequence 10 

NCGCGTCCGCCGCATTGTGGCCAAGTGCCATGAGGAGCAGCTGGATCATTCTGTCCAGTC 

ATATATTAAGTTCGTGTTCAAGACCAGGGCATGCAAGGAGAGGACTGTACATGAGGAACT 

GNCTAAAAATGTGACTGGTCTTTTGAAATCAAATGACTCAACAACAGTAAAGCATGTCCT 

MAGCATTCCTGGTTCTTCTTTGCMTTATCCTAAAATCGATGGCACAGCACTTGATTGA 

CACAAATAMATCCAGCTTCCCCGGCCTCAGAGATTTCCTGAATCTTACCAAAATGAATT 

GGACAATCTTGNCATGGTCCTATCCGACCATGTGATTTGGGAAATACAAGGATGCACTTG 

AAGAMCANGAAGGGCMACCACAGCGTTGCCAGATTTCTCAAGCGCTGCTTTAC 

Sequence 11 

CGCCNCGCGTCCGGCTTCCTAGAAGAGCACAGTCCCTTAAAGCACCCTCTATTGCTACAA 

TTAAAAGTCTAGCAGATTGTMCTTTAGTTACACMGTTCTAGAGATGCTTTTGGCTATG 

CTACACTGAAAAGACTACAGCAACAAAGAATGCATCCATCCTTATCTCACTCTGAAGCTT 
TGGCATCTCCAGCAAAAGATGTGCTATTTACTGATACCATCACCATGAAGGCCAACAGTT 

TTGAGTCCAGATTAACACCAAGCAGGTTCATGAAAGCCTTAAGTTATGCATCATTAGATA 

AAGAAGATTTATTGAGTCCTATTAATCAAAATACCCTGCAACCGATCTTCCTCAGTGCGG 

GCCATGGNGTCCAGTGCCACATNGGGGGGGTCAGAATGATTACATTGGGCTTGCTCTCCC 

GGNGGATATAAATGATATATTTCANGGTMGGGTATTTCTTATTTTTAGACAAAAAACAT 

CCCCNCATGATGATCCAGGNGCCAGAGCATTTGCCCTGAATGCAGGAGGGCTTTCATNTG 

GNACTGGGNGGGCTTTGNAAAAAATTTTTT 

Sequence 12 

TTCGGGAGTCGACCACGCGTCCGCCAAGTCCTGCGATGATGGACTCAACACCTTCCGCGA 

CGAGGGCCGGGTTCTGCGGCGCCTGCCAAACCGCATACCCAGCCTGCGGATGCTCCGGAG 

CTTCTTCACCGACGGGTCCTTGGATAGCTGGGGCACCTCTGAAGATGCTGACGCTCCTTC 

TAAGCGACACTCAACCTCTGACCTCTCAGATGCGACCTTCAGCGATATCAGGAGAGAAGG 

CTGGTTGTATTATAAGCAGATTCTCACCAAGAAGGGGAAGGCTGAGGACCGGGATGACAT 

GCTGGGCTGGATCAGAGCGATCCGGGAGAACAGCAGGGCCGAGGGCGAGGACCCCCGGCT 

GTGCCAACCAAGCTCTTGATCAGCAAGAAGCTTAATGATTATCGCAAAGTGAGCCATAGC 

TCTGGGCCCAAAGCTTGATTTCTTCCC 

Sequence 13 

GTATTAATGTTCTCAGGCATGAAGCAGAATTTTACGGGATCACTCCATTAGTAAGAAGGC 

TTCTCTTATGTGMGMTTGGAGCGTTCCTCTTGTGGCAGTGTCCTTTTTCATGGTTACT 

TGCCCCCACCAGGTATTCCTAGTNCGTAAAATAAACAACACAGTCAGATCTGCTGATTCT 

AGGAATGGTCTAAATTCTACAGAAGGTGAAGCCCGGGGAAATGGTACACAGCCTGTTCTC 

TCTGGAACGGGAGAAGAAACTGTTAGGCTAGGATTTCCTGTGGATCCACNAAAGGTGCTA 

ATAGTAGCTGGCCATCACAACTGGATTGTAGCTGCATATGCCCATTTTGCTGTGTGTTAC 

AGAATCAAAGAATNTTNANGATGGCAGCAAGTGNtTACGAGCCCATATTTGGATTGGACT 

ATCGAACGAGTAGNTTTAAATGCAAAGGTGGATGGAGGGCCACATGGAGACAAAAGACAA 

AAATG 

Sequence 14 

GCCNCGCGTCCGAAAAAATTAAGAGAAGGCCTGGCGGCCGGTCTGAAGTCATCTATAATT 

ATGTACAACGCCCCTTCATCCAGATGTCATGGGAAAAGGAAGAAGGGAAGAGTCGCCATG 

TGGATTTCTAGTGTGTTCGAAGCAAATCCCTCACGAATCTGGTAGCTGCTGGAGATGATG 

TCTTGGAGGACCAGGAGATATTAATGCATCACCCACCCCAAGTGGATGAACTTGACCGGC 

TAAATGCCCCACTTTCTCAGATGGCTTCTAACGACTTTCAGGATTAGGGCCAGCTGTGGG 

TCTACTCCTTGTTGGAGCCCATCTCACCTGGGATGCCTGCAGCCAGCCCTCCCTCGTGAT 

TTGTCTCACCTTGAGTAGGAGACATGCTTCTCCCCTAACCTTTTCCTTTCTGCCATAATT 

AACATATGTCCTTTTCAGTAAGTCCATGCCTCTGGCAGGGGATGAAAGAAGTACTCACTG 

GGTAATTAGCTACCATCTTTGCAGCACCCTGGTAACTTGAAAAATTT 

Sequence 15 

TCGGGAGTCGACCCCGCGTCCGCCGAGCGGGGCGGCGCGGCTGGCGGGGCCGGCGGCCGG 

CTGAAGCGAGAGCGCGACGCGACGCGACCGCGGCTTCCCGAGCTGCGCCTGGCCGNCCAG 

CGCCGCGGNCCGCCCGAGGCCTGGAGGGGTCCGGGCCGCCGTCCATGGTCGCGGCGTCCT 

GAGGCGGGGGACGCGCCCGGCGCCCCCGGCCCTCCTNCGCCTCCTCCCGCGGGGCGGGCG 

GCCTCCTCCGGCGCCTNCCCGCGCCCGCCCGCCGNTCGCCGCCGCCTCCCTCCCTCCTTC 

CCTGCGGCTCCCCCGGCTTTCGGAGCCCGGGGGCGGCCTGTGGCGCGCGGAGCCCGCGCC 

GGACTGCGCCTNTTTGGACCTTGAGGGGAAACATGCGTTTGCCNTGGATCGTTTGAAATT 

CTG AGTT T GGGATCCC CGNCCGGCCGNCTGGCTTTTTCGCCCGCGGG I I I 1 1 ICCTTTTT 

TCCTTTTGC I I I I 1 1 I I CCTTCTT 

Sequence 16 

NGCCCCCGCGTCCGTTTTAATTATTTTGTTNGAGCCTGCANAGTAANGTTNTTAAAAATA 
TMCGTTCATACGCATTTTAATTAACTTTGAAAGTTCATATGCATCAGAAAATTTATGAA 
AATTTGMTGAAAMAATTTCATCTATTTATTTTTCTAATTTTAATGGCAAATTTACACT 
ATTATGGCTGATMTTCTGTCAACTTACCTTCTTGTTGACTGATTCTTTTTCCCTTAATC 
CCAGCTTTAAGGAGATAGGTGAAGTTATTGTACAAAGTTAAGTGATACCATAAAGTATAT 

ATTATAMGTCATACATGGCTTTTGGACAGTNTTATATTTCAGTTGCAGTGCTGCATTCC 

ATTAMTTTCATAAMTGCTAGGGAAMTGTGTTTGATAAAATTTTNTGCAGTGAGAAAT 

GACAGACTGAGTGCCTGACAATTTAAGCCACATATGAAAGTATGCAAGTAAAGANTTCAG 

GTCCTTAATGTCATCTATATCATGGTATAAAAG 

Sequence 17 

CCACGCGTCCGGACGAGACGAGCCCACTAGTGTCCCCCGAGCGGGCCCAACCCCCGGACT 

ACACCTTCCCGTCGGGCTCGGGCGCTCACTTTCCGCAGGTGCCCGGGGGCGCGGTCCGAG 

TGGCTGGCGGCGGCCGGCTCGGGCCCCTNTCCGCCGGGCTCGCCGGGCCACGACCGCTGA 

GCGGCAGCCACTGTTGGATCGGGCCCGGGGCGCGGNGGCCCAGGGCCAGACCCAAACCGT 

GGCGGCGCAGGCCCAGGCTCTGGCCGTTCANGCCGNGGCGGCAGTCCACGCCGATCAGGC 

CCACCGNGAGCGGAACGAG 

Sequence 18 

GGGAGTCGACCNCGCGTCCGGGCGGTGGGTGTCCGCTTCTCTCTGCTCTTCGACTGCACC 

GCACTCGCGCGTGACCCTGACTCCCCCTAGTCAGCTCAGCGGTGCTGCCATGGCGTGGCG 

GCGGCGCGAAGCCGGCGTCGGGGCTCGCGGCGTGTTGGCTCTGGCGTTGCTCGCCCTGGC 

CCTGTGCGTGCCCGGGGCCCGGGGCCGGGCTCTCGAGTGGTTCTCGGCCGTGGTAAACAT 

CGAGTACGTGGACCCGCAGACCAACCTGACGGTGTGGAGCGTCTCGGAGAGTGGCCGCTT 

CGGCGACAGCTCGCCCAAGGAGGGCGCGCATGGCTTGGTGGGGCGTCCCGTGGGCGCCCG 

GCGGAGACCTCGAGGGCTGCGCGCCCGACACGCGCTTTTTTCGTGCCCGAGCCCCGCGGC 

CGAGGGGCCCGCGCCCTTGGGTCGCCCTGGTGGCTCGTGGGGGCTTGCACCTTTAAGGAC 

AAGGTGCTTGTGGCGGGCGCCGAGG 

Sequence 19 

NATGTNGNNCNAAAAAGGCCNGCNTTANAGGCCAGGAAACNCGTAAAAAGGGCNCGCGTT 

GCTGTGCGTCTTTTCCATAGGCTCGCGNCCCCCTGACCNAGTCATCATCAAAAAATCCGA 

CNGCTCAAGTCATGAGGTTGGCCGAAAACTCCGACAGGGACTTNTAANAGNATACCCANG 

GGCGNTTTCCCCCTGGGAAGGCTCCCTTCGTGGCGCNTCTCNNTGTTTCCAGACCCCTGC 

CCCGCTTTACNCGGNATTACCCTNGTCCCCGCCNTTTTCTTCCCTTTCGNGGAAAGCGGT 

NGGGCGCCTTCTCNTCMTTAGGCTTCACCGCCTGNTAANGGTATTCTCAAGTTNCGGNT 

GTANGGGTGCCGTTTTCGCTTCCAAAGNCTGGGGCCTTNTGTGCCACCGGAAACCCCCCC 

Sequence 20 

TTCGGGAGTCGACCNCGCGTCCGCCTGGAGCCGCCAGAGTTTCCGCACCCGGGAGGGAGA 

TGCGGCCGGGGCTCAGGCTCCTTGCAGTTGTAATTTAGATTCGAGAAGTGGTTTATCCTT 

TGACTGGAAAAGAAAAGTAGCTGCAGTATTCCCCCAGCACTTGCTGAGAGCATGCCGTAT 

GCCAGGCTGTGAGGCTCGAGAGACAAGCAGTGGAAGAGTTGCGGCCTGTTTCATCTCTGG 

ATTGTAAATCTGAGCCTCCTTCTGGCCCCTGGMGGGGACAGCATCACGATGGAATGATT 

CCTAACCAGCATAATGCTGGAGCCGGGAGCCACCAACCTGCAGTTTTCAGAATGGCCGTG 

TTGGACACTGATTTGGATCACATTCTTCCATCTTCTGTTCTTCCTCCATTCTGGGCTAAG 

TTAGTAGTGGGATCGGTTGCCATTGTGTGTTTTGCACGCAGCTATGATGGAGACTTTGTC 

TTTGATGACTCAGAAGCTATTGTTAACAATAAGG 

Sequence 21 

CGACCACGCGTCCGGCAGCCGCGGGGCGGGCGGCGGCGGCGGCGGCGGCGGCCGGGACCC 

AGCGGGCCAGGTGGGGACGGCGCGGAGCGGGTGCGGGAGATGCCGTGCGGGACTGGGGCC 

ACCTGAGCCGCCCGCCTCGTCCCCGCCTTCTGTGGGAAGGATGTGCGCGCGGATGGCCGG 

TCGCACAACAGCGGCCCCTCGGGGGCCCTACGGCCCCTGGCTCTGCCTCCTGGTGGCCCT 

CGCCCTGGACGTCGTGAGAGTGGACTGTGGCCAGGCTCCCCTGGACCCTGTCTACCTGCC 

GGCAGCCCTGGAGCTCCTAGACGCCCCTGAACACTTCCGTGTGCAGCAGGTGGGCCACTA 

CCCACCTGCCAACTCCTCTCTGAGCTCCCGATCTGAGACCTTTCTGCTCCTACAGCCCTG 

GCCCAAGGCCCAGCCACTTCTTCGGGCCTTCTACCCAA 

Sequence 22 

CGCGTCCGCCCGGTGCCTCCGCCCATGGAACGCGCGGAGTGGCGCCGCCGCGGCTACGCG 

CCGCTGCTCTATCTGCAGTCACACTGCGACGTGCCAGCGGACCGGGACCGCTACGTGCGC 

GAGCTCATGCGCCACATCCCGGTAGACTCCTACGGGAAATGCCTGCAGAATCGGGAGCTG 

CCTACCGCGCGGCTACAGGACACAGCCACGGCCACCACCGAGGATCCAGAGCTCTTGGCT 

TTCTTGTCCCGCTATAAGTTCCACTTGGCCCTGGAAAATGCCATCTGTAACGACTACATG 

ACAGAAAAACTGTGGCGTCCCATGCACCTGGGCCGCTGTGCCCGTGTACCGCGGTTCTCC 

CTCTGTGAGGGACTGGATGCCGAACAATCACTCCGTCATCCTGATTGATGATTTTGAGTC 

TCCTCAGAAGCTGGCAGAGTTTATTGACTTTCTGGACMGAATGATGAGGAGTATATGAA 

ATACCTGGCATACAAGCAACCT 

Sequence 23 

CGCGTCCGGCTGGGCGAATNAGGGATTCCGGTTCACAATGGATGCTGATAAAGAGAAAGA 

TTTGCAGAAATTTCTTAAAAATGTGGATGAAATCTCCAATTTAATTCAGGAGATGAATTC 

TGATGACCCAGTTGTGCAACAGAAAGCTGTCCTGGAGACAGAAAAGAGACTACTGCTTAT 

GGAGGAAGACCAGGAGGAGGATGAATGCAGGACCACCTTGAACAAGACTATGATCAGTCC 

TCCACAAACTGCTCTGAAGAGTGCAGAAGAAATAAACTCAGAGGCCTTCTTGGCATCTGT 

GGAGAAGGATGCAAAGGAACGAGCCAAGAGAAGAAGGGAAAACAAAGTCTTGGCGGATGC 

CCTAAAAGAAAAAGGGAATGAAGCATTTGCTGAAGGCAATTATGAAACAGCTATCCTGCG 

CTACAGTGAGGGGTTTGGAGAAGCTGAAGGACATGAAAGTGCTGTACACCAACCGAGCCC 

AGGCTTATATGAAACTTGAGGA 

Sequence 24 

GGGAGTCGACCNCGCGTCCGCTCCCTCTGAGTTGCGCTGGGCTTGGCTGCTGCACCATGA 

CCCTGGAGGCGATCCGCTACTCGCGGGGCTCCCTGCAGATCCTAGACCAGCTGCTGCTGC 

CCAAGCAGAGCCGCTACGAGGCGGTGGGCTCGGTGCACCAGGCCTGGGAGGCCATCCGCG 

CCATGAAGGTGCGGGGCGCCCCGGCCATAGCCCTGGTGGGCTGTCTCAGCCTCGCCGTGG 

AGCTGCAGGCGGGCGCCGGGGGACCGGGACTCGCCGCGCTCGTGGCCTTCGTGCGCGACA 

AGCTGAGCTTCCTCGTCACCGCCCGGCCCACCGCTGTCAACATGGCCCGCGCCGCCCGCG 

ACCTGGCTTGATGTTGCAGCCCGGGAGGCCGAACGGGAGGGGCGCTACGGAAGAGGCCGG 

TCCGGGAGAGAGTGATCTGCTGCACCGAGGACATGCTGGAGAAAGACCTCAGAGACAACC 
GAAGCATTG 

Sequence 25 

GGAGTCGACCNCGCGTCCGGGATAACGAAGCTGCTACCATGATGATGGCTGATCTCATGT 

TCAGAMACMGACTATGMCAAGCAGTGTTTCATTTACAGCAGCTTTTAGAACGTAAGC 

CAGACTCCTCGAGTTCAGGGATCACACCATATTCCCAATATCAGACAAAATGCCACACAC 

ATGGATGTGGCAACATAGATGTTTATTGGTTGAATGGATCAGTGAATGACTGTAAAACAC 

CAAGTCAATTAAAAACACAGCAGGAGAATCGCTTGAACCTGGGAGGTGGAGGTTGCCGTG 

AGCCAAGATCACACCACTGCACTCCAGCCTCGGTGACAGAGTGAGACTTGGTCTCAAAAA 

CAACCACAAMTTTTAAATAATTATATGACATTATCTCGTTTGATTGATCTCCTAAGAAG 

ATGTGGAAAACTCGA 

Sequence 26 

ACCGTCCGGGGCCATCCAGGAGAGCCTCCTACCAGCACAGAAGGCCTGTGCCCCAGCGCC 

CTGAGCGAGACAAGCCGTTTTGATAATGACTTGCAGCTAGCCATGGAGCTCTCTGCCAAA 

GAGCTGGAGGAATGGGAGCTCCGGCTCCAGGAGGAAGAGGCTGAGCTCCAGCAAGTCTTA 

CAGCTGTCACTCACTGACAAATAGACCTTTCAGCCTGTGAGCCTCTGCACAAAGCAGAGG 

CTGTGGGCTGTCACAGATGCTGTGTCAACCAGGGCCCTAGGGCTAAGGGCCTGCACCTTG 

CGTGCATGCAGCAGGCAACAACTGCCCCTTCTTTATGCAGAGGTGCAGAACCAGGGACTC 

CTGGGCCCATCCAGGCTGCTTCCTTGGGGTGG 

Sequence 27 

NCCNCGCGTCCGGCCGGCGATGCCGCGCCCCCGGGCCGGGCTGTAGCGGGGCCGCGGCTG 
GACGTGTGCGCCGGGCAGGCGGGACATGGAGGTGGTGGACGAGACGGAGGCGCTGCAGCG 
CTTCTTCGAAGGCCACGACATCAACGGTGCCCTGGAGCCCTCCAACATAGACACCAGCAT 
CCTGGAGGAGTACATCAGCAAGGAGGATGCCTCCGACCTCACACTGCCGGACTCTCCCCC 
AGACTCGGGCTCCGAGGCCTACTCCCCCCAGCAGGTGAATGAGCCCCACCTNCTGCGCAC 

GATMCCCCTGAGACACTGTGCCACGTGGGGAGTGCCCTTC 

Sequence 28 

CGCGTCCGCNAGGGAGGGCCGAGCGAGGCGCAGGCAACCGGGCAGCAGGCATGATGGCCT 

CGCCTAGTGACTCCAGCCGCTCGCTGACCAGCCGGCCCAGCACCAGGGGCCTTACCCACC 

TNCGCCTNCACCGACCCTGGCTGCAGGCCCTGCTTACGCTGGGGCTGGTCCAAGTGCTCC 

TGGGCATACTGGTGGTCACCTTCAGCATGGTGGCCTCTTCCGTCACCACCACCGAGAGCA 

TNAAGAGGTCCTGCCCGTCTTGGG 

Sequence 29 

ACGCGTCCGATTTTAGTNGCAAGGAGTCCATGTGTTCAACTCCAGCATTTCCTGTGTCTC 

CAGAGACACCGTATGTGAAAACAGCGCTGCGCCATCCTCCGTTCAGCCCACCTGAGCCCC 

CGCTGAGCAGCCCAGCCAGTCAGCACAAAGGAGGACGTGAACCACGAAGCTGCCCTGAGA 

CGCTCACTCACGCTGTGGGGATGTCAGAGAGCCCCATCGGACCCAAATCCACGATGCTCC 

GGGCTGATGCGTCCTCGACGCCCTCCTTTCAGCAGGCTTTTGCTTCTTCCTGCACCATTT 

CCAGCAACGGCCCTGGGCAGAGGAGAGAGAGCTCCTTCTTCTGCAGAACGCCAGTGGGTG 

GAGAGCAGNCCCAAGCCCATGGGTTTCCCTGCTGG 

Sequence 30 

CCGGTTNCTTGTNNGTTNATTGATTTAMATAGAATATCAATTGMTTTAGAAMTTCTC 

AAAAGCCAGTTTAATGCTGTTCATCTTTTAAGGCCAAAAAAAGTTTAATCCAGAGGCAGT 

CTTTCATTCTGCACTMTTTATAATTTAGATCAAAGMCTAATTATATATCTCAMTTTA 

ATAATAAAAAGGTATAGTAATGAGAATTAAATTTATGGTAAATTATATAACTCAGAATGT 

TAMGTAACTTGGAMTTCTTMTCTMGTTAAGTATCTTTTTATTTCTTACTTGTCCTG 

TTTGATTTATTAAGGGMAAGAAAATTTTAAGGAGTTGCCAGTATTTCTTT 

AAGTGGAATGTTTATTCACCCTTATTTATATACTTAAAAGACATTGTATTGGCCTGGTCT 

CGAACTCCTGACCTCAAAGGTGGATCCACCCACCTCGGCTTCCCAAAGTGCTGGGATTA 

Sequence 31 

GCCGGTGATTTTGAACMTTCTGAAATATTTCAGGTAAGATTAATAACATCCAATTACAA 

ATATATGTTTCAATATTTTATACGTATGTCTACTTTGAAAGTTAAACCAATAGTATAGAA 

AGCCTAAGAATGAACACTGATTGGACATACTCACAGAAATTAAGGGAAAAACACATATTG 

TAAMTTCCTGTCMTGTTTGAGTAGAATACAGAAGTACATAGCAGTCTTCAATTTTTAA 

ACACAATTATGGGCTTATMCTGGACGTGACATGCATCATTTATTAGAACAATATTATTT 

ATTTATACTAAGTAAGGATATAAGATCACAGAAGCTTAGTGTTATAACGGAGACTTCACA 

GACATTCATAACTAATGTTTTCTAAGGCAAATAAGGGGCATAAACCAGAACTCATGGGTC 

AGTGCCAGAGGTMGTATMAAAGGTTATGTATGAAAGACATTTATTTATAGGAGAATTT 

CTGAGGGATTCTATGCCTTTTCAACTTA 

Sequence 32 

NCGTCCGGGAAACTGGTTCNGATGGTGTCTGCCCAGGAGCGCCTGACACGCACCTTCACA 

CGCAGCAGCCACACCTACACCCGCACGGAGCGCACGGAGATCAGCAAGACGCGGGGCGGG 

GAGACAAAGCGCGAGGTGCGGGTGGAGGAGTCCACCCAGGTCGGCGGGGACCCCTTCCCT 

GCTGTGTTTGGGGACTTCCTGGGCCGGGAGCGCCTGGGATCCTTCGGCAGCATCACCCGG 

CAGCAGGAGGGTGAGGCCAGCTCTCAGGACATGACTGCACAGGTGACCAGCCCATCGGGC 

AAGGTGGAAGCCGCAGAGATCGTNGAGGGCGAGGACAGCGCCTACAGCGTGCGCTTTGTG 

CCCCAGGAAATGGGGCCCCATACGGTCGCTGTCAAGTACCGTGGCCNGCACGTGCCCGGC 

AGCCCCTTTCAGTTTACTGTGGGGCCGCTNGGTNGAAAGGTGGTGCCCACAAGGTGCGGG 

CCCGGAGGCAC 

Sequence 33 

CCGCGTCCGCAGGMATTGTTAMAATAATTTGGGGGTGTTTATTGGGGAAGGAAACAGG 

GCCTTGACAGTGGAGGACTTGGAAGACATGTAATTTAAGATATAGAGTATGATTGTTGGA 

MATMGCATGGAGATCCAGAAGGAATCTTAAGAGTTTTTCTATGCAAGTGAAGATGGAA 

GAAAATATGTATTTTACMAAGATAAATTACAAGTACCTTATTTGCTTTGCAAAATAACT 

TATCATGTTCTTCCACTATTTTTATTATATTTTAATTTTAATGAAACTTATATMCATTT 
ACACTAMTTTTAAATACATGGCTCAAGACAAAAAAATGGGGAAAATATTTTTTAAAAAA 
TCACCCCAAATCCTGGTACTCAGACATAACCACTATTCAAACTTGGCAGAGTAATATTTT 
TCCTGTCTGCATGTGTGCCACNATGTGTGCATGCATACCACAAAGTAGATGTTTTACTAT 
ATCTCCTGGGTTATTATCTG C I I 1 1 I ICCC 
Sequence 34 

ACGCGTCCGGGACAAGAGAAACATAAGCNGGAAAAAAGGAGGAGGAATAAACACACGCCT 

GTCCATAATAAAACTCGCTCTTGAAGACTCAGCGGCAGCCCTGCACCGGAGACTGACGAC 

TTGCGCGGCTGTGACCTCCGCCCTGCAGCGGACCCTCGACTGCCCTGCACTGCGGCTCTG 

GAGGCCCCGACTCAGTGCATGGGAAAGAAATCCTCACTATCAGAAAACAGAGGGGCAATC 

TGCTGCTCTCCCTTTCCGGCCAAACACGTCACCCATCAACCGGATACCTACCAAGAGGCT 

TTCAGAGGAGGCGCCCAAGGTCTCCCAGGCCCGCCCCTCCCCAATCACGCTCCGCTCAGC 

CCCCTCAACTTTTGGCCTCCGGGAAGTTCGCAGCGTNTCTCACGCTTGGCAGGAAGTTCC 

CGCCAAGGCTTTCGGAAAATCCTTTAAAAAGCAACGCTTGCGCTGGGCGGGGCTTTGGTG 

Sequence 35 

CCCCGCGTCCGGTAGATTGCTTGTGGCTGGCAGTGAAGATGGTGGAGTTCAAC I I I I IG A 

TATAAGTGGGAGGGCTCCCCTCAGGCAGTTTGAAGGCCATACAAAGTAAGAGACAGTTGG 

TTTCTGTGTGTTCTGGTTTTATTTTGTTGTAAGCTC I I I I I I I CTCTGGACTTTGGTTAA 

AMGATAGAGATCAGTTTTATGGAGATTATTTGCCTATAGGTACTATATTTCCTGATTGT 

TCTAAGAGTGCTTAACTTGGGTTCCGTGGTGCAGTTTCATGGGGCTTATGAATTCCCTAG 

AATTGTATGTGATATTTTAGGAAATACACGTTTATCTAGGGAGCTACTCTGTAGCTTTTG 

GTTMCTTTAGTGGGGTCTGTGGCCCAGCTGAGATTATGAATTACTGACCTGAAGACAAC 

CTTACAGCTGGTAATGACAGCTCTATAGGCCTGTACTGTCTTAGAGGCTCTTATGTTGAA 

GTCAAGTANGAAGGTGGATTTTCTTCTTGAATTATAGTGTTTTGCCCCTTAATAA 

Sequence 36 

CNCGCGTCCGCGGACGCGTGGGGGCGAGGGCCGCTGGGGCCGCGAAGTGGGGCGGCCGGG 

TGQGGTACGAGCCGGGTCTGGGCTGAGGGGCGCGGCTTCGCGGTGGACCCCAGCCCGGCA 

ACGGGAAGGCGAGCTCTCCTCCACCGTCCAAAGTAAACTTTGCCGCTCCTTCCGCGGCGC 

TCCCGAGTCCTCGCCGCCGCCGGGCCGCCGCAGTCCGCGAAGAGCCGTCCTGCGTCAGGG 

CCTCCTTCCCTGGCCCGGCGCGGGGCCACTGCGCCATGGACGCCACAGCACTGGAGCGGG 

ACGCTGTGCAGTTCGCCCGTCTGGCGGTTCAGCGCGACCACGAAGGCCGCTACTCCGAGG 

CGGTGTTTTATTACAAGGAAGCTGCACAAGCCTTAATTTATGCTGAGATGGCAGGATCAA 

GCCTAGAAAATATTCAAGAAAAA 

Sequence 37 

GCGTCCGCGGACGCGTGGGCCGCCGCCGCCGCCGCCGCCGCGATGTGACCTTCAGGGCCG 

CCAGGACGGGATGACCNGGAGCCTCCGCCCCGCGGNGCCCGNGGCTCGCCTCGGCCTCCC 

GGGCGCTCTGACCGCGCGTTACCCGGCCCGCCATGGCCCCCTTCTCTCTNGCCCGGGCNC 

ACGCTCGACCCTGCGCNCGTATCGCCAGCAAGTCTGCTGCCTGCATGCTGCNTGGCTCGC 

GAGNNGTTGGCGCGCNTTGCNCTGNNGCCTGTTTGCACGAGCNGCCGCCGAGGCCCACTG 

CANGNTTANCTCGACGGTCTCCNAAGNCANACCGCNCCGCCGNCCCTTTCCAAGGNTCNT 

TTCNCGAGGGGANGGGCNCAAANGGCCAGGGGNCCCCACNCNTGGGGAAGAAGGGNGGAG 

TTNGNCNGATGNGNATGAGNAANCCTTGTAGNCAAGANCCCGTCCCNAAGNGGNAGNGNC 

AGCCCNGGGGAAGGGTTGGTTTTCNNANGNTAAACCGNAACCCCCCCGGAAGTNAACCGG 

NGATTTTMTTTATTTTTACCCCNCANANGNAAATAACCTTTNAANAACCATGGGNCATT 

NNAAANCAMAGNTAAATTNGGGGGNAACTTMCCGNTAAACCANCCCAAAA 

Sequence 38 

CNCGCGTCCGGCGGGTCCCGCCGGCGGGTACCTGGGCACTGCGCCCCATCTGGACTGAAA 

TGGGGACACCCCTTCGGGGGTCCCAGGCTCCTGGCCGTATTGTTCTCCTTCTCCTCGTGA 

TAACTCCGCAGTGGAGGTGGATTCCGTCCAAGACGCCCAACGTGGCTCCGCGTAGCAATC 

AGCGCTGCAATCCTGGCGGTTACCTCAGCGGCGGCGTCTCTCTCTGCGCCTCACACTCGC 

AGCCCGCGGCCCTCCCCAACTTAGGGCGTTTACAAAAGAAACTACTCCAGACGCGCTGCA 
AAGGGAGGCGCATGTGCCCGAAAGCTGGCGATCAGACGGGGGGGGGCATTCTGCATGTGT 

GATGTTTCTGGGGGCGGTGGGGAGTGTGTGTCGGGGTCGGGGGGCGGG 

Sequence 39 

NTGGTCTTTCTTCCTAAGAAGGAGATGGAGCCAATTTCTCACAGCTCGTGCCTCAGTACT 

GAGGGTATGGAGGAAAAGGCAGTCAGTCAGTGTCTAAAAATGACGCACGCAAGAGACGCT 

CGGGGAAGATGTAGCTGGACCTCTGAGATTTACTTTTCTCAAGGTGGACAAGCTGTAGCC 

ATCGGGCAATTTAAAGATCGAATTACAGGGTCCAACGATCCAGGTAATGCATCTATCACT 

ATCTCGCATATGCAGCCAGCAGACAGTGGAATTTACATCTGCGATGTTAACAACCCCCCA 

GACTTTCTCGGCCAAAACCAAGGCATCCTCAACGTCAGTGTGTTAGTGAAACCTTCTAAG 

CCCCTTTGTAGCGTTCAAGGAAGACCAGAAACTGGCCACACTATTTCCCTTTCCTGTCTC 

TCTGCGCTTGGAACACCTTCCCTGTGTACTACTGGCATAAACTTGAGGGAAGAGACATCG 

TGCCAGTGNAAGAAAACTTCAACCCAACCACCGGGAT 

Sequence 40 

CACCGNNCGCGCTCCTTCTGCCGCCAGGGCGAGGCTGGCACCCGGCCAGCGCGGGCAGGG 

CCACGGGTGCCCGGCTGTTTCCCGGTTGTGGAAGGCGCTCAAGGTGCGCGGCCCGGGGCG 

CGCTACTGGGGGCGCCCTCCGCGGTGGGCAAGCGCGCCAGGGATCGGCCTGGGCANGCCG 

CGGGGCGCGCGAANGCTGCGCTTTCCCTACGCCCCCCCTCGCTTCCTCCGGCACGGCGGC 

MCGGAGATTTCCTCTCGGGGAAACTACGCGGATCCTTTTCGGGGATCCTCGCCCCGCCC 

CAATTTCTNCGCCCCCTCCCCTTTGCTGGGGCGCCTGGGCTGGCCCGCGCAGGGGA 

Sequence 41 

CNCGTCCGGTTCCTAACACAGACGAACTCAGCTTCCTTTGCCATGCCTCTGACTCGAGCC 
AGCCTTTCTTTTTATCCTCCGTTTTTCTCAGATTCCCTCC^ 

TAGATCGCTTCCCTCGCCCCAATCTTTCCTTGAACCCCTTTTCCAGTGTCCCAAAGCTGT 

CCACTCTTACGCCTCTTCCAGAAACACAGGCTACCTCCCCCCAATTCCCCAGTGCCACTC 

TGGATTGTMTATCCCACTCAGGAGCTTCTTTCTTGTAATTTCCCTCCCCCCACCCCCCA 

GCGTCCCCGGGTCGTCTGTTTTCTTCTATGAAGCAAATATTACTCATCAAATATAGGAAC 

AMGGCCTMGTCCTTTCTGNGCTTTATTTCTTNGGGTGACTGGATCTTAGATCCTATCA 

TTTAAGTAGATGATGGTT 

Sequence 42 

GGTGTCGACCNCGCGTCCGCGCTTCTCNCCTCGGCCCGTGGAGCCGGGGCGTCCGGGCGT 

AGCCCTCGCTCGCCTGGGTCAGGGGGTGCGCGTCGGGGGAGGCAGAAGCCATGGATCCCG 

GGCAGCAGCCGCCGCCTCAACCGGCCCCCCAGGGCCAAGGGCAGGCCGCCTTCGCAGCCC 

CCGCAGGGGCAGGGCCCGCCGTCCGGACCCGGGCAACCGGCACCCGCGGCGACCCAGGGG 

GCGCCGCAGGCACCCCCCGCCGGGCATCAGATCGTGCACGTCCGCGGGGACTCGGAGACC 

GACCTGGAGGCGCTCTTCAACCGCCGTCATGAACCCCAAGACGGCCAACGTGCCCCAGAC 

GGTGCCCATGAGGCTCCGGAAGCTGCCCGACTCCTTCTTCAAGCCGCCGGAGCCCAAATC 

CCACTCCCGACAGGCCAGTACTGATGCAGGCACTGNAGGAGCCCTGACTCACAAGCATGT 

TCGAGGCTCATTCCTCTNCAGCTTTCTTGCAGTTGGGAAGC 

Sequence 43 

GTCGACCACGCGTCCGGGAGCTGCGGCGCCCTCCCTTATCGCCTTGGCAACGACCCAGCC 

GCGCCGCGAGGAGAACCGGGAATGGAGGTCGTGGCGTGAGGGGCGCCGAGCGAGGGGAGG 

CGCGGGCCACGGGAGTTCCGGGAGTTCCGGCGTTGCCCGGCAGTCCGCAGTCCTCGGCGG 

GAAGGCTGTCCCGGCGCCTCAGGCAGCTCTGCGTGGGCCGGGGTGACTTCCTCGCGATCC 

CCTGCGCGAGGTGAAGGGCAGGGACCTTTGCCGCGCCTTCCACGCGCCGTGCCCCACCGG 

CGAAGTGGGCTCCATCTTCCTTCAAGACTTGTCGCTTCGCGGACAGGGCGCCCGTGGGTT 

CTTCCCGGCCCTTCCGTACCGTCCTCTTTCCAAGCGGGCTTCCGGAAAGCGGGTTCCTTG 

TCTTTCGGACGCATTTTCACCCCGCGCCGGGGAGGAGCTTNCCGGGNAAGGGTTCCACGG 

CGGCCGAGGGTTTTCC 

Sequence 44 

CGTCCGCGAGACTCCCGCGCCCACCACCCCCGGCGGAGCTGCTGCTGAGCCACTCAATCT 
GAGCCCTGGCTACTAATAAAGTTCGTTTAAAAATCATAATCATTCTTAAGAGAGCGAAAG 
AGGGTGCGAACTAGCCGCTCGGCCCCGCANGGAGAGCTGGCGCGTCNGGAGGAGACAGCN 

GCGGCAGCGGTTGCGCCGCGACCAGGAGGAGCCGGTGGCGCCGGGCGGCGGGTCCGCGGC 

CGGTGGGGGACNGTGAGTAGCGGCTCGCGCTGCGGTGCAGCGGCGCTGCACTCACTCGCC 

CTCTCCAGGGGCTGGGGGTTCGTCCGGCTCCACTGGGGAAGACTCAGCTTCTCCCCGGGG 

TCCCGGGTTGTGCCTTACTCTCCGGAGTGGGCAGGGGTATCGAGGGCAGGGGCCTCCCGG 

CCCGGCTCCCCCATCCCCCGGTTCGGACCGACGAGCCGCCGGCTTCTTCCCTTCCCTGAG 

CACCGATCCCAAGTTCCA 

Sequence 45 

TCACCACGCGTCCGGGCGTCCGGGTACCCGAGGGCTCTCCCGCGTTGCTGGCACCGCTGG 

CGCCGCGGTCTCGTAGCGCATGGGCCTGCTCCGAGGCGGGCTCCCATGCGCTCGGGCCAT 

GGCGCGCCTGGGCGCTGTGCGCTCCCACTACTGCGCCCTGCTGCTGGCCGCGGCGCTGGC 

CGTCTGCGCCTTCTACTACCTCGGCTCAGGCCGGGAGACCTTCTCCAGCGCCACCAAGAG 

GCTGAAGGAGGCCCGCGCCGGGGCTCCCGCCGCGCCCTCGCCGCCCGCGCTGGAGCTAGC 

GCGGGGCTCCGTGGCGCCAGCCCCCGGCGCGAAGGCCAAGAGCTTGGAGGGCGGCGGTGC 

CGGGCCGGTGGACTACCACCTGCTGATGATGTTCACCAAGGCGGAGCACAATGCCGCGCT 

GCAGGCCAAGGCCCGCGTCGCGCTTGCGCTCA 

Sequence 46 

CCACGCGTCCGTGTCGGGCCGGCGCTCCCTTCTCTGCCNGGTGGCGAGTACACCTGCTCA 

CGTAGGCGTCATGAGGTCTCCGGTTCGAGACCTGGCCCGGAACGATGGCGAGGAGAGCAC 

GGACCGCACGCCTCTTCTACCGGGCGCCCCACGGGCCGAAGCCGCTCCAGTGTGCTGCTC 

TGCTCGTTACAACTTAGCAATTTTGGCCI 1 1 II I GGTTTCTTCATTGTGTATGCATTACG 

TGTGAATCTGAGTGTTGCGTTAGTGGATATGGTAGATTCAAATACAACTTTAGAAGATAA 

TAGAACTTCCAAGGCGTGTCCAGAGCATTCTGCTCCCATAAAAGTTCATCATAATCAAAC 

GGGTAAGAAGTACCAATGGGATGCAGAAACTCAAGGATGGATTCTCGGTTCC I 1 1 1 I 1 1 A 

TGGCTACATCATCACACAGATTCCTGGAGGATATGTTGCCAGCAAAATAGGGGGGAAAAT 

GCTGCTAGGATTTTGGGA 

Sequence 47 

CGCGTCCGCGGACGCGTGGGCGGGGCCGCGGAGCCGGGCCGGGGCATGCGCCGTCTCCGN 

CTCGGGGCCGNCGGGGGCGCCCTGCTGAGCGCTACCCACGTGCGTCCGCGCCACCTCGCG 

GGCGACCCCGCGGCCAAGGCCCCCGGCGGAGCGGNTCCCGGGCGCCCCGAACTAGCCCCC 

AACTTTGGGCGAAGTTTGCCTGCGCCTCTCCCCGCCCCCACGCGGCGCGCCGGGGCCGCG 

GACGGNAGCGGCCCCCGGGGATGCGCCTTCCCGGGGTACCCCTGGCGCGCCCTGCGCTGC 

TGCTGNTGCTGCCGNTGCTCGCGCCGCTGATGGGAACGGGTGCGCCGGCCGAGCTGCGGG 

TCCGNGTGCGGCTGCCGGACGGCCAGGTGACCGANGAGAGCCTGCAGGCGGACAGCGACG 

CGGACAGCATCAAGNCTCGAGCTGCGCAAGCCNGACGGCACCCTCNTNTTCTTNACCGCC 

GACTTTAAGA 

Sequence 48 

GCGTCCGCTGCATTGCGCCCACCGACTCCACTATGTTGAAGAAATTCGACAAGAAGGATG 

AGGAGTCAGGTGGAGGCTCCAACCCATTCCAGCACCTTGAGAAGAGTGCGGTACTCCAGG 

AGGCCCGTGTATTTAATGAAACTCCCATCAACCCTCGGAAATGTGCCCACATCCTCACCA 

AGATTCTTTATCTCATAAACCAGGGGGAGCACCTGGGGACCACGGAAGCGACCGAGGCCT 

TCTTTGCCATGACCAAGCTCTTTCAGTCCAATGATCCCACACTCCGTCGGATGTGCTACT 

TGACCATCAAGGAGATGTCTTGCATTGCAGAGGATGTCATCATTGTCACCAGCAGCCTAA 

CAAAAGACATGACTGGGAAAGAAGACAACTACCGGGGCCCGG 

Sequence 49 

ACGCGTCCGCAGAANGCTCTCAGATGGGACAGTCTTTTACTTTCATTCTCACCTCTGTAA 

ATAGCAGGACAGGTTGGGGTGGGCCTGACTTCTATTCTGCTTTCAGGGGGTACTTACTGG 

AAAATCAACTTAGGAAGTGAATTTGAGGGTTGGTGTAATTTTAAGCCCAGCCTCTGATCC 

TTGGTTGCACAAAGCCTMTTTCCAAATATTTCTAACAGATTCAAGACTGTATTGGCAAA 

GAGGTAGAGAGCTATGATAATGACATAAATTACATAAAAATCCAGTTGAATGAATAAGAA 

GGAATTTGGGCGTATAACCCATGGAACACCAATGGTGCTAAGAATTTGCCAAACCCTCAG 
CTTCGTATAAGTCCCAAAGAACTCAGGCTTATAGCACTAAGCAAATTACCAGTGGGAGGA 
GGAGGACCTGCCACGGAGCTGGATAATTACATTTAAATATTTTTGCCANTCTGTTGGAGA 
C 

Sequence 50 

NCGCGTCCGGCAGGCGGGTCGTAGAGAGCGTTCANGCCGTCTGTATATCTCCCCAGATAC 

CTGAAACTGACCACCTGAGTACGTTTTCCCATTGCTGAGCTGTTTCCCTGATATCTGGCC 

ATGCAACGGAGATCAAGAGGGATAAATACTGGACTTATTCTACTCCTTTCTCAAATCTTC 

CATGTTGGGATCAACAATATTCCACCTGTCACCCTAGCAACTTTGGCCCTNAACATCTGG 

TTCTTNTTGAACCCTCANAAGCCACTGTATAGCTCCTGCCTTAGTGTGGAGAAGTGTTAC 

CAGCAAAAAGACTGGCAGCGTTTACTGCTCTCTCCCCTTCACCATGCTGATGATTGGCAT 

TTGTATTTCAATATGGCATCCATGCTCTGGAAAGGAATAAATCTAGAAAGAAGACTGGGG 

AAGTAGAT 

Sequence 51 

GCGTCCGGAAGGTCCTTGAGCCATCTGGATGGCGGGCAGTCTGGCACACTAATGTGTTCA 

AGGTGCTGGTTGAGATCACAGATGTGGACTTTGCAGCCTTGAAGGCAGTGGTGAGGCTTG 

CTGAACCATACCTCTGTGACTCTCAAGTGAGCACTTTTACCATGGAGTGCATGAAGGAGC 

TCCTTGATCTGAAGGAGCATCGGTTGCCCCTGCAGGAGCTGTGGGTGGTGTTTGATGATT 

CAGGA 

Sequence 52 

GTCGACCNCGCGTCCGGAAAAAAAAAATGACCCAGAGATATTAACAACTTGACCTGGTTA 

TACAGTTAGATATTTGCACAGTCTGGACTCAAACTGGAGGCTTCTGACTCCTCATCTAGG 

CTCCTCTCACTCTGCCATTGCATGGGTTTTCTCATATACCTTCTCTCATAAGGTTTTCAC 

AAATTTGTCACCGTCAAATAATTATCAAAATTATTCACACTATTATAGATGAAAATAATG 

TGCTTATAAAGATTAAGTAACTTTCCTGAGGGCGCAGGTATCTGGTTCAGATAACAACTA 

GCCTGGCTTAGAATAAACACATATTTCCTGGTTCTGAAGTTGGTGTCCTTCCTACCACTT 

TCTGCTGTCTCCTAAAGATAAAGAATGTTATTGGCTCACTGAATTAATCCATTCTGTTCC 

TGGCTGAAATAAAAATTGGTATATTCCTTACGTGAAGTGTCAACAGGAAGGGGGCTTTTA 

CAACTTCCTTT 

Sequence 53 

GGAGTCNCCACGCGTCCGCGCGCTGGAGGAGTGGAGCAGCACCCGGCCGGCCCTGGGGGC 

TGACAGTCGGCAAAGTTTGGCCCGAAGAGGAAGTGGTCTCAAACCCCGGCAGGTGGCGAC 

CAGGCCAGACCAGGGGCGCTCGCTGCCTGCGGGCGGGCTGTAGGCGAGGGCGCGCCCCAG 

TGCCGAGACCCGGGGCTTCAGGAGCCGGCCCCGGGAGAGAAGAGTGCGGCGGCGGACGGA 

GAAAACAACfeCAAAGTTGGCGAAAGGCACCGCCCCTACTCCCGGGCTGCCGCCGCCTCC 

CCGCCCCCAGCCCTGGCATCCAGAGTACGGGTCGAGCCCGGGCCATGGAGCCCCCCTGGG 

GAGGCGGCACCAGGGAGCCTGGGCGCCCGGGGCTCCGCCGCGACCCCATCGGGTAGACCA 

CAGAAGCTCCGGGACCCTTCCGGCACCTCTGGACAGCCCAGGATGCTGTTGGCCACCCTC 

CTNCTCTTCTCTTGGAGGGCGCTCTGGCCCATCAGACCGGATTATTTTTTCAA 

Sequence 54 

CNCCCGTCCGGAATNCCCATAGTTAGCTGCTGTGCTTTCACAACTTCTTTCTCTGTAAAT 

TCCTCGCTTGGCNCTGAGAAGGAAAAAAGATGTTCGTAAAGGGCTCAGCGAGGAATTTAC 

AGAGMAGAAGTTGTGAMCCACCATAGTTAGTTGCTGTGCTTTGMTTTCTTTTGCTCA 

AATGGCCTCAGCGAAATCTTATTTGCCTATAGCATATCTACAAAAAATTTTCCTAGACCG 

TCTTTTCTACAACTGGATGGTAAAGTTGATTGAAGTGTGCCTCATGTAGCTTTATGTTTG 

GGGCATTTGAAGGGCTATGGCTGGACCAGAGTGTAATATAAATGCTTAATAGAGAGGGGA 

AAAGAAGAGTGTAAGAACCATTATAGGGCTGGGCTCACGCCTGTAATCCCAGCATTTTGG 

GAGGCTGAGGCAGGCGGATCACGAGGTCAGGAGTTCNAGACCAGCCTGACCAACATGGTG 

AAACCCCATCTCTCTAAAAATACAAAAAT 

Sequence 55 

GTCGACCNCGCGTCCGCAGCCTTCGGCTGCGGAGGGGGCTCGGCGGCGGCCGGCGGAGAA 
AGTTGCTCCGAGAAGAGGCTGGGTCGAGCTGGGCCGAGCCGGGCGCGCAGGGCGGGCGTC 
GCGGGCGTCCCGGGCGGACGCGGCGCGGAGACTGCCGGCGCGTCCCGGGGGTTCCGATTT 

GAAGACCTTGCTTCTCATCACCCACTGGATTATGCCCCAGGCTTTCCTACCCAATGATCC 

TCTTGCAACACGCCGTGCTTCCTCCACCTAAGCAGCCCTCACCCTCGCCTCCTATGTCAG 

TGGCCACCAGGTCTACAGGAACCTTGCAGCTTCCACCACAGAAGCCTTTTGGGCAGGAGG 

CTTCCTTGCCTCTTGCAGGGGAAGAAGAGTTATCGAAGGGAGGGGAGCAAGGACTGTGCC 

CTGGAGGAGCTATGTAAGCCCCTGTACTGCAAACTCTGCAATGTCACCTTGAACTCTGCA 

CAGCAAGCCCAGGCTCATTATCANGGTAAAAATCATGGTAAGAAA 

Sequence 56 

ACCNCGCGTCCGGACCTGTTGGCGACATGGTGGCACCCGTGCTGGAGACTTCTCACGTGT 

TTTGCTGCCCAAACCGGGTGCGGGGAGTCCTGAACTGGAGCTCTGGGCCCAGAGGACTTC 

TGGCCTTTGGCACGTCCTGCTCCGTGGTGCTCTATGACCCCCTGAAAAGGGTTGTTGTTA 

CCAACTTGAATGGTCACACCGCCCGAGTCAATTGCATACAGTGGATTTGTAAACAGGATG 

GCTCCCCTTCTACTGAATTAGTTTCTGGAGGATCTGATAATCAAGTGATTCACTGGGAAA 

TAGAGGATAATCAGCTTTTAAAAGCAGTGCATCTTCAAGGCCATGAAGGACCTGTTTATG 

GGGTGCATGCTGTTTACCAGAGGAGGACATCAAGATCCTGCATTATGTACACTGATCGTT 

TCTGCAGCTGCAGATTCTGCTGTTCGACTCTGGTCTAAAAAGGGTCCAGAAAGTAATGTG 

CCTTCAAACTTTAAACTTTGGAA 

Sequence 57 

GTCGACCACGCGTCCGTCTCATTTGTGAAGAGGTCTGTCTTCCTGAGGAAGCAGGGGACC 

CTCACCTGTGAACCAAGTGTGCCATGGGAGCTGCTCCATGTGCAGGTCCAGGTCTCCTGG 

TCTGCAGGGAACGGCACAAGAGGGCTGGCCTAGGCCAGGAGGATGTGATCTGTCCTAGAA 

GGGGGCTGACCTGCTTGCTGACCCCGCTTGCTGCTGCCTGGCTGACCTGACTCAGCCACG 

GCTGTTCCGAGGGCCCTTCTGAGTACGAACTTCCAGTTGGAGGATCTGGGTGAAGACCCA 

GCTGCTTGAGATAGCAGCCTCTGGCTAGGCCCTTGGCGTGGCCAAGCCAATCAGGCAGGT 

TTAGAGCCTGGTGCCCCTAGACAGGTCCTGCAACCAAGAACAGGGGGTAGCCTTCAAAGG 

CCAGCCCTGCCTTCCAACCACCGCTCCACAGCGAGGGAAACCAAGGCTCTTAGGGCAGGA 

GGGTTGT 

Sequence 58 

CCNCGCGTCCGGGGAGCAGGGATCAACGGTGGTCCCCGTAAACCTGACAGTAAACCTGAC 

AGAGGCTGCAGGAGTGCATTTCCACCCAGGGTGCACTCAGCGAGTGGAACTCCACACCCG 

TTTCTTTGGAGTCAAGGCGCGACCTCTCAGGGAGGAGACTGCTCCTGGTTGCCCACTGCC 

GGGTCATCCCAGCTTGCAGTGGAACCCTCCGCAGCCTGGCCTCTTCCAGGGTAGCCGCTC 

ACTCCCCTCTCTCTTGTCCTAGGATAAGGCCGAGGAAGGCTGACGAGTTCCAGCTCTGGG 

GATGCCCTATCAGCTGTGTCACCTTGAACAAATCATTTCTCCTCTTGGGTCTCTGTTTCC 

TCCAGTGTGAAACGTGGTGAAGGCATGAGGGGCTATGGGAGCCCCAAGGCCTCTTTCAGA 

GATCTCCTCTGGGTCCCATGTGACCCCGTGGCTATCCCCAAAGGCAAGAGGGTCCCCAGC 

CCTGCACCAAGGCCCTGGG 

Sequence 59 

CCGAGGAAAGGAGTTGGTTCGCGCAGGTGCGGCGCCTGGGTCCCCATGGCGCTGTGGCGC 

GGCTCCGCGTACGCGGGCTTCCTGGCGCTGGCCGTGGGCTGCGTCTTCCTGCTGGAGCCA 

GAGCTGCCAGGCTCGGCGCTGCGCTCTCTCTGGAGCTCGCTGTGTCTGGGGCCCGCGCCT 

GCGCCCCCGGGACCCGTCTCCCCCGAGGGCCGGTTGGCGGCAGCCTGGGACGCGCTTATQ 

GTGCGGCCAGTCCGGCGCTGGCGCCGCGTGGCAGTGGGGTGAGTGCCAACGGGGCCTGGG 

TCTNTGAGCCTCCGAGGTCGGCCTTGGAGGTCGGGCGGAGCCGCGCAGAAACAGGGCTTC 

TCAGAGGNCCCCGGGAGGCGCTTGCTGTCCGCGCTGGCCCC 

Sequence 60 

CGCGTCCGGTGGGAAGCCAGAAGATAAAACCAAATGGCTGGGCACGTCTTTAGGTTATTC 

CTAGCTAAGAGTTAAGAGTTGTAAGCTCTCTCATTCTTTGTTCTTCAGCCTTAAACTATC 

TTTCCTTCTATTAACTTTATTTGTCTCAGTTACAATGATAGAGGTAACTTCACATACTAA 

AAGAAATTAGGTTACCATGTGAAACATTCTTCTTGGCTTGTGCTAATGTTATCAGATCCA 

MCAGCATCTGMAGAAAATTTTCCAAGTACGATGTTGTTCTCTTGTTTTCTGAAATACA 
TATCATATGTTMAGTGAGAGTTTTTATACATGTTGAAAGAAGTTGAATGACATAACAAA

TAGTTACTGAGGCCTCCATTTTCTTACTTCACAG7TAAAATTCCTGTTTCTCTTTGGGTA 

TAGGAGGGTAGAAAGAAGTGGGAGAGTAATAGCATTTTAAAACACAGAATCAAAAATCAT 

ATTAAAAGTAG 

Sequence 61 

CGTCCGGAGCATCGCGCACTGCGGCCGGGTCACCGACGTGGGCATCCGCTACGTGGCCAA 

GTACTGCAGCAATGCTGCGCTACCTCAACGCGAGGGGCTGCGAGGGCATCACGGACCACG 

GTGTGGAGTACCTCGCCAAGAACTGCACCAAACTCAAATCCCTGGATATCGGCAAATGCC 

CTTTGGTATCCGACACGGGCCTGGAGTGCCTGGCCCTGAACTGCTTCAACCTCAAGCGGC 

TCAGCCTCAAGTCCTGCGAGAGCATCACCGGCCAGGGCTTGCAGATCGTGGCCGCCAACT 

GCTTTGACCTCCAGACGCTGAATGTCCAGGACTGCGAGGTCTCCGTGGAGGCCCTGCGCT 

TTGTCAAACGCCACTGCAAGCGCTGCGTCATCGAGCACACCAACCCGGCTTTCTTCTGAA 

GGGACAGAGTTCATCCGGCGTTGTATTCACACAAACCTGAACMAGCMATTTTTTTAAA 

AGCAGCGTATGTAAAGCACCGACACCCACTCAAAACAAGCTCTTTCTTTCNGGAAGGGTA 

TTAAGGAAT 

Sequence 62 

NCCACGCGTCCGCCAGNCTGTGAAGGATCCCAGACTGGCATATGCAGGAGGAAATGGGGC 

GGGCGAGGAGTAAGGACCCCAAAAAGCAGGGGTAGGGAAGGGCCCTCCCAGCGCCCCACT 

GTAATAGGGGCCTCATCAATGCCCCATGCTCACTGAATAAAGCACTGCCAGCGAAAGGTG 

AAAAGAGGAACAAAGAACATTCTCCTGGACGCCACCCACAGAAAGCCACGTGCAGGCTTG 

GCCCTCACCTTGGGGACCTTGGACACGGAGCTGGTTATGTCACATCTGGCTCTCAGAGCT 

GGGGCAGCGTCTAGGAGGCCTGATGTAGAAAGCACTCAGCTAAGCCCTAGTTACCGGCAC 

ACGGGCACCAGCGCCCCCTCTCAGCAAACTTNCACGTCTTATGAAATTAGCACTGGATTT 

CCACTTCAATTGGA 

Sequence 63 

CCCACTGTAATAGGGGCCTCATCAATGCCCCATGCTCACTGAATAAAGCACTGCCAGCGA 

AAGGTGAAAAGAGGAACAAAGAACATTNTCCTGGACGCCACCCACAGAAAGCCACNTGCA 

GGCTTGGCCCTCACCTTGGGGACCTTGGACACGGAGCTGGTTATGTCACATCTGGCTCTC 

AGAGCTGGGGCAGCTGTCTAGGAGGCCTGATGTAGAAAGCACTCAGCTAAGCCCTATTTA 

CCGGCACACGGGCACCAGCGCCCCCTNTCAGCAAACTCCACGTNTTATGAAATTAGCACT 

GGATTTCCACTTCAATTGGA 

Sequence 64 

NCGCGTCCGCTTCATCTTAGGATAAAGTCTAAATCTTTGI 111) I GCTATTGTACTAAAC 

TCATMATCCTAGGTTATAAAGATMAGCCTTAAACTTTATCTCATCATCCAGCCCAATT 

TCCAGCCACAATGAAGTACTTAAAACTCTGTGTCTTTGTACTTGCTGTTCTCTTGGCCTC 

CAATTCCTmCATCTTTTCCATTCGGTAMGTTTGTTTATCCCACAGGCCCTATC 

AAGCCTCCAGCAACTTCTCCAGACAGAGGTGTTAGCAGTGTAGGATCAGATTTCTCAACC 

ACGTCACTCCCATGTCTGGGTAGATATCTCTGCCCAAGTGTTCTCATAGCACTTGAGCAG 

TACTCTCTAAGCGCCCAGGATCTACCATGTTGC I I I I I I AAAATTTGATTAATTTATTTT 

TTTATACTGCTCCTTGTGGAGCANGGAGTGTTCCCAGAGTAGCCCACCATGTTATATTGA 

ATGGATCTGTGTGCATMTGCAGCTGTCCATCTACATCGTATATTTTTGTCTCCTCAAGG 

GTAGGGA 

Sequence 65 

GTTTGTATTCTGGATACAGGGGATACTGGGGCTCGCTATGTGTGTGGAGCCATCCCTTCC 

TTGCCCCAGCCCCACCTCCCTCTCAAACCCTCTCTGGCTCTTTCTGAGCTTCCTTTCCTG 

CTCCCCAGCTTGCCCAGTGCTCAGTGCCCCACTTGGCTCTTTTGCTACTTCGGGTCAGGT 

GGAGCCTCTTGGGAATGTGAAGTGCCTTACAGAAAGATTGCACTTCAAGAGGAGAGGCTG 

CAGGGAGCCATCCTAAACCCAGAGGCCTGGAGCTTACCGTGTCACTTTACTTTTGTACAC 

AGGGGTCTCCTTAGTGCCCTCGAGAAGGATTCTTGGCCCTGAGCTTCTACTCCTGAGGCC 

ACCTCTGTGCAGCCCCAGCTCCCTCAACTCTAGGCTGTAGTCTCAGTGGGAAAGCCTGGC 

TTGGGGGTCTCCTAGGAATGTCCACCTGAAGGCACACTTGATAGGGGCTTGCACAACTTA 
NCT

Sequence 66 

CGTTCGGCAGCAGCAGCTGGCGGCCAACCGGCACAGCCTGGTGGAGAAGCTGGGGGAGCT 

GGTGGCGGGTGCCCACGCACTGGGTGAGGGCCAGTTCCCCTTCCCCACTGCTCTGTCGGC 

CACCCCCACGGAGGAGACGCCCACCCCACCCCCAGCCGNCACCAGCGACCCCCCGGCCGA 

AGACATGCTGGTGGCCATCCGGCGTGGGGTCCGGCTCCGCAGGACCGTCACCAACGACAG 

GTCGGCGCCCCGCATCTTATGATGGCGCCACCCTCCCCATCCTCTCAGGCCCCAGTGCGA 

GCAGGTGGCCTGGTCTGTGAGCCGCAGGCACTCAGAGCAAAGGCCCAGCCAGGAGAGAGG 

ACAGAGCCAGGGCAGAGGCCATGCCACTTTATGGAAAGACACCTCACTTGGATTCCAGCA 

TTTAAACAGGAAGTGACTTCTTAGCAAGCCTGGCCAGGACGGAGCCTGCAGGCCTGGGCC 

TGGTTCGGGGTCTGTTTTATGCTCTTCGGTCCCTTCTCTTCCTTCTCTGGGGCCGTGCCT 

CTTCCTACCCATAAAGCACCAAAACCAGGGCCGCTGCCATGACAGAGGGGCCAGGCTGGC 

CTTCCTTTCACATCCCGGCCTNTTCCAAGGCTGGTCCTGCCTNACTTCTTCTGGAATGTG 

GGCCCCCTCTCCCTTGCTGACCCCTCTTCCTNTTGCTTIvrrNTTTGCTTTTCCANGCCCT 

Sequence 67 

CGCGTCCGGCAACCTAAAAATAGGATGCACCAATAGCATGTGGTTCCAAGTAAGTTGTGA

TTTTTATTTTGAGACAGTGTTCCACTGGAAGGGAGGGAAGGGCTTACATTCACAGACAGT 

MAGCAGGGCTGTGTAAGGAGCTACATTTACTTAACAGTCTCCTTTCCAGCTAGGTTTGT 

TTTATTTATTTGCAAGTCAGCTMGMTTAACCTTTTAAACCATCTAAAACAGGCAAGCA 

ATATAMGATTTCTACTAGTGCAAGGTAAGTGGTTTGAATATACAAGTGCCCTTTCCTGC 

CACCCAGTCTCACTACCGTTTTAGTCCTGCAGCTGGGTAAAGCCACTATTGTGTGGAAAC 

TCTCCCATGTGCCCTGTCTCTACCTCTGGACACACCAGCTCCTTCTTCCACCTATTCTAT 

TCCTCAGTTAAGCCMGTGATCAGAAGTAGTATTAAATGGGTAGATAATTTTA 

Sequence 68 

GCCACGCGTCCGGCGCCGGGTGCGCCAACTACGCAAAGACCAAGCGGGCTCCGCGCGGAC 

CGGCCGCGGGGCTAGGGACCCGGCTTTGGCCTTCAGGCTCCCTAGCAGCGGGGAAAAGGA 

ATTGCTGCCCGGAGTTTCTGCGGAGGTGGAGGGAGATCAGGAAACGGCTTCTTCCTCACT 

TCGCCGCCTGGTGAGTGTCGGGGAGATTGGCAAACGCCTAGGAAAGGACTGGGGAAAATA 

GCCCTGGGAAAGTGGAGAAGGTGATCAGGAGGCCGGTCCACTACGGCAGTTTATCTGTCT 

GATCAGAGCCAGACGCGACGCGTCCACTTCGCAGTTCTTTCCAGGTGTGGGGACCGCAGG 

ACAGACGGCCGATCCCGCCGCCTCCGTACCAGCACTCCCAGGGAGAGTCAGCCTCGCTCC 

CCAACGTCGAGGGCGCTCTGGCCACGAAAAGTTCCTGTCACTGTGATTCTCAATTCCTGC 

NTGGGTTTTTTT 

Sequence 69 

ACCCANACCTGGGAGGAATTAATGGAATGCTTGNCCCTGGGCAGCCTTAGAAACAGACCC 

NAGCTTATCTAANGCTGCTCCGAGGCAGTGACCCAACTANGGCTCAGGAAGTCAAGAANA 

TTGAC CMG CTTATAGTGATCACCTCTTGACCTTTGTGTCACAGTCNTTTTGCTTTTTAA 

AACCCTTTTGTGAACCGNTTATGGCCTTTGATTCTGACAGGCATCNTAGTTGTGAAGGGG 

AACANGGGCAGGATATAATGTTTCGTTTACCAAATACAANAAAATCNGANGTACCCAGNT 

AGATCACAANATTTTTTGGGAGAAGGNCTNTTGGGTCTCTTCCAGGAGNTCACTTCANNN 

TTGGNMCTTGACAGGGGCTTGGGGAATTTANTATTCCCCTTGGGCGCAGGGNCNCAAAN 

GGGTGGCANTTTCCCTCCTTGGAGN I I I I I I I I I CAAGAANTCCTTGCNTNGGGGAAAGA 

TGGTTACNANNATCCCGGAATTTCCAACCCCCTTCCCTA 1 1 1 1 1 1 I GGTTTAAGG 

Sequence 70 

CGCGTCCGGCACATTAAAAAAAAATACTTATTTTTTATTATGGAAAGGTCTTGGAACATT 

CTGATAGTGAGCTTCCGGCATTCATTTGCTGTATCTGGCTTAGGAGATGCTAGGGTGGCA 

AGAAGAGGCACAGGCTTAGATGCGCTGGGTGGAGAGTTGGCTTTAGTAATGATGGTTGAC 

TCTAACGACTTGATTATCAGCTGTGCCTTTTTTCTTCCTGCCTTCTGAGGTGTGTTGCCT

GCATCCTAATTCACGTACTGAGTAGCAAGCTAAGCAGGTTGTAGCTGGAGATTGTAAGAA 

ATCCTGAATGGAAACCAAAGAAAGACTGTCACATCACATGATGTGCCCTTTTCAATCCCA 
TGTCTCTTCCCAGTGGCATCCCAGTGCTGTCTCTGCCCCCTGCTGCTTCTGTAAAGATTT
TCTG ACACAAGTAACTGCCTCATAGACCTTCC I I I I I ATGAAATCCTGAGTTTTGGTTTG 
GGTACGTCCTTTTTAGAT 
Sequence 71 

GCGCGGCTGTCGCGAGGGCGGGGGTCGGGGCTGCAGGCGGGGCAGGGCTGGGTGGGGGCG 
CGCGACGCACCTGCCTGCTTCCTGCACGGGTGGNCCCCAAGGACTGCGGGGCCCCAGCCC 
AAAGCGGACCTTGA 
Sequence 72 

CGCCCCGCGTCCGGGCGGCTGGTGGGCGACCGGGCGCATCCTCATTGCATGTGCGGCGGC 

CCTACCTCGGCCCTGGCCTGACCCCGGCGGCCCTGCCCGCCCCTCCCTCCAGCATCATGG 

CCAGCCCAAGAACCAGGAAGGTTCTTAAAGAAGTCAGGGTGCAGGATGAGAACAACGTTT

GTTTTGAGTGTGGCGCGTTCAATCCTCAGTGGGTCAGTGTGACCTACGGCATCTGGATCT 

GCCTGGAGTGCTCGGGGAGACACCGCGGGCTTGGGGTTCACCTCAGCTTTGTGCGCTCTG 

TTACTATGGACAAGTGGAAGGACATTGAGCTTGAGAAGATGAAAGCTGGTGGGAATGCTA 

AGTTCCGAGAGTTCCTGGAGTCTCAGGAGGATTACCGATCCTTGCTGGTCCTTTGCAGGG 

AGAAGTA 

Sequence 73 

GCCCCG CNTCCGG AAATGTCCGA I I I I I I I I I AATTTAATGAAATTGTTAATGAGGAAAA 

ATTTTTAATATAGGTCTTATCTACCACACATCCCCATAGATTTMGGATTTTAATAGAAA 

GTCATGATGTATGTATTTAAGCCACGTTAAAAGAAAAAATATAACTATGGACCGGTATTC 

AGTGAATACAGTTTCATGGTTTTTMTTCTTTCAAAGCACATTAAAAATGGTGTGCTGAT 

AAACCCCAAGTAAATTAACCCTTTTTCCGTATAAATCCATTTTTTC 

AATTATATTTATTGNTGTTTACTGAATCCTGGTGTGAAAGCATATCAGATATGTATGAAC

TGCTACTGCTGTACTTCCGATTTACGGACATCATTTTATTGCTATTTGTAGACCGTGATA

ACATGAACATGAGTCCTATTTATGTGGGCCTTCAGTGGATGGGCAGTGCCACTCANGTCT 

CTGGGGGTTTCCTCTCTTAATTTTAAAGTAA 

Sequence 74 

AGTCGCCNCGCGTCCGTGTGTTTCTCCTCGGTCCCCAACTCTACCTTCCCCAACCACAGT 

TCCTGTCCCCAGATGTCCTGATGCCCACCATGGCAGGGGAGCCCAATAGACTCCCAGGAA 

CTTCAAGGAGTGTCCAGCAGTTTCTGGCTATGTGTGACAGGGGTGAAACTTCCCAAGGGG 

CCAAGTACACAGGAAGGACTTTGAACTACCAGAGCCTCCCCCATCGCTCCAGAACAGACA 

ACTCCTGGGCACCCTGGTCAGAGACCAACCAGCATATTGGGACCAGATTCCTGACTACTC 

CAGGGTGCAATCCTCAACTAACCTACACTGCCACACTACCAGAAAGAAGCAAGGGCCTTC 

AGGTTCCTCACACTCAGTCCTGGAGTGGATCTTTTTCATTCACCCTNCCACCCTNCCATT 

GNTCATCCTGTGTACCCACCATCTAAGCAGTCTTCATGTACCCCTGAGGTCAAGCTTGGA 

A 

Sequence 75 

CCCGCGTCCGGGCTGGCATGGCTCTATATAAGATTGTTGCANAAANTCCCTACTACTTTT 
GGTCTGTGATGAGCTTAATTATGCAATCTATATNGGCACAGGATGAAAACCTCTCAAAAA 
CAATGTTTCTGCCCCTTGCTGAGAGAATGGTCGAAAAAATGGTGAAAGAGGACAAGATAG 
AAGCTGAGGCTGAAGTTGAACTTTATTATATGATCCTGGAACGTTTGGGAAAGTACCAGG
AGGCCTTGGATGTCATCAGAGGGAAATTAGGAGAGAAGTTGACAAGTGAGATTCACAGTC 
GGGAAAATAAATGCATGGCTATNTACAANAAGCTGAGCAGGTGGCCAGAGTGCAATGCCC 
TTTNCCGGCGCCTCTTACT 
Sequence 76 

GNTGGAGGGAGCTTTGCTACTCTGCTCTTGGCATGACTCCAGGATTTTTTTCTGGAATCC 
AACCTCTGTCCTCTTAGGAGAAGGAACCTGTCCTTGGTTCAGATGGGTGGGCATGAGGAG 
GAAAATTTCCATTAGTGTAGAAAAGTGCTGGACAGAATCCGGTTTGGAAAATTACAAATC 
CAGTTGGTCAAAATAGGCCATTTCCTATGTGTGACCTATTCGTGGTATGCCAACTGGACT
GCTTCCTAMCAGGACGAGGAAAGTGAGGMTATTTTTATATGAAAGCCTTAGCCTGTCT 
GGCACCCATGMAAAMCTATTTATGCACTCCTACTTTCACCCGTCTTTTTGCATTCTCT 
ATTTGTAGCACAACAGAGTTGAATGCCACAAAACACCCCGTTTATAGTGAGCTGTTTTCA
GTGACCAATATCAGAAGGAGGCTTGCTTCTGGACTAGCCTACTAATTGCCAGCAGCCACC 
ATTTTTCATG 
Sequence 77 

GGAGAGTGCCTTGCCGGACCCTCAGGACGGAGCTGCTGGGCTGCTACAGTGACCAGGACT 

TTCTGGCCAAGCTGCACTGTGTGCGGCAGGCCTTCGAGGGGCTTCTGGAAGACAAGAGTA 

ACCAGCTTTTCTTCGGGAAAGTGGGCCGACAGATGGTGACAGGCCTGATGACCAAGGCTG 

AGAAGAGCCCCAAAGGCTTCCTGGAGAGCTACGAGGAGATGCTGAGCTATGCCCTGCGGC 

CCGAGACCTGGGCCACAACACGGCTGGAGCTGGAGGGCCGAGGGGTGGTATGCATGAGCT 

TCTTCGACATCGTGCTGGACTTCATCCTCATGGACGCCTTCGAGGACCTGGAGAACCCTC 

CGGCCTCGGTGCTTGCCGTCCTGCGGAACCGCTGGCTGTCANACAGCTTCAAGGAGACGG 

CCTTGGCCACTGCTTGCTGGTCGGTCCTGAAAG 

Sequence 78 

CACGCGTCCGGAGAACGTGATTTCTCAGCCGAATGAGTTTGAACATACCCCACAGGAAGA 

TGACTTGGGGTTCAAGGAAGAAGATTTGGCTCCAGATCATGAAGTAGGAAATGCCTCTCT 

CAAACCTGAAGGCATCCAGAACTGGGATGACTTATGGGTCCAGAGAGAGGGTCTAGGAAA 

GCCTCAGCCTCGGGACAGAGGCCCCCGGCTCCTGGGTGAACCACGCTGGGGCCAGGCTAG 

TAGTGATCGGGCCGCTGTGTGTGGTGAGTGTGGCAAAAGCTTCAGGCAGATGTCAGATCT 

GGTGAAACACCAGCGGACCCACACAGGGGAGAAACCCTACAAGTGTGGGGTCTGTGGCAA 

GGGCTTTGGGGATAGCTCTGC 

Sequence 79 

CGCGTCCGCAAGAAGATAACCCCAAACTCTTTTCTCAGAGAGTTTGTAGCCTAGTTTGGG 

ATAGATAAGATCCACATATTTAGTCATATAAGACTACAGGAGAGTAGAATAGATGCACCA 

GATGGTGTCGAATGAAAGTGGTACTTTGTAGACTATAAGTGCTGTAAATTCTAAAGGACA 

GGTTACTTTTGCCTGGAGTGGTCAAGAAAGATTTTATTTAAAATAAGGATTTGACGGGCA 

GACTTAGCAGTCAAAAGGGAGAAAGCGGGTAAAACAAATGTAAGCCATCATAAGAGTGCA 

TGTGGTTTGGAAGCATCAGGGAAAAGACTAGCCAACCTGAAGTAAAAGGTTCGTGCAAAT 

TGGGCAATCAATAGCATTAAGTTGGAAACAGCTTGGGGACAGACACATAAGAGGGCCAGA 

GTGTGMTAATTTCATCTMTACTTTATAGCACTTGGACATTTACAGAGCACTTTTCTC 

Sequence 80 

TNCTCTCTGCCCCCCCACATTCGTCCTCTTGATTCTCTGCTTCTCTAGCTCAGCCGCTGA 

GCTTCGTGCCTAGCCGCCACTAGTCCTTGACCAGCGTTCTGGCAACTCTTGCCTCCAAGT 

TCTTCCAGCTCCAGGCTGAGCCGATGGGGATTCAGTTTTCTGACATCACAGCTNAGTTCT 

TGATTTCTGCAGCAAAACCTTCAAGGCTTCCATCACCTCGGCTCTAGAGTCCATCATGCT 

CCTCTCCTATGACCTGCTAGGGCTGATCAAACGTTCTGTCTCCTGCCGTGCCCTGCCCTG 

CCCTGCCCTGAGCTTCGCTTAGCCTGTTGCAGGCTTTGTGTTTTCCTTCTTGTGCTGTTG 

GACCACGCAGCTCCTTCTACCCATAAAACCCCCTTCTCTAGGTCGGTGGAAATCTTGGTC 

ATCCTTCCGTGTCTAGNTAAACTGGNCACCTCCTCCACGAAAGCCTTCTAAAAACTCCTT 

CTCAGGGGAACTGGTTTTCTTTCT 

Sequence 81 

CACGCGTCCGCAAAATAGCCCCACATCCNGGCAAAAGGGGCCTTTCCCTTGGCCCAGAAG 

AAAAAGGAACAAGTGGAGTGCAGAAGAAAATCTGTACTGAGAGACTTGGGCCTAGCTTGT 

CTTCCAGTGAGCCAACCAAGGCTGGTGCTGTCCCATCCAGTCCCTCGACGCCAGCACCAC 

CCAGCGCCAAACTTGCCGAGGACTCAGCTCTGCAGGGTGTGCCCTCTCTGGTGGCAGGTG 

GAAGTCCACAGACTCTTCAGCCGGTATCCAGCAGTCACGTGGCTAAAGCTCCCAGTCTGA 

CCTTCGCTTCCCCCGCCAGTCCTGTCTGCGCATCAGACAGCACTCTCCATGGGTTAGAGA 

GCAACTCTCCCCTTTCACCACTGTCCGCTAATTATAGCTCACCTTTATGGGCTGCAGAGC 

ACCTCTGCGGCAGCCCAGATATCTTTTCAGAGCAGCGGCAGAGCAAACATAGGCGCTTTC 

AGAATACCCTAGTAGTCCTACATAAAATCTGGGTTGCTGGAGATCACTTTTGAAAACCAA 

G 

Sequence 82 
ACGCGTCCGCACCCCTGTGTCCAATGACATGTGCACCCAGGTCGCCAAGCGGCCTGTGGA

CACCCAGGCCTGTAACCAGCAGCTGTGTGTGGAGTGGGCCTTCTCCAGCTGGGGCCAGTG 

CAATGGGCCTTGCATCGGGCCTCACCTAGCTGTGCAACACAGACAAGTCTTCTGCCAGAC 

ACGGGATGGCATCACCTTACCATCAGAGCAGTGCAGTGCTCTTCCGAGGCCTGTGAGCAC 

CCAGAACTGCTGGTCAGAGGCCTGCAGTGTACACTGGAGAGTCAGCCTGTGGACCCTGTG 

CACAGCTACCTGTGGCAACTACGGCTTCCAGTCCCGGCGTGTGGAGTGTGTGCATGCCCG 

CACCAACAAGGCAGTGCCTGAGCACCTGTGCTCCTGGGGGCCCCGGCCT 

Sequence 83 

CCCCGCGTCCGCTCTTACGCATTACTCTATGTCTACTGTTATGGGTGTGTAATTTTATAC 

CATAGATGTTTACTCTTTAAACAGACACTTCTAGTCTGTTTTATTTCATGTGTCTGGGAG 

CGGATAAAGTGTGAGGTTCAGGGAGAAAGAGAGGTCTGTCTCAATGCCTTGGCACGGCAT 

GAAGACAATCTCCCCTCCTTGTCCCCTTTCCCTGCTAGCTCCTGATGACTGACAGATTCA 

CAGCAGAACAGAAAGGACTGGGAAGGGATGGAGGTGGGACATCTGGCACTGACCTTCAGG 

GGCTGACCCTGTGGGGGAACATCTGCCCTGAAGAGTTGGAGCCTTCATGTGATGACACAG 

AGCTGAAGTGTGATATTCGGGAGGGGATAGAGAGTGCTTGGAGGTTTTCTGATTTTGAAG 

AATCCCAAGTCAGTC 

Sequence 84 

GTCCGGCCGCTTCCGGTCTCCCTCCCGGGCCGGCGCTGGCCTGACTGCGGCCCCGGTCCG 

TAGCACTCCGCCCTCCGCTTCTCCCGCCCTGTAGCCGCGAAGACTGCTTCAGCCTTTCCC 

TGTGCTGCCCCTGCCGCGCGATGGAGACGAGCTCGAGCTGCGAGAGTCTTGGCTCCCAGC 

CGGCGGCGGCTCGGCCGCCCAGCGTGGACTCCTTGTCCAGTTAATGTGTTAAGAGCCATT 

GACATTTGAAGATCATCAGAAGTGAAGATAAAACATCTCAAAAATTATAATTGCCTCCAC 

TTCTCATTCAGAGAATTCAGTGCATACAAAATCAGCTTCTGTTGTATCATCAGATTCCAT 

TTCAACTTCTGCCGACAACTTTTCTCCTGATTTGAGGCCCATGCAGTCCAGTTCGGGAGC 

TAAGT 

Sequence 85 

CCGCGTCCGCGTGAGGTGTGGGTGTTCGTTTCTCAGGTAAAACATGGCTAAAAGCTTACG 

GAGTAAGGTGGAAAAGAAAGATGCGTGCTGAAAAGAGAAAAAAGAATGCCCCAAAGGAGG 

CCAGCAGGCTTAAAAGTATTCTCAAACTAGACGGTGATGTTTTAATGAAAGATGTTCAAG 

AGATAGCAACTGTGGTGGTACCCAAACCCAAACATTGCCAAGAGAAAATGCAATGTGAGG 

TAAAAGATGAAAAAGATGACATGAAAATGGAGACTGATATTAAGAGAACAAAAAGACTCT 

TNTAGACCAGCATGGACNGTCCCAATTTGGNTGAACCCAAAGGCAAANAAAAANGNTTTG 

ANGGCAAACCGANNGAAAAAAAAANGGGGAAAACCAACCNAAANCCCTTAAAANGGGCCA 

ANGGGGTTTGGCCNCTGNNAATNNTTTNAACCCNTTTGAAAACCCCCCCTGGNNGANACC 

NCCCGTAAAAAATNTTCCCCCCNNTTTTTTTTTT

Sequence 86 

CCACCGCCGTCCGAGGAGGGATCACCAAGCCGTGGGCCATGAAAGTCGGGGGGGGGCACC 

GCAAGCTTGAAAGCTTCATCATTGACCTTTCNCAAGAAATTACCGGGGCCAAGGCCGCTT 

GTT TCNAAAAC CCCCATGGAAACNAAAACGANGGGGAGGCANGGGAGMCCACCCCCTTG 

GACi I I l I l l INATTTCTTCTTGGACCTACCGAAGGAGGGCACAAATTGCCGGGAAGATT 

GCTTGCCTTTCCACCTTGGGACAAGGGGATCCTGGGACTTTTCCGGCCCGGGGTTCCCNT 

TCCCGTTGGCCCGGCAAGGGATTGGGTCNAAACAATTGACCCAAGGGAAGAATCCCCGGG 

GACGGTCACMCCGGGGNACCAAAGGNAMGCTTTTTTGGGANGGGAACCCTTTTTTTTA 

ATT 

Sequence 87 

CCGGGTCCCTCCCTGCGGAGCCGCTGGTCCGGCTGGCGGAGATGTGACCGCGGGCCCGGC 

CGGCCTGCCTCAGGCGTCGCGTCAGCTCCCGTGTCCGTGCCCTTAACCCACACCGATGGC 

GGGATCCGGCTGCGCCTGGGGCGCGGAGCCGCCGCGTTTTCTGGAGGCCTTCGGGCGGCT 

GTGGCAGGTACAGAACCCGCTGGGTAGCGGCTCCTTCCNCTTCGGGGTATTCGGGTTTCG 

CTTGTTGGGGAACCCTTGGTTGGCCCCCGGGGCCCTTAAANCAGTTNTTGCCGCCAGAAA 

CCACCGGGGTTGCGGCCTTTGCCGCCNAGTATGNTTTCCGAAAAAAAAGGGCCGGGCTTG 
NACAAGTTGCANGGGTCACAAAAACATCGTGAATTTTGATNGGAGTGGTTACAATCCAC
Sequence 88 

CGTCCGTTTAATTATAACCTAGATTGTCTGGGCAACGGCAGGAACGGAGTGCCACTGTGG 

AGCAGATAACTGCAGTGGTTTTCTAGGAGTGCGGCCAAAGTCGGCATGTGCGTCAACAAA 

TGAAGAGAAGGCAAAAAATGCTAAGTTAAAACAGAAGAGACGAAAGATCAAAACAGAACC 

MAGCAGATGCATGAAGATTACTGTTTTCAATGTGGAGATGGTGGAGAGCTGGTCATGTG 

TGACAAAAAAGACTGTCCCAAAGCATACCACCTGCTATGCCTTAACCTGACTCAGCCACC 

ATATGGAAAGTGGGAGTGTCCGTGGCATCAGTGCGATGAGTGCAGCAGTGCAGCTGTTTC 

CTTCTGTGAATTCTGTCCACATTCATTTTGTAAAGATCATGAAAAGGGGGCCCTGGTTCC 

CTCTGCACTGGAAGGCCCGCCTCTGCTGCTCGGAACATGACCCCATGGCTCCTGTGTCAC 

CAGAATACTGGAGCAAGATAAAATGTAA 

Sequence 89 

NGTCGCCCCGCGTCCGTAAAATGTTAAGTCCCCTAAAAGTGAATAAATTTTAAATACCTA 

CTTTTAAAAATACTGTCTTCTAMTTGACATAATTGCTTTTCTTACCAAAAGAAGGAGAG 

GTTCCCCTMTTCCTTTTGGGCCATAGATCCGCTTTTAGGATCTGATTAAAGATGTGGAC 

TTTCACCGCAGGAAGATACCCATGGGCACACCATTCAAAATGTATCAGATTTCAACGGTT 

TTACAAACTCCCCCGGGATTAAGTCAAATGGGTGGATATTACAGCTGTTTTCATCAGATA 

TGGTTATTTGTTGACAACCATCAGGACAACAATGTTTATAGATGGAAGGATAACTTCCTG 

GTTTTTTTCCACATTGMTGTGGCTAGTTACATATCTCAATTTAAAATAAATTGTGGAAA 

GCCAAAAAAGTATGGTCAAGCTAACCTTGGGGTGGCTTTTACCTGATGCCTACAAGCACA 

GAAAAATAGTTTTTAA 

Sequence 90 

CCGCGTCCGATTATGCCAAGAGAAGGTATTACTTTAAGATGTGAAAAATGTAAAATGGAA 

AATTACATTACTAAGAAAAACAAAAAAACACAAACTGNNAAATTAGAANTGAAAAAACAT 

TGCCACAAATGCAACGCACATACAAGTCATAAAGAAAAAAAATAATTTAAAATGAATAGA 

AAATATTTAAGACTTAAGCTTTGAAGAACACAATTTAGATGCAATAATTTCATTTTCTCA 

ACAAACAAGATTATGAGTTTCTAATCTTCAAACAAGTGATGGAATAGTGTTTCTTGAAAA 

AGAAGAAATCCATTTATTTGTTGATTCAAGATATATAGAAGCTGCTCAAAAAGATGCAAA 

AAATGTACMGTTCATTTATTGACAGCAGCTAATTTAAAAGATTTTGTAAGTAGTAAAAA 

TTACTTAAAAATTGGTGTTGAMAAGAATACTTAACTTTAGCTGATTTTAAAAAACTTCA 

AGCTTGATTTCCAAGTGCAGAATTTGTCAAATCAATGCGCAAAAATTAAGACTTATTA 

Sequence 91 

CCGTTGTCCCATATATCTTGTTCCAGCAGCCATATATCTTGNGGTCTACACGCCTAAAGC 

ATGATTTCCCTTGAAGTCTTGGGGTTGNTTAAAGGAGAGTCCCTTCAATATAAAACCTCT 

GMATATTAGTGAGMTGGCTCACTAATGTGAACAATGTTTAAATTATTTATTTATATAT 

AGAATTACTGAATATTAGTACTGGGAAAATTTATAGAAATCATCTAGTCTTACCCTTCAT 

CTTACATATAAGAAAAATGGTCTTTTCTTCTAATCACATTTACAAAATATGATATAAACC 

TTGAGCATGAATGTATGAGCCTAATTAGAGAAACAGAAAATCAGCATGTCAGTTTTCCTT 

CATTCAAAATAACATAGTCTTTCTAAGCAGTCATTCTGGGAG 

Sequence 92 

ACCACGCGTCCGCAAGGCCCGCCCCTTACGTACTCGGAGCTCGGATCCCAGTGTGGACCT 

GGACTCGAATCCCGTTGCCGACTCGCGCTCTCGGCTTCTGCTCCGGGGCTTCTTCCCTGC 

CCGCCCGGGGCCCTGACCGTGGCTTCTTCCCCGGCCTGATCTGCGCAGCCCGGCGGGCGC 

CCAGAAGGAGCAGGCGGCGCGGGGGCGCGCTGGGCGGGGGAGGCGTGGCCGGAGCTGCGG 

CGGCAAGCGGGCTGGGACTGCTCGGCCGCCTCCTGCCCGGCGAGCAGCTCAGACCATGTC 

GCCTGAAGAATGGACGTATCTAGTGGTTCTTCTTATCTCCATCCCCATCGGCTTCCTCTT 

TAAGAAAGCCGGTCCTGGGCTGAAGAGATGGGGAGCAGCCCGCTGTGGGCCTGGGGCTCA 

CCCTGTTCACCTGTGGCCCCCACACTTTGCATTCTCTGGTCACCATCCTCGGGACCTGGG 

Sequence 93 

NCGCGTCCGCCAAGATGGCGTCCNTCATGGAAGGGCCGCTGAGCAAATGGACTAACGTGA 
TGAAGGGCTGGCAGTACCGTTGGTTCGTGCTGGACTACAATGCAGGACTGCTCTCCTACT

ACACGTCCAAGGACAAAATGATGAGAGGCTCTCGCAGAGGATGTGTTAGACTCAGAGGAG 

CTGTGATTGGTATAGACGATGAGGACGACAGCACCTTCACAATAACTGTTGATCAGAAAA 

CCTTCCATTTCCAGGCCCGTGATGCTGNTGAGCGAGAGAAGNGGA 

Sequence 94 

ACGCGTCCGCGGACGCGTGGGTGCGGGCCGGCCNCCCTGGACGAAAGAAGAGGGCCCCTC 

CAGGCCAGTCTGGGCACCCTGGGATAGCGGCTGCAGCCAGGCATGGCCGACTCTGCACAG 

GCCCAGAAGCTGGTGTACCTGGTCACAGGGGGCTGTGGCTTCCTGGGAGAGCACGTGGTG 

CGAATGCTGCTGCAGCGGGAGCCCCGGCTCGGGGAGCTGCGGGTCTTTGACCAACACCTG 

GGTCCCTGGCTGGAGGAGCTGAAGACAGGGCCTGTGAGGGTGACTGCCATCCAGGGGGAC 

GTGACCCAGGCCCATGAGGTGGCAGCAGCTGTGGCCGGAGCC 

Sequence 95 

CCCCGCGTCCGAGGTGACCTCCTTGGCCCAGATCATCTTAGAGCCAAGAAGCAGGACCAT

TCGTGGTTTTGAGGCCCTGA7TGAAAGAGAGTGGCTGCAGGCTGGTCACCCATTCCAGCA 

GCGCTGTGCACAGTCAGCCTACTGTAACACCAAGCAGAAGTGGGAGGCTCCTGTATTTCT 

TCTCTTCTTGGACTGCGTGTGGCAGATCCTTCGTCAGTTTCCCTGTTCTTTTGAGTTTAA 

TGAGMTTTCCTCATCATGCTCTTTGAGCATGCTTATGCCTCACAGTTTGGAACATTTCT 

GGGCAACAATGAAAGTGAAAGATGTAAGTTGAAGCTACAGCAGAAGACGATGTCTTTGTG 

GTCCTGGGTTAATCAGCCCAGTGAGCTGAGTAAATTCACCAATCCCCTCTTTGAAGCCAA 

CAACCTTGTCATCTGGCCTTCAGTTGCTCCGCAGAGTCTTCCACTGTGGGAAGGTATTTT 

CCTACGTTGGAATAGATCCTCTAAGTATTTGGATGAAGCATATGAAGAAATGGTTAACAT 

CATTGAATATAATAAAGAATT 

Sequence 96 

CCGCGTCCGGTTTNCCTGTTGGTTAGGCTGGTCTTGAACTCCTGACCTCACGATCTACCC 

ACCTTGGCCTCCCAAAGTGCTGGGATTACAGGCCTGAGCCACTGCACCAGGCCCACCCTG

TCTCTATTTTCTAAAATAATAAATCTGATTTTAATGTGGCTGGATATAAATCATATCACA 

GTTGGATTTGGAAGTTTGGGTTTTATTCCTAACTTTGATGGGAAGCCATTTTAAGCAGAA

AGATGATTTTAAAAGACCACTATATTTTCTGTGTGAAGAATGAACTGGGAGATTTCATAG 

TATTATTAACAAAAATAGAATAGTTGGGGATCTGGTTTGGCTTGGGAAATGGAGGAAGTT 

CAACTTTGGGCATGCTCCATTTGCATTGCCAAGACATTGCAGCAATTGGAAGTGCAGTCA 

GAGAGCTTAGGAGAAACACTTGGCAGATGGACATAGAGAAGTAGTACTCAAAGCTTGTGG 

ACATTGATTAAATAATCATACAGGAGTATGGGCTGACAAAAGATTNCAAAGAGAAAACCT 

Sequence 97 

GTCNCCACGCGTCCGGGACTCTCGGCCCTGGAGAAGGAGGTGGACTTTGACTCCGACCCC 

ATGGAGGAGTGCCTGCGGATCTTCAACGAGTCCACCAGCGTCAAGACGGAGGACAGAGGC 

CGGCTGGCCCGGCAGCCCCCCAAGGAAAAGAGTGAGGAGAAGGGGCTTTCGGGTCTGACC 

ACTCTGTTCCCCGGGCAGAAGAGGAGGATCTCCCACCTTTCCAAGCAAGGCCAGGAGGTG 

GAGCCCCCGAGGAGGGGTCCCGCGGTGCCCCCGGCCCGGCCCCCGACGGCGCAGGAGGTG 

TGCTACCTGCGGGCCCAGCAGGCGCAGAGGGCATCGGCGAGCTTGCTGCAGGCCCCCGCC 

AGGCTGGCAGAGAAGTCGCCCTNCGTCCACATTTCCCGCCCCTGGCGAGAA 

Sequence 98 

CGCCNCGCGTCCGGCAAAGCAAAAGGGAAATTATTTGGTGGATGGTAGCTCAAAATTGGA 

ACTCTTGTTCTMTTCAGTTACATTGGCTTTACCCTCCTTAGATTTTTCATCAAAGGGCT 

GTCCCATTG CMTC TTACTMAACATTTTGTTAAMTAAACTCTTTTCCTTTTTATATTA 

ATAATTAGGCTTTTAAATAAAGATGTTATTCCTTTAAAATGGTGGGCTTACCATCATTGA 

AGATGTCACTCAGGTGGCCTTGCTTGATCAMACGCCTTTTTTAAAAACCAAGCTTTAAA 

AACATGTTTATMTTTCATGAAGTACATATATATTGTTCCCATAGTCTTCAGCTTTAAAA 

CTATAAATATGCCC AMTTT TGTTATTTGCCCTACTTTAAGTAGGTTTATTGNGTTTGTT 

TTTTTCAAGTACTTGl I I I I CTCTGATAAGACTCAGGAATTCTGAAATGTGAAAATGNCT 

CAATT 

Sequence 99 
CCTTAAAAATTANTNCCTGACNCTCNTGANTTTGGGNGGAAAGCCCTTGTNTCNNCTCTC

TNATGNACTCTCATGGGTTTTTTTGTATGATTTGMTATNMTGTGCCTAAAGAATTTTT 

GCTCTCTTAATCTATGNATACATACTTGAACAAATCATTCTTGCTTAACTGCTGATCTTT 

TGTAAAACTATTG 

Sequence 100 

GCCCCGCGTCCGGCTGGAAGCAACAGTTTGGCAGCCTGGGGTACACTCAGGTTATTCGTT 

ACAACTATTATTATTTGATGTCTTTTTTTAAACTCAGGTCATCCACTTTTGACTGTCATC 

CATGGAAGAGCTCTTATTAAAAGCCTCAGACTTTCGGGACCTATGATTCTTTGGCACAAC

CTTTTGGAAAATTCTTAAGCAGGGATGAAGCAAACTTGATTGGAGTTGGGGAAAAAGAAG

ACAGATTAGTATTTTTCATGCTGACAAAAAATAGCTGCTATGACTTTTCCCGCAACGTGG 

ACAGGGGCCAAGTGAAGCTGAACTGGTCATGGTCTGTCGCCCAGTGTTCCCTTGTCGTCG 

GCGATTTTGCCCCCGACCCTTCTTGGTGGGCTTAGTGGTGGCAATCTGTCTCTTCTACCA 

GACTCTGACCCTCCGAGGGTCGAGGAAGCTCACAGCCGCTGCCCCTGGGGCTGTCCCACA 

CACATCCACTGAAA 

Sequence 101 

CCACGCGTCCGGGCGTCTCGGCTCTTCTGTATCTCCCTGGCCTGGTCGCTCTCGGCTTCT 

GGGCCTCGCCCTTCCTGTCTGTGAAATGGACTCTGGGTGAATCCAAATGGGATCGTCTCG 

GGCTACGTCCTGTCCCTCCGGGACTACAAGTCCCAAGGTGCTCGAGGCGACCTTGGCTCC 

CCCTCCCCACCGGGACCCGCCTCCCTCCAGCCCAAGTCACGTCGTCTAACCTGTTCCCAG 

CTCCTGCCCCGCCCCGTTCTCCGCTCCCCAAGCCGGAGCCCGAGCTGGAGGAAGCCCCCA 

GGTGCCAGGATCTGCTCGGATCCGNGCCCGCTCCGGCCGGCACCATGGACAGTGAGGCAT 

TCCAGAGCGCGCGGGACTTTNTGGACATGAACTTTCAGTCGCTGGCCATGAAACACATGG 

ATCTGAAGCAGATG 

Sequence 102 

CCACGCGTCCGGTCCGGGGTGGAATCACGTCGCTGCGGCTGCCGACGACCCACACCCGGC 

CGGCCGCCTCCGCAGACCCACCTTGGCCGCGCGGCAGGGGGCGCGCAGAGCCCCGAGGGA 

GCGAGTCCCCGCGCGTGGCAGCTCGGCGGCTTCTCCCTTCGGGAGGTCCGGCTCCCGGCT 

CTCCGGACCCGCCTGGCGTCCTCGCCTGCGGCGGGGCGGACGACAGCGGCGCCCAGGAAT 

GGCTTCGGCGGGCAGCGGCATGGAGGAGGTGCGCGTGTCGGTGCTGACCCCCTTGAAGCT 

GGTCGGGCTGGTGTGCATCTTCCTGGCGCTGTGTCTGGACCTGGGGGCGGTGCTGAGCCC 

GGCCTGGGTCACAGCTGACCACCAGTACTACCTTGTCGTTGTGGGGAGTCCTGCCGGAAA 

CCCGCCAGCTTGGACATCTGGCACTGTCGAGTCCACGCTCANCAANCGATTGGCAGATTG 

C 

Sequence 103 

NCGCGTCCGAGAMTTGCMTTTTTTMTTTTTMTTTTAAGAGGAATTCGTGCCAGAGA 

GAACTATTAAGAAAGGGGTATATCCAGTCTAAGGATTATTAGGCTCAAGTCCATGAATAG 

GCTCTGGGAAGTTTGTAAACACTTGGAAATTATTTGCAAAATGTGTGTGTGAATGTGCTT 

TACCTTANAGAGTTCATGMTTTCATTAGATTGTTGAAAGAGTTTTAGTATTAACAAAGG 

AAAAACAAACCACCACCATCACATAACAAAACCACAACAGTGATTTAATCTTTTCACCTA 

ACAATAAGTAAATTGAGGCTCTGATGGCTAAATTAATAGCCTGAGGCTACACAGTCAGTG 

GCAGAGCCCAGGGTANAGAGAGAACCAGCACAAGCCATTGTGGGAGCCGAGGGTAAAGAG 

AGAGCTAGGTGTTGTACCTTAGTAAATAAATCAGAA 

Sequence 104 

GNGTCGCCCCGCGTCCGGAAGGTGGAGACCGCTTACCCTGATCNGGGATGTATCGGCTGC 

GGGTGCGCAAGGCAGTCCAGGAGTGACCTGGGGCTGTGGAGAGCGACCCGTGGCCTTGTG 

TTTCAGAGTTTACCACCTAGGATGACTTCAGTGACTAGATCAGAGATCATAGATGAAAAA 

GGACCAGTGATGTCTAAGACTCATGATCATCAATTGGAATCAAGTCTCAGTCCTGTGGAA 

GTGTTTGCTAAAACATCTGCCTCCCTGGAGATGAATCAAGGCGTTTCAGAGGAAAGAATT 

CACCTTGGCTCTAGCCCTAAAAAAGGGGGAAATTGTGATCTCAGCXJACCAGGAAAGACTT 

CAGTCCGAAGTCCCTTCATTTGTCTCCTCAAGAACAATCTGCCAGTTATCAAGACAGGAG 
GCAATCCTGGCGGCGAGCAAGTATGAAAGAAACGAACCGGCGGAAGTCGCTGCATCCCAT
TCA 

Sequence 105 

CGTCCGCGCAGCGCTTGAATCCCGTGGCCTAACCGTCCCTCGGAAGACCGGTCCCGCTCG 

GGAGGCTCTGCAGTCGCGCCTGGGGTCAGGGCCGGGGGCGAATGTGGCTCGCGTTCTAGG 

CCTCCCTGGGTTGGAAAAAGACTATGTTAGCAANGTGTCACGCCATGCTTTTGCCAACTT 

TGCAATTAAAGGTTGACATTCCTGCATMGCATTTCTCTGTGAMATGTCCTTGCCTCTT 

ACAGAGGAGCAGAGGAAAAAGATTGAAGAGAATCGACAAAAGGCTCTGGCCCGCAGAGCT 

GAGAAGTTATTGGCAGAACAGCATCAGAGGACTAGCTCGGGCACCTCCATTGCTGGCAAC 

CCATTCCAGGCCAAGCAAGGCCCATCCCAAAATTTCCCAAGGGGAGTCTTGTAAGGCCAA 

GTGAGCCATTGGTGTCATTTTCAAGCAACAGAATCTCAGTAGCTCATCTAATGCTGACCA 

AAGACCTCATGATTCCACAGTTTTCANGCAANGGGAATATGGAAA 

Sequence 106 

CCGGCCCTATCCCTATATTGTTTGCTTGTTTGGGATMCCTAA 

ACTACTAATTTGTTTTACCTGATGTATCTTCTCTTTCAATMTTTTATGTTACCTTCTGT 

TTAGAATAATATTTGCCACAGATATTTAGGTTTAATTCTGTGTTTGAATGATTCCAATGC 

CTTTCTCTACCCACTTTGAACACTTCATCCTGGAATGGTTGGCTGATGTATGTCTCTAAA

CAATI I I I I I I I IAGGAGAAGGTATGTGGGTAATGTMTTCCTAMCCTTTGCTTTTCTG 

AMAATCTTTCATTTGCCTTTATACATGACCAGATTTACTGGGTATATAGATTTGTTGAT 

GMAAAAGGTAAAAAGAGCMCTTTTGACATCCAGAGGTTTGTCTGGCACTCACAGCTAG 

CCCCGTGTTATTCTCCCTATT 

Sequence 107 

GCGTCCGTCTNAACCCTAAAGCTAAAAAGTCATTGTGAACCTTTNGGTCTGATGCTAAAG 

AAGGGAAAACAGGTACAGGAAATCCCATGTGGATGCTTGCTTNCAGGATTTCCTGCCATG 

ATTCCAGAATCCCACAGCTNCAACATGATTGCAAAAAGACTCCCTGCTCATTTTNCCTCA 

GCATGCACAGCGCTGTCCTGTCTCAGTTGCAACTCGACAGAGCCGCATTTACTCCAGAAC 

CCAATCCACACACCTGCTCATCCTGCCCCGAGAGGAGTGCCTGAAGCCAATAGCAGGGAA 

CTAGAGCAGACTTGGGTGGATCTTCATTGGATATTAGGTATCTTGCCCTAGATAGGCAAG 

CAGTGGCCTTACAGATGCTGACAGATGATCTGATTAGATGCACAGNTGCTGGGTGGCGTC 

TGGGGCCAGTCTATTGGNCAGTTCTGGGAGNGGGAACTATTTGGGCTCTGCAAAGATG 

Sequence 108 

CGTCCGCTCCCTGGCCCTGCTCCGGGAGCTGTCGTTGTCTCCGCCAGCAGCCCTGTGGCT 

GCAGGAGCGCCAGGCCCAGCTTCGCCACTCGCTGCCCCTGCAGAGCTTCCTGCTGAAACC 

TGTCCAGCGCATTCTCAAGTACCATCTGCTGCTGCAGGAACTAGGGAAGCACTGGGCGGA 

GGGCCCAGGCACTGGGGGTCGCGAGATGGTGGAGGAAGCTATTGTGTCCATGACAGCGGT 

TGCCTGGTACATCAACGACATGAAGCGCAAGCAGGAGCATGCAGCGCGCCTCCAGGAAGT 

GCAGCGGCGGCTGGGTGGCTGGACCGGACCAGAGCTCAGTGCTTTTGGGGAACTGGTGTT 

GGAGGGCCGCGTTCCGAGGAGGCGGANGGGNGGGTTGGCCCCCGGCTACAAGGGGGT 

Sequence 109 

AGAATTGTGTATGCCTTGCCTATCACGGTACAGCACGAAGCCAGGCTCCTTTCTCCACCA 

AAGAAGATGGAACCAGACTGGAATTCTGTCTCCAGAGAGAAACCCAGCTGTTTGGGTCAA 

AGACAGATGCTTCAGACTTGGGTGGGAAGGTGAAAGATGGCTATTTAGAAAGCTGGTGGC

ACGTTTTACAT AAGG GMTGTCAGATGGGAGATGCTAGTTGCCATTTTAACAAAGCAGGT 

MATCGGTAAATTTTAAACTCTGTCCATGTTCTGTTAGAACTCAGGGACAAGGGATCCAT 

GAAAAAG 

Sequence 110 

ACGCGTCCGCACGGAGAGAACTGGNCCTGGAGCGGGGGCGCGGGGAGGGGGGCGTCGTCN 

TGGGTACMTTGCGCANGGGGAAAGGTCGAGAGGTCGCGCTGNCGCCGTTTTATTTGAAG 

ACATCGTCCAGTTCTGACCATGGACTCNCAGCCATCGGCCCTTAGTTTCCATTCCCTCTA 

GNGGGCCTTCNGAGGGNTCTACTGACGTACCTCCTTCCCTTGGTACCGGACCGGGGAAGT

GTTTTCGGGCGCGGGAGGTTCCGCATGCCCAGGCCTGGCCAGGGGA 
Sequence 111

CGGGCCCTTAGTCCAAGCCTTGATCGGCGACTAAGTGACGGCAGTGACTGCCGCCATGCC 

GAGCTGGACGGAAGNCACTTCTGAGAAGGGCGGAAGTGTCTCGGGCTCCTTAGAGGGAGG 

ACACCATATTAGTGCCAGTGGGGAAGTCACCGGGTGGAATTACTTCTTTGTGGAGTTTGT 

GCTGTAGCGACAATGAAAAACGAAGAGTCAACTTTTATAAAACAAAATAAAAATTAAGTC 

AAATCATGCCAACCTTTATTAGATCGGCTAGCAGGGTTAAACTTAATTCAAAGCCCCTGA 

TGAATCGGGCCTTCATTGCACCCCCAAAGGCTCCGCCACCCTGATT 

Sequence 112 

CGCGTCCGGGCGCCGGTACGCCTGGTCCCCGCGTGGAGTCTTTACTCAAAACAGCTCCCG 

CCTCAGGCCGAGATGAGGAGCCCTTCANAATAGCTGCTGTCTCTGGGNGGACCCGGGCGT 

CCTTGGCAGCCCAGCTGNTCTGGACAAAGCCCTGCCAGTCAGGCCTCGGCTGGCAGGAAC 

CATGGCAGAGGCTGGGGATGCTGCGCTATCGGTGGCCGAGTGGCTGCGGGCATTGCACCT 

GGAGCAGTACACGGGGCTCTTTGAGCAGCATGGCCTGGTGTGGGCCACTGAGTGCCAAGG 

CCTCAGCGACACCCGCCTGATGGACATGGGCATGCTACTCCCT 

Sequence 113 

TGTCGACCCCGCGTCCGCGGGANGTTCATGGAAACGCAGGACACGACAGAATTGTGTNTG 

CCTTGCCTATCACGGTACAGCACGAAGCCAGGCTCCTTTCTCCACCAAAGAAGATGGAAC 

CAGACTGGAATTCTGNCTCCAGAGAGAAACCCAGCTGTTTGGGTCAAAGACAGATGCTTC 

AGACTTGGGTGGGAAGGTGAAAGATGGNTATTTAGAAAGCTGGTGGCACGTTTTACATAN 

GGGAATGTCAGATGGGAGATGCTNGTTGCCATTTTAACAAAGCAGGTNAATCGGTNAATT 

TTAAACTCTGTCCATGTTCTGTTAGAACTCATGGACAAGGATCCATGAAAAAGACCTGTG 

ATGTTTCNTCTGGCGCTTTACTGGCCTGGGCACACCTACCAATCTTTTAGGATTTGACTG 

GTTCCATTACATTTCCT 

Sequence 114 

GTCGACCCCGCGTCCGTATCACTGTAATTTAAGGAAAGAAAACTTCAGTTCTGCCTCTGG 

ATACCMGATGCCCATTGCTCAGTTCAGACAACTGATATTAAAATAAAGCTATGCTCCTT 

ACTTACTTCTTTTATTATAAACAAATTCCTTTGCTTTGGCTGATACTAGCTGAGTCATTG 

ATCATCATTGGTACCATGATATTGTAATCTATGCTGCTATTTGGCACAAGACTGAAGTTC 

ACACTACAGTAGAGAATACTATAAGATAATTTGCAATAAATACTGATAATAATAATACCA

GATATTTTAACTMCTTTTTCTACCTTTATTAATAGCAATCAGCACACTTGAATGTGTAA 

ATTTCACAGTAACTTTAGGCAGAACTTAAGCTCCAGGCCACATTTGTATAAGAACACCAA 

GTATTCAAGGCATAAAGTCTGTTGTAAGCCAAAAAAA 

Sequence 115 

AGTTCAGTCTGCAGCAGTCCCTGCACCCACTTCCCAGTTGCTTTCATCTNTGGAAAAAGA 

TGAGCCCCGTAAAAGTTTTGGCATCAAGGTCCAGAATCTTCCAGTACGCTCTACAGATAC 

MGCCTTAAAGATGGCCTTTTCCATGAATTTAAGAAATTTGGAAAAGTAACTTCAGTGCA 

GATACATGGAACTTCAGAAGAGAGGTATGGTCTGGTATTCTTTCGGCAGCAAGAGGACCA 

AGAAAAAGCCTTGACTGCATCAAAAGGAAAACTTTTCTTTGGCATGCAGATTGAAGTAAC 

AGCATGGATAGGTCCAGAAACAGGAAAGTGAAAATGAATTTCGCCCCTTGGATGAAAGGA 

TAGATGAATTTCACCCCAAAGCAACAAGAACTCTNTTTATTGGCAACCTTGAAAAAACC 
Sequence 116 

CCCGCGTCCGCACCAGGCCCGAGTCTTCCCTTCATGGAGGGTGACGTGAGCAGCAAGGAT 

AAGATGGGCAAAGGAGTGGATGGGACATATAAAAAAGCTCTTCAGGAAGCTGCAGCAAGG 

TTTGAGGAATTAAAGGCCCAAAAAGAGCTAAGACAGCTGCAGGAAGACCGAAAGAATGAC 

AAGAAGCCACCACCTTATAAACATATAAAGGGTCTCCCTCTGTGACCCAGGCTAGAGTGC

ATTGCTGCAATTTTGGCTCACTGCAACCTCCGCTTCGTGGGCGCAAGTGATTCTCCTGCC 

TCCTGCCTCAGTCTCCTAAGTAGCTGGGATTACAGACATGAGCCACCAACGCCTGGCTAA 

TTTTGTGATTGGCAMAAAGAGATTTTGTGACACATAAAGATGATATGAAATTCAACTTT 

CAATCAAGTATCCAGAAAATTTTA 

Sequence 117 

CCACGCGTCCGGCCCTTGCCCCTGTCNCACANGAATGGACCCACGGCCCCACCCAGCGCC 
GTCAGCGCCCGGCACTGCCACCCGGGTCCGGGCCGCTGCCTGCACGTGGGATCCGTCGGG

CAGCCGGGGACAGAAGAGACCCCGCCGTTGGGACGCAGGGCAGAGCCGGCCACCTAGTCC 

CTTCCAGCCAGCAGAGGCGAGGGAAGGCGTCACTGCCCCGGCGGGGAGACGGGCAGGACG 

CCCTGCCCCGCACCAGCAGCCTCCGCCGGGGCGCCCTCAGCTCCCTGCTTGGCTCTGTCT 

CTCCACACCCGGCAGGGCCCGCGGGCTGCCCCAGCCTGGGGGGGTCGTGGGCAGCTGCTA 

CTCAGTGCCAACCCCGTGGGGCACAGA

Sequence 118 

NAGGGAGTCGACCACGCGTCCGGTGCGGAGCAAGCATCACACCATGGCGTATGAGTGTTC 

CTCTGTGTAGACTCAACCTGCGCCTCGCCGTCCCCCATTCGCACACCCGATGCCCGGGGG 

TCGCTACGGACTTAAAATCTCCGCACCGCACCCTCCACCTCAGAAACGTTCCTGGATCCG 

AACACTGCCCCCTGACGACCTAGAGAGATCCCGGCTCCCAGCCCACTGAGTGGCTTCAGC 

CTCGCTGGTAGGTCTCTCCTCCAAAGCTCTGGAAACAGACTCCTGGGAGTGANGGTAGNG 

GGGGAGCNGCAGGCACCGCCCCCTTTCCCCAAGTCCNCCGCCCACTTCATCCCTCAGGCA 

CCTNCCAACTCCTGGCCTTNTCTGCACGAGGCGCCTGCCCCGGGCCCCGCCTACAGGGGA 

CCCAGCTCTTCTTGCAGCCATTGGAAGNTGATCACCTGGGAGGTGA 

Sequence 119 

CACGCGTCCGGTTTTACTGCTCTTTGCCATGTGGTAAAAAGAGGCTGAGACATATTTAAG 

AATTCCMGAGGATATTATGTGTCAGMmCAGACACTGATGAGAAGTTTTTAATTGTT 

CI I I I 1 1 ATTTGATTTTGGAATTCAGGTGCACTCTATTCAAGTGCAAGGATATCAGAAGT 

I I I I I 1 1 I ATTTAAAAAA I I I I 1 1 I I I CGAGATGGAGTTTCACTCTGTTGCCCAGGCTGG 

AGTGCAATGGCAGCTTACTGCAACCTCCACCTCCTGGTTCAAGCGATTCTCCTGCCTCAG 

CCTCCCCAAGTAGCTGGGGATTACAGGCACCGCGCCAACACACCTGGGCTTATTCTAATT 

TAAGTAAGAAAATGGGAAGTCTTACCCATNTTTGGTCAAGGCTTGGGTCTTCGAACCTNC 

TGACCTTAANGGTGATNCCACCCCANCTTTGGCCTCCCAAGCCGTGCTNGGGATTATAGG 

GCATGAAGCCCACCCANGCCCGGNCCCAGGATTTTTATATTTAAGCCCTTCTTGCTCTTN 

AAAAAAAAAAAAAGGT 

Sequence 120 

NGTCGCCNCGCGTCCGGGAACCTACCGGTACCGGCCGCGCGCTGGTAAGTCGCCGGTGTG 

GCTGCACCTCACCAATCCCGTGCGCCGCGGCTGGGCCGTCGGAGAGTGCGTGTGCTTCTC 

TCCTGCACGCGGTGCTTGGGCTCGGCCAGGCGGGGTCCGCCGCCAGGGTTTGAGGATGGG 

GGAGTAGCTACAGGAAGCGACCCCGCGATGGCAAGGTATATTTTTGTGGAATGAAAAGGA 

AGTATTAGAAATGAGCTGAAGACCATTCACAGATTMTATTTTTGGGGACAGATTTGTGA 

TGCTTGATTCACCCTTGAAGTAATGTAGACAGAAGTTCTCAAATTTGCATATTACATCAA 

CTGGAACCAGCCAGTGAATCTTAAATGNTCACTTAAATCAGAACTTTGCCNTTAANAAAG 

AAAATTGGGGNGTCCTGGGTTTA 

Sequence 121

CCNCCCCGCGTCCGATCAATTCCTGGAATTTATGGTTATAACTTCGAAACAGAAGATGGA 

CTAATTTTATTTTTATCTAATTTTATTGTTGGAAATTTAGGAATTTACGGAAATACTAAT 

TTAAATTATTTAAAAAGATCATTAGAATCAAGTAAACCAATTTTGATGGCTATGATTGAC 

TCAACMGAGCTAATTATCCAGGTAAMCAATAGATAAAATTTTTGCTAAAAAGTTTTTA 

GAAAAAACATTTTTAACAACAAAATCCACTTCAAGAATAATTGTCGGAGCATACGATGAA 

GAGATGCTTTCAATTCAAGAAATCCTTGATTTAGCTTACAAATCAAACGTAAAGGTTGCT 

GTATATGGNAGAAATTATGACAATCTTTTAGAAATGAATCAACGATTAGCACAAAAACAA 

AATCTTGAAATACATTATCCAGMTTTTTTGATTTCAGGCAAGCTAATAAAATCGATAAT 

TTTGTMTCTTMTTACATCMCACCTGGAGCGAATTTACCMAGATTTTTTAGAAT 

Sequence 122 

CGCGTCCGCGAAACTGAGAACCAGTTCTCCCGAAGCCGCGGGTCTCCGGCCGGCGGCGGC 

GGCGGCGGCGCAGGTGAGCAGGGCAGGGGGCAGCCGAGGGAGCGCGGGGAGCGGGGGCCG 

GGGGGCCACGTACGAGGGGCTGCAGGCCCAGCCGGGGCGGGACTCGCCAATCCTGCGTCC 

CCAGCTCAGGACGCGGACGCTGATCCGAAGCCCCTGGCCCCGGCTGGGTCAGCACTGGGA 

GAGCAGGCCCAGGTCCGCAGCCCGGGTGTGGGGCCCTCCCCAAATCCAGGGAAAGGATCG 
TGGAGCGGGGTGGGGACTGAAAGCCATTTCCTTTCCCGTGAAGAATTTTTATCAGTGCAA

GTAACAAAATATT 

Sequence 123 

CGCGTCCGCTNAAAAAMTAATACCAAAAAAAGTTTTTGTAAAGACAACGCTCTCGCTGT 
GTTGCCCCGCCACTGTGGCCTCCTTAGCTTCTTCCCTGGGGCCTGCTGGACCTTTCCATA
CTCCAGAAACTAAAGGGGGTCCAGGACCCTGCTTNAACCCTAGGATCCCGCATCi i i i i i 
1 1 I I I I I I II I GGACGCAGGGTCTTGCTGTGTCCCTCAGGCTGGAGTGCAGTGATTCACT 
GCAGCCTCAAACTCGTGGGCTNAAGTGATTTCTTTAGCCTCAGCCTTNTAAGTAGCTGGG
GACTACAGTCATACACCAACATGCCCAGCTAANTTTCCTTTnTTAATTCTTGTAGAGNA 
TGTTTGAGACGGCTTGGGCTIMTGTTGCC 
Sequence 124 

CCNCGCGTCCGTGCTGATAAAACTCCTTTGACCTGACGATTGCTCTAAGTCCTAATTGCC

ATATTTATATTCCCATAGTAAGAGTGTTTGGAGATAGTGTTTGAGCTTTTTTGCTGGTGT 

TAAAAATGCATAATGAAAGATGGCACNAGAGAGGCATATTATATCCAATTCATGAAGTTG 

TTTGTGTTAACAGAAAGCTTATTTTAATCACTTAACATTGTTGATTTGTCTAATCACAGT 

AGCGCTATTGATTAGGAGCCTGACCTTTANATGGTTGACTTGTGAGTGTATTCAATATGG 

TGAAATAANGGTGTTTGATATATGGCTGCAGATTTTAGAAGGTGTCATTAGCAAAGGTAT 

ACGGAATAAAATANGGGTTATAGTATTCCTTACTCAAATTCTGTATGTGCTAGAGCTGGC 

TGGAGTCTGTTGGCATGCTCATTTGGTGTAAGGNCCGNTAAGGACTATGCT 

Sequence 125 

GCCCCGCGTCCGCACTTTGTATTGATAACTTAAAATGGCATCAGTTTATCTTAGACATCA 

GCTTGCTTTTTATCTCCTTTTTTAGTGAGTGAAATAGAGCAACTAGCATGCCTGTGTTCC 

CAGCTACTTGGGAGGCTAAGGTGGGAAGATCAATTGAACCTAGGAGGTTGAGGCTATAGT 

GAGCTGTGATTGCACGAGTGCACTCCAGCCTGGGCAATGGAGTGAGACTCCTGTCTCTAA

AACAGCAACAACAAAAATAAAGCAACCATAGTGCATAAGGGAAATTAAATGTTCCCTATA

GAAATATGTGTATGTCTGTGATAAGTGGTATGCAAATGCTAATTATTTTATAAAATAAAA 

GTTCAGAACTATTCTTATCATTGCCACTTGAACAATTAAAGGGTTTGCTTTATTTCCTAA 

TGTTTAATAGGAACCCTTTGCTTCAAACAGCCTTTGTTGAAATCATGTAAAAATTTGTTA 

ATAG 

Sequence 126 

CNCCACGCGTCCGGCGGCCCAGCCGCCGCCTCCCGTTCTCCCCTCCGCAGCGGCGGCGGT 

GGCGGAGAAGGAACTCGACACGCACCGACCGCCCTCCCGCCCCAGCCGAAGCGGAAGCTG 

TAGCCCGCTCTGGGCCGGGGCCATGGGCGCCCCGCGCCGCCCGGGTCATGAGGACGGAGG 

CGGAGGCAGCGGGGCCGCCGCTCGAGCCCGGGGACTTTGTGCAACTGCCTGTGCCCGTCA 

TCCAGCAGCTCTACCACTGGGACTGTGGCCTGGCCTGCTCCAGGATGGTGCTGCGGTACC 

TGGGCCAGCTGGACGACAGTGAGTTTGAGAGAGCCCTGCAGAAGCTGCAGCTGACCAGGA 

GCATCTGGACCATCGACCTGGCCTACCTGATGCACCACTTTTGGCGTGAGGCACCGCTTC 

TGTCCCAGACCTGGGGTGTNGACAAGGGCTACAAGAACCAGTCCTTCTACAGGAAGCACT 

TT 

Sequence 127 

CNCGCGTCCGCGGTCGCGTGGGCGGACGCGNGGGTTCGTCCTGGACAAGTCTGGGAGTGT 

GGCAAATAACTGGATTGAAATTTATAATTTCGGNNAGCANCNGGGCGGAGAGATTCNGTG 

AGCCCTGAAATGAGATTATCTTTCATTGTGTTTTCTTCTCAAGCAACTATTATTTTGCCA 

TTAACTGGAGACAGAGGCAAAATCAGTCAAGGCTTGGAGGATTTAAAACGTGTTANTCCA 

GTAGGAGAGACATATATCCATGAAGGACTAAAGCTAGCGAATGAACAAATTCAGAAAGCA 

GGAGGCTTGAAAACCTCCAGTATCATAATTGCTCTGNCAGATTGGCAAGTTTGGACGGTC 

T 

Sequence 128 

GGAGTCACCNCGCGTCCGCCCCGCGTCCGCCCCGCGTCCGGTTAATCTTAGGCCTGAGGT 

TTGGGGCCGGGTGACAAGGAAGTTAAACTCGTCCTCCCTGCCAGATTCTACCCCCTTTCG 

GAGCTGAGCTCCAGCCMACCTGTGGAGTTTTCTTTGACCATTTTAGGACATGTTACTGC 
TTCTGAGTTGGCTGCCCCAGCTGCTCAAACAAGACCTTTCTCCTGGGTTCGTAGTAGTGA

AAAGGAGCAGCAGAGCAACTGAGGAGGAGGGCGGGTGGGAGGCATGGGACTGGGGCTTGG 

GGAGGTCAGGCGAGACCGGGGTGAGAGCTCAGAGAAGCTCCTGTGACTTCCATGCTAAGA 

TCTTGCCAGAGAACTCTGGTCAGTCCTCGGGTGTCTGGATGAAGTAAAGGAGTTAGGCAT 

TTCTTCCTTTGATTCTCTGGCTTACCT 

Sequence 129 

CGTCCGGCCCGCTCCGGGCGTGGGCGTGTTCTCGGCGGGCGTGCCTGGAGGAGGAGCTGG 

GTCCTTGTGCGGCTGCAGAGTCAGATGGGGCGGGGATTTCGGGGCACCGGGTCCTCACCT 

TCACGAGAAAGGCCCCACAGCACGTCCCCACTACCCGACGACTCACTCTTCGTGGCTTCT 

CTCTCCTCCCCAAGAGCAGGGGTGGGCCTGTCTCGCGTTCCCCTGCGGGAGTCAGGAAGC 

GTCCTTCCTACCTACCAGTCCTCCCCTCTGGTGTCTGGGGACACTTCCTGGGGGCCTTTC 

AGGTGGTTGGCGCCGGTGCAGGGCCTGAGAGCCTGGG 

Sequence 130 

GCGTCCGGTGGCATCATGACTTCTGGGGCAGTAGACTGAGCAGCAACACCAGCCACAAGT 

CCTACCGGCCTCTCACCGTCCTGACTTTCAGGATTAACTACTACCTCTCGGGAGGCTTCC 

ACCCCGTGGGCTTTCACGTGGTCAACATCCTCCTGCACAGTGGCATCTCTGTCCTCATGG 

TGGACGTCTTCTCGGTTCTGTTTGGCGGCCTGCAGTACACCAGTAAAGGCCGGAGGCTGC 

ACCTCGCCCCCAGGGCGTCCCTGCTGGCCGCGCTGCTGTTTGCTGTCCATCCTGTGCACA 

CCGAGTGTGTTGCTGGTGTTGTCGGCCGTGCAGACCTNCTGTGTGCCCTGTTCTTCTTGT 

TATCTTTCCTTGGCTACTGNMAGCATTTAGAGAAAGTAACAAGGAGGGAGCGCATTCTT 

CCACCTTCTTGGGTGCTGCTGAGTATCTTTCTGGGAGCAGTGGNCATGCTTGTGCAAAAG 

AGCAAGGGATCACTTGTGCTGGGTTTAAAATGCCGGAATTTGACAATCTTTGGGTGATAG 

Sequence 131 

GTCCGCTGGGGGCCCTGGGGCTCTCTGCGTCGAGAGCGCTCGAAGACCCGGGATTCCTGG 

CCCGATCGCGGGCGGGGGGAGACCCCAGCTCCACCCCAGCTCCCGCCGGCTCGGGGAAGG 

GGCGGCCCCTTTAAGAGCGCGCGGCCCCGCCCGCCCCCTCCGGGCAGGATCCGAATTCCA 

GGGAGGCGGGGCGGAGACGGCGGCGAGGAGGAGGCCGCGGCGCGGGACGCATAGAGCTGC 

GGCTCGGGCGGCGCCTCCTGCGGCGGCCCGGCCCGGCTCCGGCCCCCGCTGGGGCAATGC 

TCCCCGGG 

Sequence 132 

TCGCCNCGCGTCCGGGCACACACATGCCAGGCTATTTTAAGAACTACTACAACTATGATA 

AAGCTGTGAATATGTAGCCATGAACCAAAACAAAGTCTCTGTCCTTGTGGAACATTTGTT 

CTGTCAGAGAAGACAGTGTGTTGGCTCACATTGTGGTCAGTGCTGTTGAGCAAAATAGGT 

CAGAGTAAGGGGGATGGAGACTGGTGGGAGGAATGCTGCTTTATCCAGGATGGGCAGGGA 

GGACTCGATGGTGTCAGCACTGAAGGATGTAAGATCTGCTGCTCTGGGGAGAGGAGCAGC 

ATGGAAGGAGTAGAGTGCAGAGGCCATGAGGAAGGATCAGGCTTGACTCCTTTGAGCAAG 

GGGGATGGGAAGAGTGACGGNAGAAAGAGGGACAGGCCACATGGCCTGGTGGCCTGTGCT

GAGGCCTTGGGCTTTTCTTCMGTGAGATGAGATGCCATTGGCCAGTTTGGGCAGTGATT 

TNATCAGACTTGGTTCAGCAGGACCATNCTGCTTGGCAATGTGGAGAGCANGCTGAAG 

Sequence 133 

CGCCNCGCGTCCGAACAGGCCGGGCACCAAGGCGCAGGATTTCTATAATTGGCCTGATGA 

ATCCTTTGATGAAATGGACAGTACACTAGCTGTTCAACAGTATATTCAACAGAACATAAG 

AGCAGATTGCTCCAATATTGACAAAATTCTTGAACCACCTGAAGGCCAAGATGAAGGTGT 

GTGGAAGTATGAACATTTAAGGCAGTTCTGCCTTGAGCTAAATGGACTTGCTGTCAAACT 

TCAGAGTGAATGCCATCCAGATACTTGCACTCAAATGACAGCAACTGAACAATGGATTTT 

TCTTTGTGCAGCTCATAAAACTCCAAAAGAGTGTCCTGCTATAGACTATACTAGACACAC 

ACTTGATGGTGCTGCATGTCTTCTGAATAGCAATAAATATTTTCCCAGCAGGGTTAGCAT 

AMGGMT CATCT GTAGCGMCTAGGATCAGTATGCCGTAGGATTTACAGMTATTTTTC 

ACATGCTTATTTTCATCATCGGCAGATATTTGGATGAATATGAAAATGAAAC 
Sequence 134 
GCGTCCGCGAAAGCTGGGAAGCCAGGTCTACCTGCCCCAGACGAATTGGTGTACCAGGTG

CCACAGAGCACACMGAAGTATCAGGAGCAGGMGGGATGGGGMTGTGATGTTTTTAAA 

GMATCCT TTGAAGATGATGCTGCTTTTTACAMGCATCGTTTTAAAGCACATGGCCTTT 

I 1 I I I I 1 I AATTATTAGTGGTAGTAATATATAGAATGTATTACATAACTGTCACTGAAGT 

GGTTGGGGAAAATGTGGTGACTGAGGTACAGGAAACTACTAATCTTGCCATCTTGCTTTA 

AGGTGTTATGGTGGCACAGTTACTGCTCGCCTGTTAAATTTCAAATGTCCTGTTTGATAC 

TACTGGAGAACACTATTTTTAATACAGAAAAAGGTCCCTATAATGCACTTCAGAGAAATT 

AA 

Sequence 135 

TCGACCCACGCGTCCGGGAGTCCCCCCTGCCCCCCATCAAATGCTTCCTGCAATACTTTG 

CACACCAGAGACTGGGCCTCCCCAGATCCAGGGGGACAGGGGTCCCTGGGGGAGTCCCCA 

GGGCCAGCCCCTCCAGGCCAGCTGCACACACTTGACACTGATTTGCACAGTCTTGCACAA 

ATAGGGGGTAAGAGCCCAGTGGCTGGGGTGGGCAATGGGGGTAGCCTCTGGCCTAGGGAG 

TCCCCTGGCACTGCCAATGGGCACAGTCCCGAGCACACACCCCCTGGCCCTGGACCCCCA 

GGCCCCTGCCCCACCAAGCGAAGGCTGCTTCCTGCTGGAGAAGCCCCAGATGTCAGCTCT 

GAGGAAGAGGGGCCAGCCCCTCGGAGGCGCCGGGGATCCCTGGGCCACCCTACTGCTGCC 

AACAGTTCTGATGCCAAAGCCACACCCTTCTGGAGCCACCTGCTGCCTGGGCCCAAAGAG 

CCTGTTTTGGACCCAACAGACTGCGGTCCCATGGGGCGGAGGCTGAAAGGAGCCCGTCGC 

GTGAAAGCTTGAGCCCCCTTCGAAAGCCTNCGGAAGGGGCCAGGCCTGCTGAGCCCCCCC 

AGT 

Sequence 136 

CGACCCCGCGTCCGTGAGMTTCAGCTTTGGAGTCCCGGGTGAGGGGTTTTAGATAAACC 
CATCAATATCACCCACATTCTGTGACTCTTTGCATCACTCGTGTTATTTATTTATTTATT 
TATATTCTGCCTTGTTCCAGAAAAGTGTTTAAGGCAACAACGCTTGI I I I I I GGTGTTTT 
CTTTTGACATTTGMMTTTAGTACATTC 

AAGAGAAGGAACAATTGTTTTTAGTAAGTTTTCTTTTTCCT I I I I CAATGAATTGATTCT 

TGAAATTAAAAGTTCTTGAGAGAAGGAGAGGAAGATACAGCAGACATAGGACTGAGCCAA 

GGAAGAGTCTGCCTGAGAGAGACGCTTGGCCTGTGCTTTGCTGCCATCCGTGCGGCCTTG 

GCCACA 

Sequence 137 

TCCGATTTTTAMTCTATTGGCCGTGTTGTCCTACCTGAAGTTCTTCAACTGCCAAAAGC 

ACAGCCCTTTTTCTCTGAGCTGGTGGTTCTGGCTAACACTGACAGGGGTCGCTTGTTCCT 

GTGCAGTGGGCATCAAGTACATGGGTGTGTTCACGTACGTGCTCGTGCTGGGTGTTGCAG 

CTGTCCATGCCTGGCACCTGCTTGGAGACCAGACTTTGTCCAATGTAGGTGCTGATGTCC 

AGTGCTGCATGAGGCCGGCCTGTATGGGGCAGATGCGGATGTCACAGGGGGTCTGTGTGT 

TCTGTCACTTGCTCGCCCGAGCAGTGGCTTTGCTGGTCATCCCGGTCGTCCTGTACTTAC 

TGTTCTTCTACGTCCACTTGATTCTAGTCTTCCGCTCTGGGCCCCACGACCAA 

Sequence 138 

CGACCNCGCGTCCGGAAGGACCCTCTGAGCTATTTTGCGGCATACGGGAGCAGCAGCTCA 

GGCTCCTCGGACGAGGAGGATAACATCGAGCCGGAGGAGACGAGTCGCAGAACCCCGGAT 

CCGGCGAAGTCGGCGGGCGGCTGTAGGAACAAGGCGGAGAAGCGGCTCCCGGGACCTGAC 

GAGCTGTTTAGGAGCGTGACTCGCCCGGCCTTTCTCTACAATCCGCTCAACAAACAGATA 

GACTGGGAGAGGCACGTCGTCAAGGCGCCTGAGGAGCCTCCAAAGGAATTCAAAATATGG 

AAGTCAAATTATGTACCACCTCCTGAGACCTACACCACTGAGAAGAAGCCTCCGCCTCCA 

GAGCTTTGACATGGCAATAAAATGGTCTAACATATATTGAGGACAATGGTGATGATGCTC 

CACAGAATGCTAAAGAAAGCTAAGGCTTNTACCA 

Sequence 139 

CGACCACGCGTCCGGGCTGGCGAGCCCGGCTGAGGAGCCTCTTGGGTCGCACTTACCGCC 
GCGTCCGCTCCCGGTCCCTGGCCCCTCAGCGGCATGGCGTGCGGGGCGACGCTGAAGCGG 
CCCATGGAGTTCGAGGCGGCGCTGCTGAGCCCC 
Sequence 140 
CGTGTCCGGTGAATGGGAGCGGAACTCACAGGACACAATGAGCCGGGTCACTGATGGCCT

TGCTTTCTAAGAATCTCACAGTGAGCCCTAGAACTCTCTACGTGGTAACACTGTGTGCCT 

TTTTCAGAGAAGAGCCTATCTTAGATCTTAGCCTAACGTTGGGTCTATTGTGTTGCTGGA 

GAGACCAGCACTGACATTCATCTCAAAGCACATGGTATGTTTGACTCCTATGTTGACTCA 

ACTACCCATCTTGTACTGGGACACTCGC I I I I ! I I I I I I I I 1 I I I GAGACGGAGTCTTGC 

TCTGTCACCGGGCTGGAGTGCAGTGGCACGATCTCGGCTCGCGGCAGCCTCCGCCTCCCG 

GGTTCAAGTGATTCTTCTGCCTCAGCCTCCTGAGCATGTGGAGCTCAGGCTGAAGGTGAT 

GTGGCCGCCC 

Sequence 141 

GTCCGATTGATTCTTCTATGATGCGTGTTCATTATACAATACACATTCTCGGAAAGGCAG 

AATTATTTGCTTTTATGATTGTTTTTTGTGACCTAAGAAACGGTTCTTCCCCCATTTTCC

TATCCCAGGCCAAAMATATTCTCTCACGTTTTTCTGTMGAGCTTTATAAGTTCAGCTT 

TTATATTGAGGTCTGTGATTCATCTTGGATTATTATGTGTAGGTTGTAAAATAGGAATCT 

AGGTCCAGTTTTTCCATGTGGATATTCAGTTATTGTGGGGCCAATTGTTGAAAAGTCTCC 

CCAAAGMCTGCTTTCTTATCAGAAAGATAAGATATATTAAAAGTGTATATCTAAGTCTG 

GGTTCTCTTTTCTGTTCTMTGGTTGATATTTTTATCCTTATGCCAGAACCACACTGTCA 

TGATTGCTGTAGCTTTATMTAGTCTTGMTCAAGTTGCTTTTCAGTTTTGTATTTTT 

AAATTGCTTTCGGTATTCTAAGTCCTTTGCATTTCTGCTMMTTTAGAAGCAGCATGTC 

TACCAAAAGGAAAAAAAAAGCC 

Sequence 142 

TCCGGCGGAAGAAGGTGCGTCCGCGGCTGATCGCGGAGCTGGCCCGCCGCGTGCGCGCCC 
TGCGGGAGCAACTGAACAGGCCGCGCGACTCCCAGCTCTACGCGGTGGACTACGAGACCT 
TGACGCGGCCGTTCTCTGGACGCCGGCTGCCGGTCCGGGCCTGGGCCGACGTGCGCCGCG 
AGAGCCGCCTCTTGCAGCTGCTCGGCCGCCTCCCGCTCTTCGGCCTGGGCCGCCTGGTCA 
CGCGCAAGTCCTGGCTGTGGCAGCACGACGAGCCGTGCTACTGGCGCCTCACGCGGGTGC
GGCCCGACTACACGGCGCAGGTGCGTGCACCCCGTCCGCACCCCGCCCCTGCAGCCGCCT 
GGTCTCCCCGCCTCCCCTCCTCCCTGCAGGTTTGCGTGGCTGAGGCTCCCACCTCCTGAC 
CTCGGGGGCCGAGAGCTTTGCGAGCTGACCCCGCTTCTTCTGGCTTTGCAGAACTTGGAC 
CACGGGAAGGCCTGGGGCATCCTGACCTTCA 
Sequence 143 

CCCGCGTCCGAGATCCTGTAGGTGAAGTTCTCCTGTGCTCCACAGCCACCCAGAGGAATT 

CCAAAACCAGCAGTGGAGGACTTGGGGAGGACAGGAGGGAAAACATGGCGAGTTCATCAG 

CTCTGTTTCCTTTATTAAAATATTTCTGTAATTGGTGGTGGGAAATTGAAGAAATCAAGT 

GATTGCATCAGCGCTGGAAAAAGCTGCCAGCACTTGGCAGTGGAAGAGAATATATGCTTT 

ATACTGGACTTTTTGAAAAAGAGGCTGAGTTTGGCCAGATTGCCGACCAGCAATGGAAAA 

ACTAATTAGGTGCCTTGCCTGTGAGCCAGACGCCCAGCAGGGCTGTGGCGCATGGCTCCC 

GCCGCCTCTGAAGAGGACACTTTCTAGTGAATTCAGTTCGTGCTACCCTTGAGCAGCCTG 

TGCTACAGCAGGCACATTTGTGAATCTCCCAGCCTGTGCCTGGCGTCNGAACTGTAGCTT

CCCAAAGAC 

Sequence 144 

CGCGTCCGAGTAMTCTGTTCTGCACTAMTW 

CGTATACCATTCGTTATATACATTTCAGTGTTTTTMTMTGTCMTTTCTT

AGGAGAACTCTAGTTATTTTTATAAATCTMTTGACTTAATTTTGTGGGAATATAAAAGA 

AGTGATTAAAMCCTTTGGTTTAAAGTAGTTAATCCTGAAATCAAGCTCTGTAAATATTG 

TGTAGGGATATGGAGAAATCCTCAAAAAGAAAGAGCTAAAGAAAATGGCAGGGATGGCAT 

CTTGGGAGTATAACTGAAAGTAGGAAGATGTGGATAGMGAGTCTTATTTTTAATCACAG 

GGCATATGTGCTATTTGAATTATTTTGACAAAAGTATAAAAATATGGAATTATGCATTGT 

GTGTGTGTGTGGTTTCATCTGTTTAAGAAATATGTGATGGAGACTGTCCTATCATCAGGA 

AATTATTCCAGT 

Sequence 145 

GTCGACCNCGCGTCCGCGGAGACACCGACCGCGGCGGCAGCAGCAGCAGCAGCAGCGAGA 
GGCAGAGGCGGCGGCGGCGGGGAGGACAGCACGGCCGAGGCTGCCCAGAGGCGCCTCCTC

CACACCCCCCGCCGCAGCAGCACCGGCGACAGATTTTTTAAAAAATGGATTTGGCCAACC 

ATGGACTTATTCTACTGCAACAGTTAAACGCTCAGCGAGAGTTTGGTTTCCTGTGTGACT 

GCACGGTTGCAATCGGCGATGTATAGTTCAAGGCACACAAATCAGTTCTTGCTTCATTCT 

CCAATTACTTTAA 

Sequence 146 

CCACGCGTCCGATCCTCCCCAAGGCAGAGGTGTGCGTGCGGAACCATGTCCAGCCCTACA 

TCCCATCCATCCTGGAGGCCCTGATGGTCCCCACCAGCCAGGGCTTCACTGAGGTGCGAG 

ATGTCTTCTTCAAGGAGGTCACGGACATGAACCTGAACGTCATCAACGAGGGCGGCATTG 

ACAAGCTGGGCGAGTACATGGAGAAGCTGTCCCGGCTGGCGTACCACCCCCTGAAGATGC 

AGAGCTGCTATGAGAAGATGGAGTCGCTGCGACTGGACGGGCTGCAGCAGCGATTTGATG

TGTCCAGCACGTCCGTGTTCAAGCAGCGAGCCCAGATCCACATGCGGGAGCAAATGGACA 

ATGCCGTGTATACGTTCGAGACCCTCCTGCACCAGGAGCTGGGGAAGGGGCCCACCAAGG 

AGGAGCTGTGCAAGTCCATCCAGCGGGTCCTGGAGCGGGTGCTGAAGAAA 

Sequence 147 

NACCACGCGTCCGCCNCGCGTCCGCTTGCACCCGGTGAAGAGCGTGCGTGTGCTGAGGCC 

GGAGCCGCAGACGGCTGTGGGGCCCTCGCACCCCGCCTGGGTGCCCGCGCCTGCCCCGGC 

CCCCGCCCNCGNCCNCGCCCCCGNCCNNGCTGCGGAGGGCTTGGACGCCAAGGAGGANCA 

TGCCCTGGCGCTGGNCGGCACAGGCGCCTTCCCGNTGGACGTGGAGTAC 

Sequence 148 

TCCCAAGAGCTGCANGNNNCAGCCGCGACAGCAAGAACCGGNAGAGCCGGCAGACCGCGG 

CGGCGGCGGCGNCGGAGGCAGGAGCAGCCTGGGCGGGACGCAGGGNCTCCGCGGGCGCAG 

GAAGGCGAGCAAGAGATATNCTCTGAGAGCCAAGCAAAAGAACATTAANGGAAAGGGAAG 

GAGGAAANGAAGGCTGGATACCGGNGCAGTGAAAAAAGGCACTTCCAAGAGNTGGGGGCA 

CTCAACTACGCCACNAGACTCTGACCGGGTGGCCCAATCAAGCCAATGAAGAAACCTATA 

ACCCGGNTTAACCTNTNATTGGCCTGCCTTCTNNNTNGGGGGGTGGGGGCCCAAGCCCNC 

TTAACCCCCCAACCTTCNNTTTCAAACCTATCCCAACCTTATTCNAAAANGAAGGGGANC 

CTNAAGGATGGGGNTTTCCCCCAAGGCNAAAAAGGAAAAAAAGGGGGCCCCCNNGGGAAG 

CCTTCNTTCTTGGGGAAAAACAAGGCAAAAAAAATTGGAAGCCTGAAAACCCGGCTTCAA 

AAAAAAGGGG 

Sequence 149 

GGCCGAAAGGGGGGCAGGTGGTCGGGCCGCGCAAGCGGAGATGGAATGGGGCCCGGGCTC 

AGACTGGTCACGGGGGGAGGCTGCCGGCGTGGACCGCGGGAAGGCGGGGCTGGGGCTCGG 

CGGGAGGCCACCCCCACAGCCGCCCCGGGAGGAGCGCGCCCAGCAGCTGCTGGACGCGGT 

GGAGCAGCGGCAGCGGCAGCTCCTGGACACCATCGCAGCCTGCGAGGAGATGTTACGGCA 

GCTGGGCCGCCGGCGCCCGGAGCCGGCTGGTGGCGGGAACGTCTCANCCAAACCTGGAGC 

G 

Sequence 150 

CACGCGTCCGGCCTGCTGTTNACCTGCGGGACCCCAGGAACCTGGACTTGTTTCTCAAAG 

TGGTTCATGGAGATGTCACCCCCTACGACCTGGTGCGGATGAGCTCGATGCAGCTGGCCC 

CCCAGGAGCTGGCCCGCTGGCGGGACCAGGAGGAGAAAAGGGGCCTGAATATCATTGAGC 

AGCAACAGAAGGAGCCGTGCAGACTTCCAGCCTNCAAAATGACCCACAAGGGCGAAGTGG 

AGATTCAGCGGGACATGGACCAGACACTGACCCTGGAGGATCTGGTGGGACCQCAGATGT 

TCATGGACTGCAGCCCACAGGCCCTGCCCATCGCATCAGAGGACACCACGGGGCAAGCAT 

GACCACCACTTCTTAGACCCCAACTGCCACATCTGCAAGGACTGG 

Sequence 151 

TTTTTCCTTAGAATCTTCGAGMAAAGATGAAGGTATTATTCTCAGTTTCGAGATCAGGA 

CTCCTCACCACCAGGCGGGGGCTTTAAGGTAGACACTACAGGGAATCTGATCTCAGGGTG 

ATCCTCTCCCTTCCACTTGCAAAAAGAGAGGAGCAGGTGGGCCACTGCTCTCTGAGATGT 

TAACACCCCTCACACTCCACGGGCATGCTTTGTCATTCTGCACACCGGTGTAGCTGCAGC 

TCTGTGTGAATTCAGATCTCAAGAGAAATGTAAATCMAAGTATGAGTTTCTTTCTTTCT 
TGGGTGCCACAGTAGGAATGAAATGATGGGGACTTTTGGAAGCCCCTGGACTTGTGGCCC
CTGTAGAAGAGCAGCTTGGGCAGGGTGTGATGGCCATCTCTGTCTCTAGGGGCCCTGTGG 
A 

Sequence 152 

TGGCGAGAGCGCTGGTGCCGAGTGAAAGATAACAAGCTCATTTTCCACAAGGACAGGACC 

GACCTGAAGACCCATATTGTGTCTATTCCGCTCCGTGGCTGCGAGGTGATCCCGGGTTTG 

GATTGTAAACATCCTCTGACGTTCCGGCTGCTGCGCAACGGCCAGGAGGTTGCAGTATTG 

GAGGCATCTTCTTCTGAAGACATGGGCAGGTGGATTGGGGATTTTACTCGCAGAGACGGG 

GATCGTCCACAGACCCGGAGGCTCTGCACTATGACTACATTGATGTGGAAGATGTCTGCA 

ANGTGTCATTCAGACAGGCCAMCAGACCTTTCTGTTTTTATGAACAGGGCGTGTTATAT 

GTGCTAACCCATATCTAGGGGGCACCTTCAACGGGTTATTGCCCACCCAGCGGGACGGCA 

CTTTAATATGACGATGTTTCCGTNCCATAAACCGGNTTGNTTAAAGGGTAAAA 

Sequence 153 

GTTCGACCCACGGGCGTCCGCGGGACCGCCGTGGGNNNNACATACTATGCGNACAGGCGC 

GTTGNACACAAANGGCCCATTCCTGTAGCCTCACACTTGACTACACATGGGGGANTCACT 

CGGATTCGGNTCTCCACGTGGNNGNTCTTTGTTCTGTACTCTACGTAGCTTTGGCTTTTG

TTTTCTCGTCGCAACAGGGCATGAGACTTCGTCGACCTTNGGGGTCTGTATAGTCTTTGA 

CTTACTACGTGTAGGTCTCAATACAAAGTGGGANATANTCATATCCGTCCGCGAAAAGTA 

ATTCTTGGAAAAATTTACCCTTGTCTCCCGCNTTATGAAACGTGAACTAAGTAACTCACT 

TTGCCCCTGGGGCGCCTCTNTTTAACANTGTTCTTTTGNCGAAATCATCATAACCTTCAA 

CTGAAAACAATGTGGTCAACAAACTGACTATGGAGGTCTTAGGCTCNGTCTCTAAGATCT 

TTAACCTTGTTTATCGGCGCGTGCGGCGTNGTCCGAACGAAGAGACTATAACCCGCACTA 

TAACNAAAACTCTTTTTTAATCCCACCACCTCGTGAGGGANGGGCCCTAAGACTGAAACT 

GTAGTAAGTCCTATTGATTTGCGTAGGAGGANTTAGGAAA 

Sequence 154 

NCGCGTCCGATAGTCTACCAGCCTTACCTGGTTGATTACACTTGTAAAAGAAAGATTAAA

AGCAGGCCAGTGACTCTGGTCTGCTTGAACATGTGAATGTAGTGGTTTGAGCAATCTGGA 

GTTTGCCCTAGTGTCAAATTCCAGACTGTCCATAGTGTCCAAAACCTGAGGCAGACACTA 

ATGTTAACCCCCAGCACCCCGTGATTGGAAACAAACCTAAATACGTATTGGGAACTTAAT 

AGCAATTTTAAGCATTCTGATAGATTTTTTGTAGGGATGGGGTCATGCCATGTGGCCCAG

GCTGGTCTGAAAACTCTGGCCTCAAGTGATCTCAAGCTTTGGCCTTCTAAAGTGTTGGGA 

TTACAAGGTGTGAGGCATTGCACCTGGCTTAGCCGTCTTGATTTGACATTGTAATGAAAA 

AGTGTGAGTCTTATTCTACCAGGGGCCCTTTTTTGTCCTCTTGAAAATNGAATAACCANG 

GGAAGGGGGAA 

Sequence 155 

CCNCGCGTCCGTCCATCACAGCCTCCGAAGGTGCTGGGATTACACGGCATAAGCCACTGT

GCCCAGCCTGTTTTTAATAATGATATTAAGTGGGTTTGGTTCATGTGTTATTAATCAGTG 

TTAATAATCGTAC I I I I I I I I I I I 1 1 I I AAAAGAAACCATGGGTATTCTAAAATCAGGAG 

TCCAAATAAAAGAAAGTTCTCGGCTGTGCGTGGTGGCTAACACCTTGTAGTCCCGGCACT 

TTGGGG 

Sequence 156 

CGCGTCCGAAAGGAGTCGCGCCGCCGCCGCCGCCCCCTCCCTCCGGTGGGCCCGGGAGGT 

AGAGAAAGTCAGTGCCACAGCCCGACCGCGCTGCTCTGAGCCCTGGGCACGCGGAACGGG 

AGGGAGTCTGAGGGTTGGGGACCGTCTGTGAGGGAGGGGAACAGCCGCTCGAGCCTGGGG 

CGGGCGGACCGGACTGGGGCCGGGGTAGGCTCTGGAAAGGGCCCGGGAGAGAGGTGGCGT 

TGGTCAGAACCTGAGAAACAGCCGAGAGGTTTTCCACCGAGGCCCGCGCTTGAGGGATCT 

GAAGAGGTTCCTAGAAGAGGGTGTTCCCTCTTTCGGGGGTCCTCACCAGAAGAGGTTCTT 

GGGGGTCGCCCTTCTGAGGAGGCTGCGGCTAACAGGGCCCAGAACTGCCATTGGATGTCC 

AGAATCCCCTGTAGTTGATAATGTTGGGAATAAAGCTCTGCACTTTCTTTTGGCATTTCA 

AGTTGTTAAAAACAAATAGGA 

Sequence 157 
CGCGTCCGGGTGTCNAGGCCATGGGGCAGCCCTGGGCGGCTGGGAAGCACGGACGGGGCG

CCCGCGCAGCTGCCTCTCGTGCTCACCGCGCTGTGGGCCGCGGCCGTGGGCCTGGAGCTG 

GCTTACGTGCTGGTGCTCGGTCCCGGGCCGCCGCCGCTGGGACCCCTGGCCCGGGCCTTG 

CAAGCTGGCGCTGGCCGCCTTCCAGCTGCTCAACCTGCTGGGCMCGTGGGGCTCTTCCT 

GCGCTCGGATCCCAGCATCCGTGGCGTGATGCTGGCCGGCCGCGGTCTGGGCCAGGGCTG 

GGCTTACTGCTACCAATGCCAAAGCCAGGTGCCGCCACGCAGCGGACACTGCTCTGCCTG 

CCGCGTCTGCATCCTGCGTCGGGACCACCACTGCCGCCTGCTGGGCCGNTGCGTGGGCTT 

NGGCAACTACCGGCCCTTC 

Sequence 158 

CGAGCACGCGTCCGGGGACTCAGGCATGCACCACCACGCCCAGCTAATTTTTGTATTTTT
AGTAGAGACAGGGTTTCTCCATGTTGGCCAGGCTGGTCTCGAACTCCTGACATTCGGTGA 
TCCACCCGCCTCGGCCTTCCAAAGTGCTGGGATTACAGGTGTGAGCCACTGTGCCCAGCC 
CCTTCCTGTTGAGTAAAAGGAAGAACTTCAGGGTAAGACACTGTACAGTGCCCAGCATCT 
GGAGAGCCGCCAGCATTACCCCTGCCTTAGGAGGTAGTCGTCTCCTCATCACTACAAGGT 
ATTGAAGCCTGAGGGCCCCTGGGCAGGACGATAGAGTGAGATTGCCCTGGGGACTCAGGA 
AAGGAAACATGCCGTATTTNTAGGGAAGGAGCTGCTGCTGCCTCTCAGTGACTCTGGTTC 
CAGGAGGGAAGAGCCGAGAGCTAGGGTTCCCTTTCATAGGGAGAAACCCAGCAGGGTTTG
GGGTGTTCT 
Sequence 159 

ACCACGCGTCCGAAAGGAGTCGCGCCGCCGCCGCCGCCCCCTCCCTCCGGTGGGCCCGGG 

AGGTAGAGAAAGTCAGTGCCACAGCCCGACCGCGCTGCTCTGAGCCCTGGGCACGCGGAA 

CGGGAGGGAGTCTGAGGGTTGGGGACGTCTGTGAGGGAGGGGAACAGCCCGCTCGAGCCT 

GGGGCGGCGGACCGGACTGGGGCCGGGGTAGGCTCTGGAAAGGGCCCGGGAGAGAGGTGG 

CGTTGGTCAGAACCTGAGAAACAGCCGAGAGGTTTTCCACCCGAGGCCCGCGCTTGAGGG 

ATCTGAAAGAGGTTTCCTAGAAAAAGGGGTGTTTCCTCTTTTCGGGGGGTCCCTCACCAA 

GAAGAAGGTTCTTTGGGGGGTCGCCCTTNTTGAGGGAGGCTTGCGGNTTAACAGGGCCAA 

AAAANTTGCCATGGGATGTCCAAGAATCCCCTGTAATTTGATTAAATGGTGGGGAAATAA

AGCTTTGCAAC I I I I I I I I GCNATTTAATTTGGTTAAAAACA 

Sequence 160 

TCCGCTCCCCTGTTTTCTTCCTTTTTCTTTTTGCTTGTATGCACAACGGTAGGACTTACT 

TCGTAAGAAACAAAATGCCAGTATTTTCTTAAGCCATGATGTGAAACCAATGACCCTGTG 

ACCACATGGCACAGAACACTAAATTTTGGTCCCATGGCTGAAACTTGAGGGTGACTAAAA 

GTAATGCCTGTGAAACATGATATCTATCTGGGATGGCCATTTGATCTCTAAAAGGAATTT 

TGTNCACTCCACAGAACTCCTATCTATAGTAAAATTTGATTTTTC 

Sequence 161 

CGTCCGGAAAAATATTAAACAACTCATTTTAAGATTCAAATTAACTAATTCCTGCATATA 

TGACATTCCTTACATMGCGAACACTAMCAAAAATGGCTAGAMTGTCTTTTTCTTTCT 

TTTCTCTCTTTGTTGTTTAAGGTATTAAGCACCGAATTATTACATGAGACTGGCAGATAG 

CTATTAATCCTCTTACAGATTTGAGAAAGTTGATTCTCAAATATTTATGCACCTTCTCCT 

TCATTGTTTTCTTTAMTCTGTCCTCTTAAAAAGCTTCTTAAGAGCTCAGTTAATGCTTT 

TGACTTAACTAGGAGAAAAAGGCATGATAATACAGGCAAGATGGCATTGTTAGCAATTCT

GGTAGGTGGTTTTGGAATGAATCCTAAGAGGCAAGGGGATCTTAAGGACAAGGAAGAGAA 

GAGAGAGGGGGNGGGATCCCTTTGATCTCTTTCTCTGGNAATCTTAAATGCNTAATTTTA 

CTAAAACATGTTCTCAATTCATTCATAT 

Sequence 162 

CCCCGCGTCCGGATTAATGAGTGTATGCCTAGCTCTTTCTCCAGTTTACTTTTAGACCAT

ATTGTTGTTTGTTTTGAATATCATTCCTTAGGCTATGTTGAGAGTAGAGTGGCTTCCCAT 

TAGGAGMCTMTTTAGGGCATGTCTTTTGCTGAATCCCGTCAGCATATTTAACAAATTC 

CCAATTCTAGATMTTTCCTTTTATTTCTCTAGTACCCTTTGCCAGGGGCTCTACACATC 

AAAGGTGTTCATGAAGTATTTGTCAAAGGAAAGAACAGTAATGACACCTAACACATAATG 

AGTGATTAGTATGTTCCAGGCATTGCGTGAGCTATTTACTGTGAGTGATTTAATGTTATC
TTCCCAGCAGACCTCTGAGGTAGGGTACTAGTATGATCCCCATTTCGTACATGAGGAAAC
TGACACTAAGGGACATAAAATAAGTTTTTGAAGTCACAAAGTGAATAAAAGGAAGACCAG 
GGTTTTAATTGGAAGCCCATA 
Sequence 163 

TAGACTTTTGCAGTGTTAMCACAGCTTCCTTAACTCTTAGAACTGGAAGTTGTAGAGCT 

CTCCTTTTGGTGCCTTTCCAGCCTTTATACACACTATTGTAGCTTTCTTAGGTTTGAT

GTAGCGTTTCAAGTAGTTTAGCTGAGACAGNGAATGTATTAGGTTCAACATGACCTTGTG

TTTTATTTGTGTTTGCCMCAGGATGCCTTATTTGTTTGAGAAAAAG 

ATTCTAMCTATCTCCTTT

TTTACCACCATTTAGTGCCTTAAATCCTATCAAGAAAGCAGTGTTACTGCTCAATGCCCA 
AATAAGACACGCGGATATTGCTATTGCTTGCTTTTGAGTTMCAGGCCNACTTTTTATAC 
TTAAAACCTCA 
Sequence 164 

GCCCCGAAGTCCCACTGTCCCTGCCGAGGCGCCGGCGCCGTCCCCTGTGCCCTTGACCAC 

GCCAGCCTCCGCCGAGGAGGCGATACCCCTCCCCGCGTCCTCCGACAGCGAGCGGTCGGC 

GTCCAGCGTGGAGGGGCCCGGAGGGGCTCTGTACGCGCGCGTGGCCCGACGCGAGGCCCG 

GCCGGCCCGGGCCCGGGGCGAGATTGGGGGCCTGTCGCTGTCGCCATCGCCCGAGCGCAG 

GAAACCGCCGCCACCTGACCCCGCCACCAAGCCTAAGGTGTCCTGGATCCACGGCAAGCA 

CAGCGCCGCTGCAGCTGGCCGTGCGCCCTCACCACCGCCGCCAGGCTCCGAGGCCGCGCC 

CAGCCCCAGCAAGAGGAAACGGACGCCCAGCGAACAAATCGGCGCATACGGTCGAACACN 

GGAAGCCCCCGGACCCGGACCCAACGCCGGGGCCCCCCG 

Sequence 165 

AGTCCGCCCCACGCGTCCGGTGAGTTTAGCGCTGCTGTCCGGATGGGTTGGTAGCAGACA 

GGGTGGAGTAGGGTTAAGCACACTGGTCACCTTAGGATTGGTTTCCTGGTGCTGGAGAAT 

GGTTAGGACACAGGCCTTGGAAGGTTTTTTGAGTGTGAMTATTACTCAGCGTTTTCTGC 

AGACCTCGCGGGCAATGCCGCTTCTAATTTTATCCAGGCCTTCTTCTGTAGGGAGGGCCT 

GTTAAGAGTTGAGCAGCCCGATTTCTGAACCCCTCTAAAAAGCTGTGGCTGATTGGTGGC

I I I I I I I I I ICTTGGAGAGGGGGGTGTCAAAGATTTCTTTAAAATCGTTAGTGATGTGGT 

CTCGCTTTA 

Sequence 166 

ACGCGTCCNAAAATGTGTGGTACATGGAATA I I I I I I ATTATGCTTATTTCGTATTGCCA 

GGTAGATGGCCAGCCTGACATTCAAAATTATTTTATCAGCCCCTAMTGTTAATATTTCC 

CAAACTATTTAAATCAGTAGMGACATTTTTACTATTAAGAATAAAAAGTTATAATATAA 

AATGGATTAMTGCCAGATTATATGCTAMCAAGTCCTTTMATTTTTMCTTAATATTT 

TTAACAGA IT'I Mill! I GAGATGCAGTTTTACTCTTGTTGCCCAGGCTGGAGTGCAATG 

GCACAATCTTGGCTCACTGCAACCACCACCTCCCGGGTTCATGCGATTCTCCTGC 

Sequence 167 

CCGTCCGCGAGGTTAGGAGATCGAGACCATCCTGGCTAACACGGTGAAACCCCATCTCTA 

CTAAAAATACAAAAAAAATTAGCCAGACATGGTGGCAGCCTCCTGTAGTCCCAGCTACTC 

GGGAGGCTGAGGCAGGAGAAGGGCATGAACCTGGCAGGCGGAGCTTGCAGTGAGCCCAAG 

ATGGCGTCACTGCACTCCAGCCTGAGGGACAGAGCAAGACTCTGTCTCAAAAAAAAAAGA 

AAAAAAAGTGGCACAGATCAATTATAAATCACTGCTTCAAGGCCAGTGCTCTCACTTTGT 

ACATTAATAATCTCAGGCCCAAATAAGATAAGTGATATGTCAACGTATGTTCACTTTGGT 

CTTTACATGGCAGCTATAGTATACCGGAATATTATAAGCTCAGATCGTCATAGCTACATA 

ACTCCTTTAGTTGGGAAGANACGCCGTAAATGCCCATCAAGANTAGCAAGTCTTGCATTT 

GACT 

Sequence 168 

CGCGTCCNGGTAACTTGMTMGGATTATGTGCCCCACCCTTACTCTCATTCCTGCTTCC 
TCTTGGGCTCAAACAGGGTATGAGTATGMGATTTTGCCTTTAGTTCCTGAACTGAACCT 
GCTTGCTATCCCTTTCCTCCCCACCACTACCTTATTCCTTCTCTGCCTCCAAATTGCCAC
TTTGTTTTGAGGCTTCCTTCCCTACCTTATTATTCTGMGGMGTAGAGATCTTGCTTCT 
GAAACCCCTCCTAAGAAACTGCCCAGGGACAAGATAAATTACAAACAATTCATGGGAGTT
TACTACCTMGTTGCTTCTAGGGCATATGTATACCATACTAGTAGTCTAGATTTCTGG 
Sequence 169 

AGGGAGTCGCCCCCGCGTCCGGACAGGATCTATGGATGGMGAAGATGAAAACAAACACA 

CAGCCATAAAGGTAATTGTTTCTGGCCAACATCTTTCTACTGATGCTTTGTTTTGATTGT 

ATGTTGCTGTTTATATTTTCTCAAACTTGAGGCTCTATTTTATGMATGTTGMTATAM 

TACATTGTATTTAACTTGAAAAATTCCTGGAAATATACCTGATAATTACCACCTGAGGAA 

TCGNTTTATTTTATGAAAGTAACAGCGTGATGAATACTGTAATTACAAAAGAAAATTAGT 

ACTCACTGACTTATACCCTTG ) ) ) I II U I 1 1 1 1 1 I 1 GGTTTGG 1 11 1 111 1 GGTTTTGN 

TTTAGGNGACA 

Sequence 170 

GGAGTCGNCCCACGCGTCCGGCTGGCCAGCAGACCCTGCAGCAGTCATTGGTGGTTCTGT 

CTTTCCCTAACCCTTATGNGCTGGAGGACAGAGGAGCCACCTCTTCTCACTGGTTGATAC 

TCTGTTCCTAAGACCTCAGGTTACCCAGTGTAGCCTGCTGTGACGCTTCTGCTTCATTTT 

CTGCCAAGTGATACAGAACCTTCGTTCTGCTGCTTTGGGGACAGGTGATCCTAGCCCCAG 

CTTAGGGCCAGTGCTCTGTGCCACTCTGGAGAAATAGGGAAAAGATAGGGGTGGCTCAGT 

ACAGCAGCCCTGTGAAAGTCAAGGCCAGAGCTTTTC I I I I I I A A l I I I I I I ATT ATT ATT 

ATTATTATTT 

Sequence 171 

TTTAGGGAGCCGACCCACGCGTCCGCTTGGCAAACCTCCGGGGACTGTCAGAGGAGGAGA 

GGAGCGAGAAGGCTATGCTTCGCTCCCGCATTGAAGAGCAGTCCCAGCTCATCTGCATCC

TGAAGCGGAGGTCAGATGAGGCCCTGGAGCGCTGCCAGATCCTAGAGCTGCTCAATGCAG 

AGCTGGAGGAGAAGATGATGCAGGAGGCTGAGAAGCTCAAGGCCCAGGGTGAGTACAGTC 

GGAAACTAGAGGAACGCTTTATGACCCTAGCAGCCAACCACGAGTTGATGCTCCGCTTCA 

AGGATGAATACAAGAGTGAGAACATCAAGCTGAGGGAGGAGAATGAGAAGCTGAGGCTGG 

AGAATAGCAGCCTCT 

Sequence 172 

CCACGCGTCCGCTTAGCCGCTGCCAGAGTTCCATATGTTCTGGAACCCTTGACTCCTAGA 

GTTCAGAACCCAGCCAACTTGCAGTTTTCAGAATGTTCAAGAAACTTCTGACACTCAGAG 

TTGCAGAACCTCCTGGTCCCTGCAGATTCCTGGAAATCAGAATATGGTGGTTGNAAAGAA 

TCTTGTGGCTGGGCGTGGTGGCTCACGCCTGTAATCCCAGCACTCTGGGAGGCCGAGGCG 

GGCAGATCGCCTGAGGTCAGGAGTTTGAGACCAGCCTGGCCAACATGGCGAAATCCCGTC 

TCTACTGAAGATAACAAAAATTAGCCGGTCATGGTGGCGCCCGTGCCTGTAATCCCAGCT 

CGGCAGGGCCGAGGCAGGAGAATCGCTTGAACCCGGGAGGCAAGAGGTTTGCAAGTGAGC 

CNAAAGATCGAGCCACTGCACTCCACCTTGGGTGACCAGAGTCTTAAAAAAAAAAAAAAA 

AAGG 

Sequence 173 

CGTNCGGTGAATATTCAGTMGCTMTTAMATTTGTMGTMGATTTCCTTATTAAAAA 
AACTATTAAAATAAGANGAAGCCCGTTNAAATAATNATTAGAGNNGGAAAAAAGAAGATG 
ACTATCNMTTACAGCACTTTTTTTCAGCTATACATAMGGCCTTTTCANT 
Sequence 174 

CGTCCGGTGACCCATTAAGTATATTTCGTGACCCANAGTTTGAAAGAGATTATAGGTGTA 

GCTTTGCTAGTTTGTAATTGATATAGAACAGTGACTATCAGGGAAGNTGAAGAACGGCNA 

ATTGAATGTAAATCATGTCTGGATGGTGAAGATTCTAAGAATGCANCTAGGGAAAGGGCT 

GCAAAAAGAAGGTGGCAGACTAATGTAGAATGGTGCAACCAGATGAAGACATGGGTGGCT 

TTAGGAATTCCAAAGTGGCCGTGAAGGCCAGGCACGGTGGCCCACGCCTGTAATNCTAGC 

ACTTTGGGAGGCCAAGGTGGGTGGATTGCTGAGCTCGGGAGTTCGAGACCAGCCTGACCA 

ACCAGGTGAAACACCATCCCTACAAAACAT 

Sequence 175 

GGCGGCGGCGGCGGCGGCGGCCGGGACCCAGCGGGCCAGGTGGGGACGGCGCGTNGCGGG 
TGCGGGAGATGCCGTGCGGGACTGGGGCCACNTTGAGCCGCCCGNCTCGTCCCCGCCTTC 
TGTGGGAAGGATGTGCGCGCGGATGGCCGGTCGCACAACAGCGGCCCCTCGGGGGCCCTA
CGGCCCCTGGCTCTGCCTCCTGGTGGCCCTCGCCCTGGACGTCGTGAGAGTGGACTGTGG 
CCAGGCTCCCCTGGACCCTGTC 
Sequence 176 

GAGCTGGCTGGTGTTTGAGCTGTGGCAGAAGCACCTGGGGCTCCAGGGAAGCANGCTGGG 

AACTGCAGGACCTTGCTCAGCCAGGAGCACTTCCCCCTCCTTGAGGCAGGAATACTGAGG 

TGCCTCCCCACAGATGGAGAANGTGGAGAGGAGGATGGGCCTCAGGAGCATCTCAAGCCC 

CAGTAGCAGGANMAGAAAGMAGAGATGCCTGGTTTTCACAGACTGGTTCCTGTGGCTG 

GGATGACTGCATCC I I I I I I I I I I I I I I I I GAGACGGAG Mill GCCTTTGTCGCCCAGG 

CTGGAGTGCAATGGGGTGATCTCGGATCACCGGAACCTCCGCCTCCNGGATTCAAGCAAT 

TNTCCCGCCTCAGCCTCCCGAGTAGCTGGGATTACAGGCACGCACCTCCACCGTNCGGCT 

AATTTTGTA 

Sequence 177 

CCTTGTNAGGGGACACAAAGAAAAATTGAATAAACTGTATGATTTAAAAGATTATCGGGA 

GAGTTACCTCCCGATATAAAAGGAAGGATTTACAGAATGTGACCTAAGGTCTGGCGTAAA 

TGTGCACCGGAACCGAGAAGGCCCGGATTGTCATGGACGATGAGATACACCGGAATATCA 

TGGACATATTCTTTAAAGCGCCCTTTATCTTCAAATGCGGCACGGAAACCGGAGGCTTTG 

AAGAACTCAAGGAAGCGCGGCACGATACCGCCCGCAATAAACACGCCGCCAAATGTCCCG 

AGATTGAGCGCCAGATTGCCGCCAAAACGGCCCATAATGACGCAAAACAGCGACAATGCG 

CGGCGGCAATCGGTGCAAGCTGTCAGCCAGCGCCGCGTTCGGTAATATCTT 

Sequence 178 

CACGCGTCCGACCGGAAATGCTGACCTGACCTTTGACCAAGTACGNGCGGTGGGGGGGGG 

GGACAAGTGGGGTGGTGGTATTAACTGGCTCCGGTGGGTCTTCAAGCCCCAGGAACCCTC 

CAAGGGGGAAACAAATGGAGGGCCCTAACGCAAGAAGCTTCAATCGGTCCCTTGGACTGG 

GCTTCTTTCGGTGGGTTGGTGGGTATGGCCCTGGGCTGGCCTTTNCCTNMTCTTTCCTT 

CCCTNCCTGGGGGCAATCTGGCTGGGTGGCCAAGTGGCTTGCCCCGGCAACAACTTTGGC 

TGGCTGGCTTACGGTCAANGGGTGGCCCCTGGCTTGCCCCAAGAACAAAGGTGGCNTTGC 

TTTGCCCCCGGAAGGGCCCCTNGTAATTGCCCGGCCCCGGGCCAAAAAAGCCAAGCCACC 

TTCMGGGTGGTTTCCCAAGCATTTTTAATTGCCCCCCCAAGCAACCCTTAATTGCCCAA 

CCCTTGTTCTTTCCCCGGCCCAAAAGAACCCCCNAACCCCCCCAACCCAAGCTTATTGGA 

ATTTTCCCAAATGGGGGCCCCCCTTGGCCCTTACAMACCGGGGGTTTACCCCTTTGGGG 

AGGGGAATTMCCCCCTGGGGAAGGMACCGGTTTTGGACCANGGGGAAGGTAAGGCTTT 

AAANCCTTGGGGTNGGGGCCCAAAAGGGCTTCCCCTAAATTGTTAACCCCCCTTGGCNTT 

TT 

Sequence 179 

CGTCCGGAMACTGTTCATCTACTCACTGTAGTGCCCTCCTTGAMTGTGTGTTTGTTCA 

TTCMCTAACAATATTTGGGGATCCCTGTAGTAAACACTGTATGAATTTTACACAGTCTG 

GCCATCAAGAAAGATCACGGAGTATATTCTAGATGGGGAGGCTACTAAGTGAATAGGAAT 

CACCACGCTGGGCTGTTTATTAGGTACAGTAATAAACATAAGTACTGGTTGCAAAANAAA 

ANAANAAAANAAAAAA 

Sequence 180 

CCNCCGCGTCCGAAMGACAAGACAGCATACTGTATTTTTCCTCTTAAAATTCAATGTTA 

CMTTMATGATTGTTNTCTGAGAATAAGTTAGCTTCAGCTTTCTAATCGATGTGTTCCC 

ACATCTACMATTGATATGAAMATTATTTTGAAATGCACACTGCAAAATGGTGAGAATA 

TGAAAGTTACCTGGGAATTAAATCAGAACTGTCTCCATATGACTATTTCCAAGTCACAAT 

CATAACTTTCTTMTAGCAATGGTTATATATGTGGCCAGATAGTATTCAGTTTCACAGTA 

ATGTCTCGGTCACATAAAGATAGCANAGCATAGACATAGTACAACAATTTATTATTTCTG 

CTGATTGCCAAATGTGCATAAAACTATAAAGATATATTTTCCAGCCCAGGTGACAGAGAC 

CCTGTCTCCCNTTTNAAAANCTTCATGNTAAAGGTGCGGCCGCTAGACTAG 

Sequence 181 

CGCGTCCGCTAATCAACTTTTAAAMTAATGTTTTACGGCCGGGCGCGGTGGCTCACGCC 
TGTAATCCCAGCACTTTGGGAGGCCGAGGCGGGCGGATCACGAGGTCAGGGAGATCGAGA

CCATCCTGGCTAACACGGTGAAACCCCGTCTCTACTAAAAATACAAAAAATTAGCCCGGC 

GTGGT GGCGGGCGCCTGTAGTCCCAGCTACTCGGGAGGCTGATTTTTT A I I I I I IGAACT 

TTTTTACAGAGAAGGGGGGTCTCCCTATGTTGCCCAGGTTGGTCTCAAACGTTTGGGCTC 

AAGTGATCCTTCTCACCAAGAGTGCTGGGGATTATAGGCATGAGCTGCTGTGACCGGCCC

AAAATCAAATTTCAAAACTAAAAAAATTCTCAGATAATTAATGAACCTACGTGAATTAAT 

CTACAAATTCCAGTTTGAAAAACACTAAAGATAAACAACTGTCAATGTGGGAAACTTAAA 

AAGTANGTTGGTATTTAGGTTATTGGTTAAAATGGGGGACCGACTGGCATACACAGTCCT 

AAATATTTAAGTCTTAAG 

Sequence 182 

CCCTATCTTNCGGTACTGGTGGGCCAAATTCCTGGGACCAGGTCAAGGTGGGCTGCCTCA 

GTAAGAAGGAAGGACTGGAGAGTGCCCATTTCAGAGGAGCAGGCTGGTGGGGGCCAGCCA 

GAAAGTAGTTCCCTTTGGGGGGAAGATGTTGGACCTTTATTATTTGTGGTAACCAGCCGA

GGCTGGTTGTCAGGACAGCAGGTGAGCCACTTTAGGGAAGAAAGTGCAGGGGTGGGTGGA 

TGCCCAGATTACCAAGGCCAGCCACCCTGATGGGGTAGGGTCTGGTTATCTGTGTTCAAG 

AAGCAAATCCCACCCCAGCCCCAGCACTAGCTCTCTATGTATGTATTTTCCCTGTACAAT

GTTTTATAAAAGAGATCATTAATTTAAAAAAAAAAANAAAAANA 

Sequence 183 

TCGTCCATTTACCTCACTTATGGGGTAAAAGGTCACTTCAAGTAAGGTTAAAGGTTTTCC 

CTGGCAAAGGACCTAACCAGAGCCCCNAAGGGGGGGAAAAAAGGAGGTCAGCTTTGGGGA 

GGTACCATGGCCCNTTTTGGTCTGGCCCNTTGGGCNTCTTCAACAACAAAGGAATATTTT

ACCAGGCCCTTTTGGAGGCCTTTGGATMTTTCTTAAGAAATTTGGTTACCAGGAAGAAT 

TTAGGCTTCNTNGGNAAAAAGGAAAAATTAGGACCTAGGAAAGGGAATTAAAGGGGGNAA 

GGGGAAATCAATTAAGCNTTAATGGAAAGGGGGTTTTACTTCTGGCAATCAAGAACCGGC

I I I I ICNTAAGTTTCNTAATGGAACTTTAAACCGGTNCCTAATANGGGCTNGTAAANGGG 

GTTGTCI^GGCGGTGGNAAACCACCTTTCTTTTTCCTNGGGCCCTTCCCTTTTCTGGNCC 

CGCAATTTNCCCTTCTTTNAAACCCTTCAMGTTTGCCCTTGGAGGGTTTTTAATTAATT 

CCCCCTTGGTGGCCANTTCCCTTGGGGGGCCAATTGGGTTTCAATTTCCANCAATTANTG 

GNAAMACCMANTCCAAGGGGAAGGGACCCCCTTNGGGCTTAANTTTTTCCTTTTAAAT

CTTCCTGGGAATTTTNGGATNGGGGGAAAAAAAMTAAATTAAAATTCCTTTTTGGGGCG 

CCCTTGGCCAAGTNGGGGGGGAAAAATAATTTNGGNTTTTNGGGGGGGGGGNTTTGGGGA 

ATANCCTTACCAAGGAACCCCTTCCTTGGATNCCTTTTGGGGGGTCTTTTCAAAAAAT 

Sequence 184 

GCGTCCGGTTGTAGTTTCCCTTCCCATTCTCTTGGTGGCCCTGGAAGCTTCTAGGCACAA 

GTGTGCCACCCTGATTATTCCNACCACTCCATCCAACTTTTCTCTCTCTGTGGGTGTCTG 

CACCACAAGCTGCCTACCCTCCAGGTGCCTCAATGGTCCGGCCACCAGTTGTGCCTCGGC 

GCCCCCCGGCCCACATCAGCAAGTGTCAGGCAGGCCTCCACCCAGGTGCCACGCACGGTG 

CCTCATACCCAGAGAGTAGCCAACATTGGTACTCAGACCACAGGACCCAGTGGGGTAGGA 

TGCTGTACACCAGGCCGGCCGCTCCTGCCGTGCAAATGTTCCTCAAGCAGCACATAAGCA 

CCTATCGGGTCCAGGAGCCGGCTTGTGCACATNCCAGGACAGGAGCCCCTGACCGCGTCC 

ATGCTGGCTGCGGCGCCCCTGCATGAGCAAAAGCAGATGATTGGGGAGCGTCTCTACCCC 

CTTATCCATGATGTCCACA 

Sequence 185 

GTCGCCCCGCGTCCGGGCATTTGTATTTTCAACAATTGTTCTCAAATTTAGAAAAGAGAC 

ATCGCAAGATGGTGAAATAGGAAGCCCTGGGCCCTCCTTCCCTCCACAAACACACTGATT 

TGACAACAGTTCATGGACAGATTCCCTTTATAAGAAACCAAGAAACTGTTAAGAGGCTCT 

TATACCCCAGGTGAGTGCAAAATCATCCACATCAAAGATAGCTGGGAAGTTCAAGACACC 

TTCTTTCCGTAATTTCTAAACTGGCACAGTACTATATGATTGAGATGAATCTCCCCACAT 

CCAAGCTTCCTGCTGGGGAGGAGAGGGGAGGGTATACCATTATGTCCAATGTTCCAACTC 

CTCTAGGAGCTACCCAGGTAGGAGGTGGGTCCAGCTCTGTCAGGCTTGTCTTAAGAGCAC 

TGATTGAGGGTCTGGTATTCTCTAAGTGGCCCAGGACCATAAGAGCAGTGGATGGTGCTG 
GGGNTNNGNNGTGGGTTACCCATAACCCCCTGGTTTTTGG
Sequence 186 

TCCGAAGCAGTGAAAATAGTGTTTGACTCCATTGACAACCTAGAGGCGGCTCCTCATGAT 

ATCGGCTACGTCAAACAGGCCATGTTCCACTATTTCCAGGTGCCAGATCGGCTAGGGATA 

CTCACTCACCTGTATAGGGACTTTGATAAATGCACGTTTGCTGGGTTTTGCCGGAAAATT 

GCAGAATGTGCTCANCAGGGAGACCCCCTTTCCCGCTATATCTTCAGGAAGGCTGGGGAG 

ATGCTGGGCAGACACATCGTAGCAGTGTTGCCCGAGATTGACCCGGTCTTGTTCCAGGGC 

AAGATTGGACTCCCCATCCTGTGCGTGGGCTCTGTGTGGGAAGAGCTGGGAGCTGCTGAA

GGMGGGTTTTCTTCTGGCGCTGACCCAGGGCAGAGAGATCCAGGCTCAAAACTTCTTCT 

CCAGCTTCACCCTGATGAAGCTGAGGCACTCCTCCGCTCTGGGTGGGGCCAGGCCTAAGG 

GGCCAG 

Sequence 187 

CGTCCGCGTCGCTCCCCTGCTCGGGGTCAGCTAGTGTCCGCTCTGCTCGGCCGCGGGCTC 

CCGGAGGACTGCAGGCAGGATGACCGCAAAAACACGGGTGATTGGTGAATGGAAGTTGCT 

ATGGGCCTCTAAGGGCCATCCCAAGCCCAACCCAACGTTAAACGGTCCACAATCCACCAA 

GGAAGTCMGCTTTTGACACAAACTGCTGAAAAGCTGGGAGGGTTTCTTCTGAAAGAAAG

TTTCTTTTTTCMCCCTGGGGGACACTGGTGCCCTTTCCACMGCCAGGGMTTGGGTTT 

ATGAAGCAAGCTTGGCTCTAAGGGGGGTGACCTCAAGATATTTGCTGGGGGGTTGTGAGG 

TTTGGTGGTTCTTGGGAAGTGTGTCTCAAGCTTTGGGGGCCCTGGAACTGTGCTTGAAGT 

GCCCTCAGGCCTGTGCCCTTCTGGGGCCGGGGGTCTTGTGGGTGNATNCGCAAGCANGGA 

AGCCTGGGGGCCATTGGTCCATTNAAGAAGGCACCCCGGGGCCAAACCTTGCTTGGCTAT 

ANTATTCCAAGCCTGCTTCAACCCNTGGGCAGGCTTT 

Sequence 188 

TCGACCNCGCGTCCGGCTTCGACGCCTTCCCTAACATCGAGAAGGTGTCCAAGATCACGT 

CTCCCGTGCTCATCATCCACGGCACGGAGGACGAGGTGATCGACTTCTCGCACGGGCTGG 

CGCTCTACGAGCGCTGCCCCAAGGCGGTGGAGCCGCTGTGGGTGGAGGGCGCCGGGCACA 

ACGACATCGAGCTCTACAGCCAGTACCTGGAGCGCCTGCGTCGCTTCATCTCCCAGGAGC 

TGCCCAGCCAGCGCGCCTAGCGGCGGCCCCAACCGGCCGGACCTCAGCAATAAGGCGGCC 

CCCGGACCTCACCCCGCGCCGGCCCCCACCCAGGGGCTGCATGTGGACCCCCCGGGCGGC 

CCAGGGGACCCCGCCCCGACCCAGGGGCTGTGGACGATGTACAGGCAACAGAGCTACCGC 

ACTCCTTTCCTTTTGGAAGCAAGAAGAAAATACGTGAAAACGGGAAATTAAAGATTTAAA

AATTTTTTNNNNNTNNANAAAAANNAAAGTGCGGC 

Sequence 189 

CGCGTCCGAAGCCTTTTGTCTCAGAGAATTTATTGTCCTGACAGCAGAGGCCGATGGTGG 

GATCGACTGGCCCTTAATTTACACCAGCACTTGAAGCCGCCTGGAACCCGACTATCAAGT 

GCATCACAGAGGGGCTGGCGGATCCCGGAAGTCAGAACGGGACACCGNCTTTCACTGTAT 

CAGCGAGCCGTGCGCCTGCGAGAGTCTCCGAGCTGTAAAAAGTTCAAGCACCTCTTCCAG 

CAGCTCCCAGAAATGGGCTGTGCAAGATGTGAAACACGTGACCATCACAGGCAGGCTGTG 

CCCACAGCGTGGGGATGTGCAAGTCTGTGTTTGTGATGGAGGCCCGGGGGAGGCCCGCTG 

ACCCCACCACGGTCCTGTGCTCTGTGGAGGAGCTGGCACTGGCCCATTACAGACGCAAGC

GGTTTTGACCAGGGGATTCATGGCGMAGGGTCCACCTTGAGCACCCTTGTATGGCCTTC 

TTNCTGTGGGGACATCATCTTCATNGGATGGGATTCCCGGATGTCTTCAGAAACGCCTGT 

CANGCATTNCCCCTGG 

Sequence 190 

CNACACATGCGGAATCATAGGCCTGCAAAGCTCCTGCTATTCACTATAACTCTGCCATGC 

CTTAGGCACTTCCTAACCTAGAATTCTGAGTGAAGGACAACAATAACTAATACTTTTGAT 

TCAGGTATTACAAAGAAGTTAAGAGTTCATAAGGCACCTAAGTAAAGTCACATTGGTTAA 

GAGTACATGTCTCCAGATACTCTTACATTTGCAAAGNMTTGCATTTCTGNATCTATGGT 

CTGTAMTAMATTGAAGAGTTGNGAGMTAAMGCATGTTGTCTTTGATAAATTGTTTT 

TACAAAACAGGCACAAGAGAGGCTTGAAGGGTCCTTGCTATCTTTTAACCTATTTTATAA 

TCTTTGCTGCATAAGAAACAAATATGCTTATTTACATTCTATACTTAAACATATTATCAA 
ACTTTTATTCTGMGATMTACACATGAAGGCTTTMACTTTAATCTTCAATATATTTTT

TAATCTCCCTGATGAATTAGGGAAATAAATATTGGG 

Sequence 191 

TCTCCCCCTTCACCCTCCI I 1 1 I I I I 1 1 I 1 1 I I I GTTTTTTCTTAAAAGCATTTAGCTGG 

GTGCAGTGACACATGCTTGTAATCCCAGCTATTGAGGAGGCTGAGGTAGGAGGATTGCTT 

GAGTCCAGGAATTTGAGGCCAGCCTGGGGAACATAGCAAGAGCCTATCTTAAAAAAAAAA 

AAAAAAGCATTTCAATTATTTTAAATTTTNTMTTACAAAACAATTCTTTCCTGTCTTAG

TTTAGTTTACTTTTTACTCACAAAGTTCCTGGAAAGTAGTTATTACTCAATAATTGAATG 

CATGAGTGTCMCTGCAAAATCTATGCATTATGTAGNGATTTGAATCAATTAGTCTTTNT 

TGATACTCCAAAATTACCCTTTTTCGAGNGTCTTTATCANAAATTTGATAAATCGGAACT 

T 

Sequence 192 

TCCGCCCCGTGAGGGGCAGCTGTGGTGTGGTGTGATGATGCTGGGATAGGCACTCAGATG 

GTTGAGTCTGAGTTTGGCTCTGATATTATATCAACTGCATGGGCAGCTGTGGTGTGGTGT 

GATGATGCTGGGATAGGCACTCAGATGGTTGAGTCTGAGTTTGGCTCTGATATTATATCA 

ACTGCATGGGCAGCTGTGGTGTGGTGTGATGATGCTGGGATAGGCACTCAGATGGTTGAG 

TCTGAGTTTGGCTCTGATATTATATCAACTGCATGGGCAGCTGTGGTGTGGTGTGATGAT 

GCTGGGATAGGCACTCAGATGGTTGAGTCTGAGTTTGGCTCTGATATTATATCAACCGCA

TCATTTTGGGGAAGACACMTTTCTCAGMTTTTATTTAAGTTGTAAAAATAA 

Sequence 193 

ACACATTGCGGCACCGGGCTGGGCCTGGCCATCGTCAAGCATGTACTGCTGCGCCACCGC 

GCGCGCTTGGAAATCAGCATGTGTGCTGGGCCATGGCAGTACGTTCACCTGCCATTTTCC 

GCCAGCTCAGGTGACGCGCACACGGCTGGTCGGGAATGATGAATAACCTGAAGTGACGAC 

GACCCAATGTGGGAGCTGCTGATACTTTGCTGTCTCCGCCCAGCAGTGGGCACTAGGCAA 

GGGCCCCATCAGCCGCTACATTGGCCGACTTGCGCCTGCCTTTCAGGCCCTCTCGTCACC 

CCTTATTGAACACACGGAACCTGCAAAACCCCATCATGGACCCTTCCCCTGGTATTACCC 

TCGCTACACTCTTCGCCGATTTCGGCATGATTCTTTTTGCACTGATCCTGGTACTGCTCA 

ACGGTTTCTTCGTTGCGGCGGAATTTGCCATGGTCAAACTGCGCTCCACCCGGGTCGAGG 

CCATTGCCCACACCAACGGGCTGGCGCGGGCAGATCCTGCGCACCGTACACAGCGAGCTC 

GACGCCTACCTGTCGGGCTGGCAGCTGGGGTATTACCCTCGCCTTCCTGGGTTTGGGTTG 

Sequence 194 

TGAMTCTCCTAATACACTGNGTTTTTATTGTTATGTATTCTATGTTTTAAAGCTCCTCA 

AGGTATTGTTATTCTTTTGTATAATCAGTGTTTGTTTGGAGCTGTCTGTAAATTTTCCTC

TTTTTTGCTCTTTATTTTTTCTTTCAACTCAGACCTTCCMAATGGGATTATTTTTCTTT 

TTTCTTTAGAGACATTTGCTTGATMCAGMTTTTMTTTGGCGGCTATTTATTTTTTTC 

AGCACACTGCAGGTATTAMCTACTTGGCTTCATTGTTGGTTTTGAAAAATCAGCTGTTC 

TMTGTTGCTCTTTGAATATAATCTGCCTTTTTCTGTACTTGCTTGNCTTTMGTTTTTT 

GTT 

Sequence 195 

CGCTCCCTGGTTTCTTGTCTCATGAANAAGAAAAAATCCTAACTGTTCTTGATGATCTTT 

AAGGCTCANAATGATCTGGACAGAGGTATTTACCTTGAAGCTCATAAAGCATAGGCCTTT 

CTCACTTGCAGAGTATCTTTCTGAAGCTGGACTAAATTGGTTAAGGCCACTGTACTTTTC 

CACTTTGTCTTCTTTCCTGTCACACACCCTATCCTTTCAGGCTGTGTTGGGATGGACAAA 

AGCAGTTCTGGAGTCCTAAGGAGAAGATGAGTGGGATATGTTTCTGTGACCTGCAGTCAT 

7TTAAAGTTTAGCTGTTGCTAGCTGACTCCATGTAAGAATACCTTCCAGGAATTTGATGG 

CTGTGCACTCTGGCAGTGCAACTGGCATGGT 

Sequence 196 

CNCGCGTCCGGCCCTGATTGATGAAGCACAGTCAGTAAATCATCTCTTCATTCCCCAGTT 
CTTAAGCCAACATCAGCAACACTGAGAGAACATTAGATTAAAGGCAGGTATAGAAGAGAG 
ACTTAGGGTAACAAGTTAGTGGGTGCCTGAAGGCATGTGGGAAGAGATGTGGTAAAGGTG 
TGCACACTATTCCTTACGTGACTTAAAATGTCAAGGGGCTTGGTGTGGTGGCTCATGACT
GTAAAACTAGAGCACTTTGGAAGGCCAAAGCTGGAGGATAGCTTGAGGTCAGAAGTTCAA 
GGCCAACTTGGGCAATATGGCAAGACTGTGTCTCTACCAAAAAAAAAAAAAAAAAAA 
Sequence 197 

CCGTCCGGAGAGGCTGTTNTTGATNGATACTCCAGAATAGACACGGGCCCTTGAAAAGCT 

TGTACCTTTGGGGAAAAGTAGACCAGGAAAATCTCAAGCCAGAGAAGGACTATGAAGCTT

CI I I I ICCAGTTACACATGGAAATAAGAGGTCTTGCATGAGAAACCTAAATCAACCCCTT 

TNTIvnTTAAGCAGGTTTTGGAGTCTAAATTTATACTMTTGCATTGTGGAGCAGGTCTAG 

CCAAGAAATTAACATGACAATCAGTCTCTAATCTGTCTGTCTATCTACTTACTTACCTAC 

GAGTCAGGCCAAAGAGATTCCCACAAGTAGGTTAAAAAAAGTAAAGTCTGATAACAAATA 

ATTAGTAAGAATGAATGAATAAATAAATAGCACAAAGGAACACTCTACCACAAGGAGGAG 

TACTCATCACATCAAATGAGAGAATACCCTAAGAAAGTGAGATAATGCCAGCAATCTGAA

AGATATTTTACCNAAACGTGTTTAAAATGNTTAGGTTATTAAAAAAAAAAAAA

Sequence 198 

CTAAGCTGTTATTTTCCCTAAAMTGCTTCCCTTGCATTTATACTTATTAAGAATAGATA 

ATAGCTAACATCTATCCACTGCCTTTGATTCATGAGGCCCTCACGAATTGCCTCATTAGG 

TCTCCGACAGTATGGTACAACTACTCTTTATATTTTACAGATGAAGAAACTGAGGCTGGA 

CTGNTACCTTTTTGCTACACATCCTAATG 

Sequence 199 

MCCTGTTTTGTTAGATGTGAATCTAGGAMTACAATATATTTTTAATGTAAAAGNACTC 

TTGCTTTACTTGTAAACTGATTTTCGTTTTTTTCCCCTCAGGCCATCAAGCCCTGTCGTC 

CT ATGAC CAACAATGCTGGCAGACTTTTCCACTACCGGATCACAGTCTCCCCGCCTACGA 

ACTTTTTMCTGACAGGCCAACTGTTATAGAATACGATGATCACGAAGTATATCTTTGAA 

GGATTTTCTATGTTTGCACATGCCCCCCTGACCAATATTCCACTGTGTAAAGTAATTAGA 

TTCAACATAGACTACACCGATTCATT 

Sequence 200 

AATGCAAGMCATCTGGATNAAATGGCTTTCTAGATGAGMTGGTTGCATTTTTTAATGG 

CTATTCTGGTAGAAAGGACAACATGTGATATTCATCGACCCTCTTTCTCAGACCCTCTTT 

ATAACAGCTAGTATGGAAAAATCTGTCTTTCCTATAMTATTTTCCCTGGGGAATGGNTT 

CCCATTMTGTGTNGGATGCTTTGNTGTTTCTCTCTTGCACAGCGTGTATGTTGTCATCT 

GCACTTMGMGTGGMTGGAATTGGGAATCCTGGTTTCTTGCTCTGGCTGGAGGTCCCA 

AGTCCGCATTGTGATTTTGGGAAGTCACCGTTGTTCTTCTAGATCTAATTTAACTCATCT 

GTAAAANAANNNGGNTNGAATTCCGACMTTTCATGGGTTTCATTAGATGCCAATATTTT 

ATGACTCATGATCCCAGCNAGACTACACCTATTTATMGGCTGGTTTTGCTTGTTTTTAC 

TAAGAGCAACAATNACTACATATTTTCAGGTTACTCAATCATCAAAAAAATTATAAAATC 

CATAAACACTTTGGATTTGAAACATTGCAACTTTG 

Sequence 201 

CGTCCGAAMGCAGTCTTTCTTGCTCAAAGTATTAANGGTGAACAATTGAATAGAGTACT 

GTGGTCGGGAGACTGATTTGAGACTGCAGAGCTGATGCTGGGTAGAGGGTCTGGACTTGT 

ATTCATGTTCTGTCTCAGGGCAGCCCCTGGAGCAGGAGATGGCAGAGGCATTTACAGCTG 

CAGAAAACAGGGAGGAATGGAATCTGAGGTAGCCCTGGCCTCAAAATTCAGGCCTGGCTG 

TATCATTTACAGAGATTTTTCTGGAGGGAAAAAGTCTCATTTCTGAGGAAGGCAAGGNGG 

GCTAATCATTATTAA I I I I I I I I AAACTTTTTG 

Sequence 202 

GCGTCCGGTTGAAGAGGGCAGGGAATAGGGGTGGGTGAGCGTGAACAGAGTCAGGCTGAT 

TGCTGCAGGGTCCCTTGCATTAGTTCAGGTGAGAAGAGACCCGAGTAGGCCAGTGAGCCT 

GGAGGAGAGGCTCTCTGTGTGTTTAATTGGTTTCCAGCI I I II I I CTCTATTCATGTAGG 

TTATACACGT7TCTTCTGTGAATTTTTATTTAAATGA I (Ml I GTTGTTACTGGATCTAC 

AAACAGCCCAACTCCAAGGAATCTGGCATCTCTCAGTGGAGCATACAGGTGACTTCATAA 

TCTAACCGCATTAGTMCTGCCAMATCGGAAGTAATTTCTCTCTGTTTAAAAGGCAGTG 

AAACAAATTTTCAGAGCAGGTTTCTTCAACTGAACAAAATATTTTGGACCTTAAAGGTGG 
TATGGCTCTCCTGATCAGGGAGGGACAGTGAAAGGTTTGAGCTCTGACACTGNCCAGCTC
TCTGGATACAACCAAGTGA C I 1 1 1 I I I IG 
Sequence 203

CGCGTCCGAGTATATAGAAAAACCATTAATTTAGACTCTGTGAGATTAGGTTGCATGAAG

AAGGTTTTCTGAATATTTGAAGAGTGGATAAATAAATGTCCCCCAAAGCAATAAAATCAT 

AATCCTTTAAAATATAGGAAAMTAACTAATGGGAACTAGGCTTAATACTCGGGATGAAA 

TAATCTGTACAACAAACTCCCATGACACATGTTTACCTATGTAGCAAACCTGCACATGTA 

CCCCTGAACTTAAAATAAAATTTAAAGTAATAATAATAAAAAAAAAAAAAAAAAAAAAAA 

Sequence 204 

CGCGTCCGCTGGGAGGCTGTGGGGTCTGCCACCCAGCAGATCTGTGTCACGGGAGTGGCG 

CTGTCACTCGTTGAGGTGGTGGCCTGGTTCCTTTGGCCTTAGGGAAGGACAAACTTCAAC 

TCTGAGCCTTGATTGAGTGACCTTGGCCAAGTTACCTAGCTTTTCTGAGCCTCACTTTTT 

TGGCNATTAGATGAACCAGAGGTTTATTTCACTCAGAATCCTGTTCACGATGCTGGTATT 

TGGACCAGCCTGCGGGTTTATCCTGGGCTCTTTCTG 

Sequence 205 

CGCGTCCGAAAAAGGATGAGAAGAGAGGTGCATTCCAGAAGACAAAAGGTGTGTAGTATC 

AGGATAAGGGGCTTTAAATATCAGATCCAGAGAACACTGCACATGTAGAAATGGGCTTGG 

CCTGGGTCAGGGCATTGAGATTGGTTACATAATCTTTTCAAGGATTGGTGAATGAGTTGG 

AGTATGTGTAGAAACCTACAAAGATGACAGTTTAATCTCATGTCATAATTTTTAGACAAA

TAATGTATTTTAAAACTGGGTGCAGTTCCTAAAGCTGTTCTAAAAGTCAATGCAACTGAA 

TTTGGAATGTAAGCATAGGACAAACAGATGGGAAATAAGTCATGACCTCTGTGGGATAAA 

GTGAGAGTTATCAAAGAATGTCAGTGTTTATAACAAGGAACAAGCTTGTTTTGGAGAATT 

ACTAGATATTATGGMMTTTTTTTCTTTTCTACATTTGGGTAACTATAGCTGAACTATA 

GCAGATCATATTGACTTGGCAAAAAA 

Sequence 206 

CCNCGCGTCCGGTTATGTACTTTTGGAGACTTCCATTAGAAATATTGGCAAGTCCCTGCT 

TCGTGGCCATAGATTTAAMGGCCTATCMTTTTAAATGTTTCGGTCATTGAGAGCTAAA 

ACATGTAACATATCACAGTGTTATTCACCAGAAATAAAAAATCAAGAGTCTGCTCAGAGT 

AGGTTAATATGAGTTCCTTTCTTCAGTCCAGCTGATGGTTTTTAGTAAGATGAACTGCCA 

AGGAGACAATGAGCACTGACTTCTCGATGCATGACTTCATCTTGTTAGAAGGTGGGTTGC 

CGGGCCGCGGTGGCTCACGCCCGTAATCCCAGCACTTTGGGAGGCCGAGGCGGCCGGATC 

ACCGAGGTNGGGGGGATCGAGACCATTCTGGCTACACGGTGACACCCCGTCTCTACTAAA 

TATACAAAAAATTGCCCGGCCGTGGTGG 

Sequence 207 

CCNCGCGTCCGATGAATAGTTAGCCCATGATAAAGGAATAAAAGGATGAAGAATATTTGA 

AGAGAAATAAATCTTCCTCACTCCTCAGGTTCCCTTCCATGTGCAGGAGCCTCAACCTAC 

AACTAGCAACCTTATCTCCTGACTCATTCCTCTCTAGAGGAGGAGTAAATTAGTCAACTG 

ATATGCTCTGGAAGAAAAACCCA 

Sequence 208 

CGTCCGGTCTTTCCTCCCCCTAAATAATGCATTACAAAGTGGAAATGCAAATTTCCTGTG 

CAAGCTCTAAGTAGCAGGTGGTATTTCCTTAATATATTGTTTTTGACCTTTGGGGAAATT 

GGTATTACGAGCTGACTTTGGAAAATTAAAATAGCATCAAGGTCCTACATTTTAAATAAA

ACAATCGATATCTTAATTTTTAAAATCAGACTAGATTACGATACCAGGAAAAGACATACA

TATTTTGCTTTTATGTGTTAMGTTTTGTAATTCAGGGAGGACAAGAAAAGGGATATGGT 

GCAGCTGAACTTTCTAATTCACTAAGACAGGAAAAAAAAAAAAAAAAAAAN 

Sequence 209 

CGTCCGGGAAAAACAAGGGTTTCCGCCAACAGGCTGAGAGCAAAGGAGGACGCAGGAAAA
CTATTTTAAAAATTGACCCAAGAGTTCAAAAGGCATATGGAAGCATTTAATGGGGGTGGG
AGGTATCCTTGTAATAAGAATACCATGCATGTATTCCCACACTGCTCTTGGTGGTCTGCA
AAGTGATTTCATATGTATTTTATGTCAACACCAGCACAATGAGGTAAGTAGGACTGTATA
CCTCAGAGGCATTTGGTGATTTGTCAGAGTGGAGTGTAGTGTGTTGGTGCCCAGATTTGA

ATAGGATCATTTGAGTCTGATATCATCATGTTGCCCACCGCCTACTCAGCCTCTACACCC 

GATGAGGCCAATCTGCAGCTCACTACAGTCAATAGAGAACAGGCAATTAACCCTTAAGTT

ATATTTTAGAAAGATTTCTGTCTAAAATAGATAAACTTGAAAGTATAGCTCTTCAAAATA 

ACGTATTCCTGTGTTGGCAMTATTTTCCAAACTCACAATCMCACATAGGTGTATTTCT 

TAGACTACTAGAAGTGGGGACTTACCCCAA 

Sequence 210 

CGCCNCGCGTCCGGGAAAAACAAGGGTTTCCGCCAACAGGCTGAGAGCAAAGGAGGACGC 

AGGAAAACTATTTTAAAAATTGACCCAAGAGTTCAAAAGGCATATGGAAGCATTTAATGG 

GGGTGGGAGGTATCCTTGTAATAAGAATACCATGCATGTATTCCCACACTGCTCTTGGTG 

GTCTGCAAAGTGATTTCATATGTATTTTATGTCAACACCAGCACAATGAGGTAAGTAGGA

CTGTATACCTCAGAGGCATTTGGTGATTTGTCAGAGTGGAGTGTAGTGTGTTGGTGCCCA 

GATTTGAATAGGATCATTTGAGTCTGATATCATCATGTTGCCCACCGCCTACTCAGCCTC 

TACACCCGATGAGGCCAATCTGCAGCTCACTACAGTCAATAGAGAACAGGCAATTAACCC 

TTAAGTTATATTTTAGAAAGATTTCTGTCTAAAATAGATAMCTTGAAAGTATAGCTCTT 

CMAATAACGTATTCCTGTGTTGGCAMTATTTTCCAAACTCACAATCAACACATANGTG 

TATTTTCTTAGACTACTAGAAGTGGGGAC

Sequence 211 

NCGCGTCCGGTTTCNTTGGGATAGATTTTACCTATGAATTCCTCCTTAGAATTCTGAAAT 

TGCTCAGATTTACCCAAATGACAGCCAGTTTCTCATTTCACATTTGGGGGCTGTAGAATC 

TTCCAACATTGAGAACCTGTTTTMTCAAAGGATGCTTTGTGGAATCCTGAATGAGGAAC 

AGCATGTTGCAGGAAGMGAGAAGGATCCTGATGCCCTMTGGGACTGATTTCCTTTTGG 

GGGGCAGGAAGATATATATTCGTTGGGTGCTTATAAAAGGTTAATTCCAAAGATTGTGTA 

TGGTTAAAGGACTGAAAGTCACACTTAGCCTCATACTTCACTTAGATGAAAAACAAAAGC 

CTTCCTCTCCATTACCTTGTAAGATCTATTCCTTGTGTCTTGTGCTGAGTGGACCTGGAA 

TAATGGATAGCCCTCACTGAGTACCTAGAAGGGGACTAGGGGTGGGTGATGAAAGGGGGT 

TCACACCGAAGATCTAAGTGCTAGCTTGGGTA 

Sequence 212

CACGCCCCGTGGCCTTGCTAGAGATCCATATAATGCAGTCATGCTGTTTCTTNCTCCATA 

GTATGTGGGGCATGAGGAGGAGACAGGGAGAGGGTGGCTTCATTGNGCAAANGNGGAATG 

GCTGTGCTTTGGGGCCAAGGAGATGCTGTCCTGCTGTAGCTGCTCTGTGAAAGGTCAGGC 

CTGCCCCTNTGAGGCTCCCTTTATCCTCCTAAATTCTGGGGCATCTACATGACGCTTTCT 

AGTCCACCTTTGCCTNCGCAGATCATGGCTACTAACCTGACCTTTGTCTGTACTTGAGCA 

CCCTTCGCGATTTAACTTNCATGTANCGTCCGACTTCTMTATGGATTTGAATTTNTTGA 

CTGTTACTGCTCANAACAATCACCCTTTTTTGAGCAGNGAGCTGGNAGGATAATTGCCGA 

CAAATGACATTNGGANCCGTTTTNAACCACAGGGGGCATGGGG 

Sequence 213 

CGATCATTTTATTAGAGTNATGTATTTAAGAACTGATAAATCATGGGCTTACCTACACAA 

TGTCTAGACACATGAGCAATGAACAAATAGCAAGGTCTGTGATATCTCATATGGCAATAC 

TAGGACTGAGATTATTTTTGTTACAATTAAATAATTGTCAGTAAAAATCCACAGAGATCA 

TTTAGMTGGGMAAAAGTCTGATATATTTTGTTrCAGATTATMTCATTAATAGGGTAC

CTGACAGTTTTCAAAGTTGTTTAATGTTTTTTAGGTCTTTAACCTTCCTAATCCTACAAG 

GTGGTTACAATCCCACATATTATCCTTGTGTCAGGGGTCTCCAGCACCTGCCTTAGGGTC 

AGTGATTTGCTAGAAGTACTTACAGAACTCAG

Sequence 214 

ATCTGACAGCCTGGAACNGCACCCACACCCCCAGGTGAGAATCTGATGTTCTGGAGCATC 

ACACACAACCACAGGTGAGCATCGGAGAGTCTGGAGCAGCACCCACAACCCAAGGTGAGC 

ATCTGACAACCTGGAGCAGCACCCACACCCCGAGGTGAGCATCTGACCTCCCGGAGCAGG 

ACCCATACCTCCAGGCGAGCATCTGAACCCATGGAGCAGCACCCACGCCCCCAGGCGAGC 

ATNTGACCGAACAGAGCAGCACCCCCTCTA 

Sequence 215 
TCCGGGAAAGACTCTGAGAAAGGACTCCTGACCTGGCTCTTCCAGAATGAGGATGGATTT

TCCATATGAAGGAGCCGGGGGCAGGACCTTCTGGGTAGAGCGTGAAAAGAACACAGTATG 

TGTGGAGAGCGGGAAATTGTGTTGAACTGTCTAGAAAACAGAGCCACAGGAATGCGTGTT 

GAGGGCTTGGGCTGTACAGTGGAGAGTGCCCTCCCTGGCCAGGGAGTTGGACATTCATCC 

CACCACAAGACCCCATGAAGAGTTCCTCAACAGCTCTGTGTCCTCATCAAACCTGTGTTT 

GCAGAACAGTGGAGGAAGAGAGCTGAAGGAGGGGAGAGGCCCTGCACCTGCCAGGCCCTG 

GCCTAGACTACAAGGGTGAGCACTGAGCCATGCTCTCGGGGAACCTTCACTGGAGTTGAG 

GGCAGTGAGAATGTTTAAAAAAAAAAAAAAAAA 

Sequence 216 

ACGCGTCCGATGCACACTTGTCTTTTACCACANGGGTGGGGCGTGGGANGGAGGTTTAGT 

TTGGATAGCCACGTAAACGCCTTTCCCTGTGGCCTGCGATGTTCCACACCGTTTATGTGT 

GAACTGGCTGCACCCGCGCCTCCCGGACGGGGCTGCCAGGGAGGAGGGCCCGGGAGACCC 

CATCCAGACCCCGGCCCGCACGCTGCAGAGGTCTGCTCTCAGACATGTGGTGGGCTCCGT 

GTCACGGGTAAGGGGTCGGGGTCTAGACGGCAACAGAGTGTCTCTCTCTTCCCGCTCCCT 

GGTGTGCCCACCTCCCTGTACAGTGTTTCTGTTCAAGCTGCTGCAGGGGACGGGGCATTT 

TTCGTCCAGACTCTATTTTCCTGCAAGGMGAGCTGCTGTCCTTTTTCTTACTGAAGCCC 

CTGATTCTGTGTCTGATGTTGCTGACCGCCGTGCTTGCTTTCTTGCCCGTGTGCAACTTC 

CAATCCCAAGCACACGTGCTCACTTCCAAG 

Sequence 217

GCGTCCGGGGAATGGCTGTNAGTNAAGTTAGAGGTAAMAMTTTCATGTTMGATTTTG

GMCTGGATTTTATTTMAATAATGATGCGMGCCATTGAAAGGTTTTTGGTGTGACAGG 

ATAAATTTAAMTATGAACACACCAACGCATACTTCTTTTAAGAAAGAACCTGATTAAAT

TTGGGAATTTTAAAATAAAAACAGGAAGCATATCGTACTCTAATATAATAATTCAAGGGT

TTTTATTTTCCTAGAAGATCAAGGTCATGTTMTAAAGGGAATATAGTTTTCTTATCTGT 

GTTMGACACTGATGACTTGCAAAGAAMGTMCACTTTTGTGATATCCTTAGGTAATTC 

AAGAGGAAACGCTTGAGCAATTACTGATGTTGTAAACTGGGATCAGAAGACATA 

Sequence 218 

NCCCCGCGTCCGCCCAGTTGCAAAGGAGATGTTGTAGGATGTTAGGTCTCAGCACAAGGA 

ACCCAAACCTTCAGGGGCTCTCCTCTACATTATGCTCCCATTTTTCTCCCAAAATATCGA 

TCTCCACCCACCCTAGACATAGAACTGGAGMTAAGTTCCAGTTTCATCCCTTTGCAGAT 

CTTAGGGGGACCCATCAAATTCCCAGCCACTGGGTGAAAATCAGCAGCTTCTTTATAGGA 

CCTGAGTTGCCTTCCTAGAGGATCCTAGAGGAAAAAAAAAAATCTTATCCTTCAAATACT 

GCTGTCTTCCAAAATACGTAAGGACGCCACGGTGAATCATAGTGGACACCCTGCATTGGT 

TGGGTTATTATTTATCCTAGMGCTTGGGTTCTTGGAGCCCTAGCTTATTTAAGCAACAA 

AGTCCCTCACAGCCCACAGGTGAGGAAGTGAGTGTAACAAAGAGATCATTGGACCTAAAA 

TCAAAACACCTG 

Sequence 219 

CCCGAAAGGAGGTTTGTGGAACTTGGAGAGATCCAGGAGGTAACACCAAAAAGCTGCATT 

TAGCAATGCCTGCCTAGCCCTCCTGTACAGCTCATGTTATTTTGTTGCCTTAGGCCATCT 

TACACCMACCATTCCTATCTCCATGCTTTTGTTTATGCTGTTGCCTCCTTTTAGAAGGG 

CCACGCTCCACCGCTCTGCTGTGTTGATACTACCGACCCTTCCTTTAGTTCCTGTTCAAT 

TCCCAAGCCTTCTGCCAAGCCTTCTTGGACATTCCCATCCCATGCTGACTATTCCTTAAG 

CTAAAMCCCTTAGAGTTAATATGATACACTTGGCTACCTCATTGTTTCATATCAGTCAT 

TCCTGTCTCTAGCTATAATCGGCTCAGCCAAGGAAGATATATTTATATGGACAATGTCTT 

TGTGCCTTGGCTGTAAACAGTGTTGAATAATTAATCTCACTTGATGAGGTCTTACTTAAT 

GATAGCCTCC 

Sequence 220 

CTCGTCCGGGCTGGATCCGTCTGCNCCACTGCAAGGGCAAGATGCAGCTGGTGGCTGACC 
TGCTGCTGCTGTCGAGCGAGGCGCGGCCCGTGCTCTTCGAGGGCCCCGCCTCCTCTGGTG 
CCGGCGCCGAGTCCTTCGAGCAGTGCCGGGACACCATCATCGCGCGCACCAAGGGGGTTT 
TCATCCTTACCCACGACGTGCAGAGCCAGCTCAACATGGGCCGCTTCGGGGAGGCGGGGG 
ACAGCCTGGTGGAGCTGGGCGACCTGGT6GTGTCGCTGACCGAGTGCTCGGCCCACGCGG

CCTATCTGGCCGCTGTGGCCACGCCGGGCGCCCAGCCCGCGCAAGN 

Sequence 221 

GATTTTGGCTNCTGCAACCTCCGCCTTAGGGGTTCAATGCAATTCTCCTGCCTCAGCCTC 
CAGATTGGGATTACAGGCATGTGCCACCGTGTCTGGCTAATTTTCTTGTAATTTTAGTAC 
AAAAGGGGTTTCACCACATGGGCCAGACTGGTCTCAAACTCCTGGCCTAAAGCGGATCCA
CCTGCCTCGACCTCACAAGGTGCTGCGATTACAGGGATTACAGGCTGGGATTACAGGCAT 
MGTCACCACACCCAGCCTMMATTAAAAGTTTATACTGTATTGTTCACATTGTGATGC 
AACTTTCCATATGTATTTCCAGAATCAACTATGTATCAAGGAATATTGAAAGCATAAAAT 
GAGATCATGGTGAATTCTCTCCCATTGTCATTCTTGNGGTAAGGGAATGAATGGTGG 
Sequence 222 

CNCGCGTCCGAGTTATTNATATMAGAACATTTTTCTGTTTTAGAGAGAACCCATTTATT 

TGTGMAGAAACAGAGTTTTGTTCTTATCCCTATAMCACTGCATTTGCTGTTTCTTCTC 

TAGCTGATGTGACATATTAGGGAAGCAGCACCCGACTGGGACTCAAAAGACCTGGGTTTG

GAAACTTTTTGATTACTAGCTTAGCTGTGTGAAATCAGGCAGATGGTATAACTTCTCTCA 

TTGATGATGTCAATCTTTTTMAAATmCTAGCAGTGAMTTTTAA 

AGGTGAAACCTCAAACTTACAACAGTGGTAAAATTGGAAGGGTTCTGGATGGAGTGAGGG 

CAGGGAGGGGGTAACTGGGCTGAGGGGCCCCTCTTCAGCCTCTTCCTCCCCTTCAACCGA 

AGTCTGAAAATCTCTGGCTMTATGGGTGGATCCCTMGATTTCTTTTGTACTTTTAACA 

TACCGAAAAGNTATTCTAAAGMTTTTGGTATGGTTTT 

Sequence 223 

CGCNTCCGCCGAGCGCAGCAGTCTCCCCCTAACCTCAAAAGCCTCCTCAGAAGTAACCTG 

GTATCAGTGGGCTGTGCAGATTCTTATAGTCTTCTTTGCCTTTATATATGAAAATAACTT

TGTTTTATGTTTGTTTTCACATAGGTGGAATTATACTGACTCTTACTCTGTGACTTGCTT 

TCTTCACATGACAGTAACTCCTGGACTGTTTCTGTAGCAGCCACACAAGTCTCCTTTATA 

TTGCCAATATCTATCTTTATCCCAAAGACAAGATGAATGTTCAAAAACGGTAATCCAAAC 

TCACTTCTAAAAATGGGTGTGTGTTTAAAGAAGAGCTGCTCAGCTGAGGCAGTTTTGCTG 

CCGAGGGGATTTGAGCAATGTCTGGGGACGTGTGGTTGTCACAACTTGTGAGGGGGCTC 

Sequence 224 

GCNTCCGTCAATTTGCATCAGTTGCCTTAAAGAATGGGGTAGTTATAGGAAGCTCACAGA 

GAGGMAAAMCTCTTCATGTCTACATTGCTTCTTGMCTGATTTTATTTATAAAAGAAG 

MTTGTTGAGCTAGTGATAGAAGTTTCATGAATTCCCTGGTACTMTTATCAGTTAAATG 

ACCTCTCTGMGTCTCTGGAGAGCGTTCTGTGTCACTAATATTGGTGAAAATTTGAAACA 

AAMTGCTCTCCCATTGTCCACATATTGCTTCTTTGAATTTGGTTTTTCGAGCCAAGAAC 

TTAGGGTGTGAGAATATGTTTGTGGGGAAACCCACACMATTTTATGTTAGTCTCTGTAC

ATTTAAATTTTTACCTTCCTGATTACTTACGTAAGACTAAACAATTTMGTTTCTAAAAT

GCCATCACTTTTGCAAATAAAGGACTTTATTAAAGNTGATAAATAACAATNATGGGCCAT 

CAGCTCCACCTATAATTAATATCCTTGCCTGGCACCCCTGGAAGGGACTTAGCTTNTT 

Sequence 225 

CNTCCGCAATTACTGCCTTAGGCTAGATTCCCATGAATANGAATTGCTGAGTCAAAAGGT 

TGACACATTNTTTAAAGGTTAGATACATATTAGTCAAAGTTTTTAAGCAANATGCTTTCT 

MGCCTNTTTGATCTTTATAANNCATTGNTCCTTTCTAAAAATATATAACTGTCTTCTGC 

GTNCCMGGATMTTNTTTTTATTMTATGGGGCTTCTTGTGTCTACTTCCTCCCCTTTC 

GTTATTTCTTCAAMTGTTTAACAMCTAATACATTTCAGMCACATTATGCTTNCATCT 

TGTCACTATTTGCAGTACCTTGTATCTCCTGGACTTTATGCAATGCGTGTGTGTGCACAG 

ATGGAATATGTTNCACCATTTGGCT 

Sequence 226 

GCAGTGCCGGCGGCGCAGTTGGGAATGGGAGTGCGCTTGCACAAGCGCCGACGAGCGATA 
GCGACGCTATGCCCTCTCTTGCCCAGGCGCCGGGTGGCGCGGATCAACCTGCTCTCGCAG 
CAGCCCCTGTGGTGGCCGCCGGTGATGATGCAGGAATCGTATTTCGACGGTGGNCAGCTT 
CGCTTTCGATCAACATCTTCGNCGAAGTTCCGCGTGAACCGTNCGGNATCGACATCGTGG 
CGCTGGCATCTGCCGCTATACCAATCATCGAAACCGCGCTGGAAGGTCGGGTCCAGCGCC
CTGCAGAAGAGACGCCTCCAGCAGGACAAGGTGGCCTGCTCGACGAGCAGGCTACCCGAG 
TGTCGTTGAACAAGTATCGCCTTGGGTCT 
Sequence 227 

CTGACACCCAGTATTGCCMCCTMGGACTAGGAAGAAAGGGCTTACATGTTTCTTGCTT 

TMTAATATGTGAATTAGAAMGGTATTCAAGAAGATACAGGGTTAGMTGTGCATCTTT 

CTTTTATGTACTGAAATGTTTCAACTCATATGCAGCAATGTGGTNACCTTGAAAGATGAT 

TCCAGAGATTTCATGCTTCTGGCAGTTTCAGGTCACATTGGGGTGAAATTTGACATACTG 

TAGTCATCCCCAAGGTAATAATGTTGATGAAATGGTATAAACCCTGATACATTCTTAAGC 

MAATGAATCTMAAGTTTGTTTCAAAATTTTTAACGTTTTATGTTGCTCTGACTTTCCA

TATACTGATTTTTACATTACTTAGTGMAAAAMAAATTACCTTTTAGCTTNTGGCAACA 

AAAACATTTTTGGCTATT 

Sequence 228 

CGNCGCGTCCGCAAGACAGGAGATTTCTTAAGGTCATATATGCACGATTTGCTTAAACGT 

CATATAGCATAACTGGAAGAATATAGAATCTACCCAAAGCCTATGCTCTTTCTACTGNAT 

TATTTGCTTATAATAGAGAATGCTAATCAGAATCACCTGAATAGTCTTGTAGAGGGTCAG 

TACACAAGCCTAGGTCCTATCCTTCAGAGAGGTGAAATGGAGGCAAATATATTCTGAAAG 

AGTTTCCCAGTTGATTCTGATGAGCACTCACAATTAAGAAGTGCTGCATTAAGAGCTGTA 

CATGGTGGCTTATGCCTGTAATCCCAGCTATTCCTGAGGCTGAGGCTGGAAGATCACTTG 

AGCCTGGAAGATCACTTGAGCCTGGGAGTTGGAGACCAGCCTCGGCAACGTAGTGAGACC 

ATGTCI I I I I I I 

Sequence 229 

TTGGGAGATATGGCANGGTGAGAATGTTTGGGAAAGGAAGTAGAGACAGACATTGGA

TTGTCAGTCMNTTTCTTTTGATTTTAAATTATTTATTCATTACATTGTCTA 

TTAATATTCTAAAACTATAAACAAAATTTAATATAGAAAAAATATTATGCTTTTCTTTTT 

TGCCTTTGCATGCTCTTATTTCMAATAGATTTAAAATTCATGGCATATTATACTAGAAA 

ACAAGTCTGTCAATGATCTAAGCTTCTATCTTTAGATACTAGGAAAAAAGAGAAAAACCC 

AAAGCAAGCAGAAGCAAGGAAATAAGAACAGAAATAAATGGAATTGANAAGAACACTTCA 

GAAAATCAGTCAAACTGAAGCTGATTTTTATAAAGATTA 

Sequence 230 

CCACGCNTCCGAGGAAACTGGCAGGGAAACAAAGTATCCCTGGAAGGAATTTCTTAAAGA 

GGAAGAGGCAGATCCCTACAAGTTTAAAATCAAAGAATTTGAAGATGTTGATCCCAAAGT

GAAATTGAAAGATGGACTTGTGAGGAAGGAGAAAGAGAAGCATAAAGATAAGAAGAAAGA 

TAGAGAGAAAGGCAAGAAAGATAAAGATAAGAGAGAGAAAGAAAAAGTGANAGATAAAGG 

CAGAGAAGATAAGATGAAAGCCCCAGCACCCCCACTGGTGTTGCCCCCAAAAGAGTTGGC 

CCTGCCCTTGTTCAGCCCTGCCACAGCCTCCAGGGTCCCAGCCATGCTGCCATCTTTGTT 

GCCAGTGCTTCCGGAAAAACTGTTTGAGGAGAAAGAGAAGCCGAAGGAGAAAG 

Sequence 231 

NCTAGACTCCCCTCTCGTATCATGGATCCCAACATCNAGGNATATGGNCATTTACGTGTT 

GGGATCTGCTCTGCCATTGNACACAGCTATATTCNA7TGCCCCGGGNGTTGTGTATNTTT 

CCAAAAACGTTGAAAGGGAGGTTCAGAAGTATNCAGTTATTNGTATTATTAGTCGTTTTG 

AAACTGAGTNGAAAGACTCATTNANGAAAGNTCCATATGCCTTCTTGTCTGTCTATGGCT 

GGNNTGCTCNNGAGAAAAGTCCNCANTTATACAATTGTA 

Sequence 232 

TTTCCi I I i 1 1 IGGGGGCGGGGTCGCGCTCTGTCGCCCAGGCTGGGGTGCAGGTGGCGCG 

ATCATGGCTCATTGCATCCTCAAATGCCTGGGCTCAAGCAAACATCAGTTTTCTTATCTG 

TGAAATGAGGATAAAAATGTCTCCACTTAAGGGTTGTTGCAAGGAAGGTGTTGCCTTAGT 

CATAAAAGCTAGGGAAGGTGTTCCTAACGAAAAACAATTCGTCAGAGACATGAAGGTAGA 

GGAAGAATTCACACATGAAGGGGGCTGGGGAAAATGATTTAAGAAAAGAAACAGGCCTGG 

CGCAGTTGCTCAGGCTTGTGATCCCAGCACTTTGGGAGGCTGAGGAGGGTGGGATCACCT 

GAGGTCGGGGAGTTCTAGACCAGCCTGGCCAACCGTGGTGAAACCTTGTCTTCTACTAAA 
MTACMAAANATTGGCGGGGTGTGGTGGCAGGTGCCTGTATTCCCAGCTACTTTGGAGG
CTGGGACGGGATMTTNCTTGAGCCCCGGAGGCGGGGGTTTCCGGTGAGCCCGGGATTGC 
GCCCCTTGCACTACAAGCCTGGGGCACANAAGCGAGGACTNTGTCAAAAAAAAA 
Sequence 233 

CCACGCNTCCGGTCTCAAATCNCCTCAAGTATATTCAAAATTTGACCACTTCTCACCAGC 

ACCACTGTCATTATCCTGATTCMGCCTCCATCATCTCTCATCGTTACTGTGACCTCCTG 

ATCATTCTCCTTGCTTCAGCCCTGGCCCCTGCAGGCAGCATTCAGTATTAAGCAGTAGAG 

TTGTTTCTATAAAAATGTAGTCAGCTGGGTGTGGTGTCTCACGCTGTAATGCCAGCACTT 

TGGGAGGCCAAGGTGGGAGGATCACTTGAGCTCAGTTTGAAGACCAGCCTGGTCAACATG 

GTGAAACCCTGTCTCTACTAAAAATACAAAAATTAACTGGGCATGGTGGCGGGCACCTGT 

AATCCCAGTTACTGGGGAGGCTGAGGCAGGAGAATCGCTTGAACCCAGGAGGCAAAGGCT 

TGCAGTGAACCCAGATCACACTACTGNTTTTNCAACCTGGGCACAGAGTGAGACTGCCTC 

AAAAAAAAAAAAAAAAAAA 

Sequence 234 

TCGAGGTCAAGGACGGTTATGGCCCGTAACTACCTGCTGCCGCAGAATTACGCCATCAAG 

TGGACGCGCGGTGCTGAGGCCCAGATCAAGGACATCACTCGCGCCCGTAAGGCTAAGGAG 

ATCAAGTCCAAGGAGGAGGCTGAGCAGATCCGCTCGCAGCTGGAGCACCTGGTCGTCCAG 

GTGACTGTCCAGGTCGGCGAGAACGGTCGTCTGTTCGGGGCCGTGACTCCTGGCGATATT 

GCGCTGGCAGTCAGGAAGGCCGGTGGCCCCGCCCTCGAGAAGCGGTCCATCGAGATCACC 

AAGCCGATCAAGACCATCGGCAAGCACACTGTCCGGCGTCAAGCTGCATGACGCTATTAA 

GGGTCACGTCACGGTCGAGACTGTTCCCGCCGCGTTGATTTGACGTACACGCAGANTAGG 

GGGAGGGGCATCCAACCTGGGTGCCCTTCCTTTGCTTNCGTCACCACCGGCGAAAGGTAA 

ATGACCGAAAGTAATTCTTATTACNGTGCTTAGGGGGGTTTCGCCGCGCCGGGCCGTTAG 

ACTTAGTCTAGAAGAAAAAACCTTCCACACCTTCCCCTTGAACCTGGAAACATTAAAATG 

AATGCNATTGGTGGGGTGGTAAACTTGG 

Sequence 235 

GNGGTAGCTTTGTGTATGTCGGGCACTTCNAGGAATAGGGTGCAGGAGAAACGTCTCAGT 

GTCTCCCCTTCCGAATCTTGGCTTCTGGAGGGAGAGATGCTGGGGTGGGAGTGCTCCTTG 

GTGGAGTACTCAGGAGCTTAGTAAAAGCAGAGGGGGCTGGAGAGGCAGGCCTGGCCTGCA 

GAGCCAGCATGGAGAAGCCTGGTGTAGGGCTCTCCAGCCTGCCAATTTACAGTTAAGAAG 

AAAGGAGATATGTATATATATATATACACACACATACATACATACATACACACACACATA 

TATACACACATATATATACACACATGTATACATATATATACACACACATACACATATACA 

CACACATGTACACATATACACACATATATACGTGGGGTGTGTATATGTATATATGTGTAT 

ACGTATATA 

Sequence 236 

AACTGTCTAATCTCTTGACCTCGTGATCCCCTGCCTCGGCCTCCCAAAGTGCTGGGATTG 

CAGGCATGAGCCACTGTGCCCAGCCATAC I I I I I I 11 I I I I I I CTGGNGCATTTCTGAAG 

TTAAATATGTTGAGTTCTCTGCCCATTTGTTCAAATTCTATTGNATTATTTCTGNNGGAT 

TGACAAATGTTATGAACAATTTGGTTTCMAATAGATTTTTCTATTTAGCAACTTTCTAT 

TCCTCTAGGACCTAGTATTAAGGTCATTAAGATGGCTTAATGAGTCATTAAGCCATTAAT 

GAGTC ATTAA GATGGCTGACCAATCTGTGCCATC I 1)11 I AGTGACATTTTGTTTATCAG 

TGTCTCTTTTGTGTATTTCTTTGATTTATATCTTGACAGTATTACATAAGCAGGAATAAA 

AGAGACTTTGAGTGGGAATGTCTCGCTCAAATATAAATAACCTTATAAATACAATCTTTA 

CTTCTCAGAGTCCTAMATCTTCTTGGTTAGTCTGGTCTTCCTGGGTTGATTCAAGGTTC 

TCGGNGACTAAATGGCAT

Sequence 237 

TGTTNACCTTTGGGATTACACAATACTTGCAATCCAGCTCTGCCATGGGGGACATCATTA 

CACAAAGCACTCCTCACCAGTAGTCAGATGGCACTTGATAGCTACAGGCATGTGAATATT 

CTTTTACTGTCAACTTTTGTTGTGATTTCACCTAAAATATGATAAGCTCCTTGAGGATAA 

AGGCTMGTCTTACTTCTTCGTATTTCTGGTAACCGCTTGTACCAAACACCTGCCAAGCA 

TTGTTGATTGCACCTATGAGGAGGTGAGAAAGTGCCAGGCCGTCATGCCCCTCTATACAA 
ACATGTAACAGTAGACTCTGGTTCATCAAGGAGTAAATGACATQATATGTGATAGACCAC
ATACATTCGTGGCTATAGATAGGGTGAGTAACTGTAACCATATCCAATTGTGAGCAAAAA 
GTGTTTMTTGGGGNGGTTTTAAACTAATATAAMTTGGCTAMTCATCATTTTT 
TCTCGAATCATCTGAAGACC I I I I II IAI I I I I I AANANTTAGGCC 
Sequence 238 

CGCTCCGTTGCAGTGAGCTGAGATCACGCCACTGCACTCCATCCTTGGTGACAGAGCANG 

ACTCCATCTCAAAAAAAAAAAGAAATTGGGTTGTCTAGGCACAGTGGCTCGTGCCTGTAA 

TCCCAGCATTTTGGGAGGCTNCNTAGAANAATCACTTGAGCCCAGGAGTTTGAGACCAGC 

CTGNACAACATAGTGAGACCCCATCTGTACCAAAAAAAAAAACACCAGACCTGGTGGCAC 

ATGTTTGTAGTCCCAGCTACTCAAGGGCTGAGGTGGGAGGGATCACTTGNNCCCAAGAAG 

TCAAGGCTTCAGTGAGCCATGATTGGTGCCACTTGACTACAAGCTGGCAACAGANCCAAG 

ACCCTNTTTAAAAAAAAAAAAAAAAAAAAATNGGNTTGGGTTATAGGNGAAGTTNCCTGG 

MMCCTTACAAAACAAGAACCATGTTTGGGGGAANCAANTGGANGGGGTNAAANTGNCC 

NCTTGANGCTTNTCCTTTNGNGGNCCNTAATTAAMANGTTTCCCNGGTNTTGGCCCNGG 

GNCCGGAAAATNCCTTNGAATTTCTTTAGCCCCTTTTGGGGNGGGCAAAAGCCAGGGNGG 

MTCCCNCNNGGGNTCAGNGATTTTNGNAACCNNACCTTGGCCAAAAATNGGGGAAAACC 

CCNG I II I 1 I I 1 NTAAAMATMCNAAAMNTTTNCCCCCTTTNCTTTNGGGGGGGCCCA 

CANNNNCCCCCTTTTTTNATANANAANAANAANCCCCCCCCCCCCCCCCCC 

Sequence 239 

CATCTCMAAAAAAACTCACAACATATTTTATGTACACCTAGTATCAGTGAGATTTTTTA 

ATMTTAGATTTATTATCTTCCATATTGNNTACAAAACTTTCTTCAACTTCATAAAAAGT 

CAAGTAACATACTGTACTTACTTCTCTCACCTCATTTTAAAATAGTGGATAAATTGTCAG 

ACACAGTGACTCTGCACCTGTAATCCTAGCTACTTGGGAGGCTAAGGTGGGAGGATTGCA 

TGAGGCCAAGAGTTCAAGACCAGCCTGAGCAACACATAGAGACCCTATCTCTTTAAAAAA 

AAAAAAAAAAAAAAA 

Sequence 240 

TGTCGACCCACGCGTCCGCGCTCCTGTCATCTCCCTTGGGTCTTCATTTAAATGCCACAC 

CAGAGAGGCCCTCCCTGGCCACCCTAATGAAAACTTCAACATCCTCAACCCTAACATTTC 

CTGTCCCCTGGGTTATTCCTCCCCTTGGTATTTATCACCATTTAATGTACTATCTGGCCG 

GGCATGGTGGCTCGTGCCTGTAATCCCAGCACTTTGGGAGGCCCAGGCGGGCGGATCACC 

TGAGGTTAGGAGTTTGAGACCAGCCTGGCCAACATGGTGAAACCCCATCTCTACTAAAAA 

TACAAAAATTAGCCAGGCATAGAGGCATGCCGCCTGTAATCCCAGCTACTTGGGAGGCTG 

AGGCAGGAGAATCTCTTGAAACCGGGAGGCAGAGGTTGCAGTGAGCCAAGATCAAGCCAC 

TGCACTCCAGCCTGGGTGACAGAGCAAGACTCTGTCTCAAAAAACAAAAACAAACAAACA 

AACAAACAAACAAACAAAAAAACCCAACAAACCCACCAAACATACCATATTTGGTTTCTG 

TMMAAATAMAMGAGAGAGAGMTTTTTAAAAAACAAAAACCATACTATCTAATTTA 

CATTTTTAATCT 

Sequence 241 

CCCCGCGTCCGCTAAGGCATGTGAGCGCCTTGTAGGGCACGTTCTGTCTTCTGACTACAT 

MGCMCACTTTTGCGACCCTAAGTCACAGCACCCACAGCTTTTGGCAGGATACTTTAAA 

ACAGAMMACACATCATTGATCCTGGCAGGATTTTTTAAGGGATTGTGCTTAAGAATGT 

TTMCTTTGGTMTCAGAGACCACCACTGGTGTTTTTCTCCAMATCACATCTTTAAGTT 

AATTAATACTTGAAGTTAATAAATAATACTGCTCAAGTGTATTAGTAATGATGCCATAAT 

ACC ATGT GAATTTATGCTGATTCAAATGTTG I I I I I I I CGTTTGATACTCATATGGCCTT 

CTGTTTTGAGGACTTCAGATTATTTAGCAACTGATTTAATCTGGTCAAGAMTAAACTTT 

GCTTCAGCTGGAAAGCGTGAGGCTTGAGAAAAGCAAGGTTTTGGACAGGGGACCTATGAA 

GCTCATGTTGAMCTTAAGTGTTTTAAGGCTGTATGGGAACTTGGAATGGGAGTGAAAAG 

AACCAAGCCGTCTACTGNCAAGGTTTTTCCCTCCTCCCTCTAMATTTATAAACCTCATT 

CTTAGAAGTGGCAAAMGTTGGGAACCTTTTCCACTGNTTCACTTCTNTTANTNAGGGGA 

TTAMGGNGGATNGGNMANGGMGTTTTCNTGGTTTTTTAAAAAAAANNGNANAANGGG 

GGGG 
Sequence 242

GNCACGGCGGGGCAACCTGGTTCCCCTGGGCGGTAGTCCCGGCAGCTGTGCTTCCCCCTT 

CACTGCAGTGCGTGTTCAGTTGGAMCACGGAGATTTCACCTTTACTCCCTGGCTATAGG 

GGGATGGTGAGAANGGGGCTGCGTGTNAGGNGANATTCAAGGCATAAATCTTAGNTTGTC 

TTTTAAAGAATTGNGTGCCGGGACGTGNTTGGAGGNTGGGTGCANANAGGAAGAGGCGAN 

CTNCTGGAGGANGCAAAGGACAGGTCANTNTGCGNANTTCNGNNNGATGCTGCTGANTGA 

ACCCCNGNTGCTGTNGCCCCAACAAGAAAGGCTNNGCTNNGATCAAACATTTACTGNACC 

TANAAGGCCACATTCTNGCTTAANNANNCCAGGAANGCNCNTTCTTGTTNCCCATTACCA 

MATCATTNACACTTGGTTTTTGGCCCNACATACCTTTCCTTTTNGACCANNGTTCTGGN 

NNAAANGGCANTANACCNNNGNATNTNNCNTCATNACANGAATAAAAAGCCCNCAGAACN 

CNACANANGNGCCCATCACTANNNNTANAGATNTCAAAAGGGGGCCCNGGATCCTTNACC 

ANAGAANACTCTGGTCCNNAGNGNAAGAAAGNAAAACCCCNCNANANGCACTGGGACNNT 

NTNATANTTCTNNTTNAAACAAMNCTTATAANNNAAAAAAAAMTTTTT 

Sequence 243 

AATTAATAAGCCTCCTGGGTGATTTGGCTGCAGGTGGTCCTGAGACCACACTTTAAGAAA 

CACTMTCAAGAATCAAGGCCTTATGACTCTTAGTCCTGTTTCTTCCCAAAGGACTATGT 

TGCCTCCCTACATTCGAAAACATTCCAAACAATACATAAGAGTCTTTGTAATTCACATTA 

GTGCTCGTATTTACTGACTCMCMTTMTTCCTGATTGAGTCATTCATTGAGTMTTTT

CTTTTACTCACTCAGCCTCTGTGGTTTGCCAGGAGCTACAAATAAAAGATTAAMCACAG 

TCACACCTCCAAGGAACTCATATAATAGTAACAGAAGTAACTACCATTTCTTGAGCAC7T 

TATTTATACATACCGTCATACAGTTTATCTCATTCACCTATCTCATTCCCTGCCTTTCAG 

TAAGCCTGTATACTATTATAACCGCATAGTTTGGGAATATTTCATATATATATATATATA 

TATATATATATATATATATATATATATGTATAATCACATATATTCTCTTCAACTTGTNCT 

TTACATTTAATATGTCTTAMAGTATTTCTTTGTCAGTTCTTGCTTTATATATGTCTAG

ATTTCTTTATAAGGGCTACAAGAAGTATCCCTCAAGCATTGGATATTTCAATCAAATTAC 

CCTTGGGGGGGTNGGGGGAAAAAAATNNGGGGGGGNTTT 

Sequence 244 

TCGCCCCGCGTCCGGGGAAGGCTAAGGCGCGAGGATCCCTTGAGCCCAGGAGTTCTAGGC 

TGCAGTGAGCTATGATCACGCCACTGCACACCAGTATGGGCAACAGAGCGAGATCCCATC 

TCTAMCAAATTTAAAACAAACGAACAAATGAAAAATATTGCTGTCTTAAGGTTGGGAAG 

GGGCAGAGACCCCTTTGCTTGCTCATCACCAAGACACTTCTGTGAGGCCCCAGGGCTCTT 

TGGAGMCGTTTTGAAAATCACGGTTCTAAGTAATTATAGTTACTGTGACTGAACTAATT 

TAGCCCTAAGCTTCCTACAATCAAGATAGAGATACACTATGGACTGCATTTCTCCGCTTC 

AGATTAAAAAAAMAAGTTTAAGTCAGAATGTAGTTATATTTTCAGGTAAAATGCTCAAT 

ACATTTTCAAGATGAAGCTGCTCAAAATTAAAGCAGTGAGTCCAAGGGTTAATCTGNAAA 

AAAAAGTACAATTTACTATCTCCTGGTTNCANACTTATAGACCCTCATAGGTGCATTGGC 

TMTACMGGGGCCACTAAACACATTGTGGCATTTACNGGATTTTATTGGTGAAGNGCTC 

TATMGTTTTATTGGTGCCAGGTAAAAGAAANGCCTCNTATAAAAAAATGGTGGNGGGGG 

Gl I I I I I I I I NCCCCCCTTTTT 

Sequence 245 

TTCCGTACCTMGAATTGTACCCTTTCATAACAGCACCACTTGGATATGTAGAAAGAGTT 

TGTTGTGAGATCAGATGTAAAACAAATAAAAGTTATTCGTGAAATGATATGGAAGACTGG 

GATTAGMACTGTGGCATTCMGAAGCCAGTTAAGCTGTTCTCAGAATTGACAGAGATTC 

TAGAGATGGTGTAGTGCAGATGGTGTAGTGCAGGGATTCTCAGCCTCACTGCACATGGTA 

ATCTCATAGGMAATTTTAACAAGGACAGATGCAGAAATTCTTATTAAACTGTCAGGACT 

GMGTCCAGAGGTCACTATATTTTAAAAAGGTCTGCAAGTGATTGTAACATGCATCTATG 

GAGGAAAACATCANCGTAGGAGAAAAAGGGAAAGAAACCAACTGGAGTAAAGGCTCTGTC 

TTGAGCACTGTGCTGGCTCCTGTCATTCTCCATCTCTTTTTATCTTCATAGTAAGTGAGA 

TGGGTTTGACAAATAGGGA 

Sequence 246 

GCGTCCGGNGTMCTTGAACNMAGTATTCTCCTTCTTCTGTATATTTGTTCTCAACCCC 
CAATTCCTCATAACTTTCTAAGTAAGTAGGGTGAAAAGAAAGTCAGGGCTGCCTAGGGAA

GCTACATTTGCTTCTCMGGCTTGCACCTGCATATTTCATGCCCGACATGTTGCTGAACA 

CAGCCGAGTAAAGTGCCAGAATAAAATCAGCTCTGTCCACGTGGTACCAGGAATGCTGCT 

GAAATGAAGGTAAGTATAGGATTCCACAGGATCTAAGACAGTCCAGGGAACTGGGACCAA 

TTGTGGGAAACCTCAAAGCTGAGCATATTTGAAGAAAATGCAGAGTAAAATGGCTATGGG 

CTGCCGGGAGGCCTGGATTTGAGATACCAAGCACCACTGATATAAACACTGATGCTCAGA 

AAMGMATGATGAGTGTGTATTTGTGTCCCTTGAGACCTMTTTAATTTATMTTATTC 

ACCAGAGATAAGAGGCAAGGGATGTCTACTTGCTGGATATCTGAATTTACACAGTTCTCT 

AGTCATTAGTATTTTTTACAMTATACCACTCAGACTAGGGGGGCAGAGGATCTGGGGTT 

TNAATTTCAAGACCTGTCACTTATCTGCATGAATTTTGNAGCTTATAAGAACTTACAGGT

TAACCi I I I I I 

Sequence 247 

TGTCGACCCCGCGTCCNTNGTATTTATAAANATAATNCTGNTAGATAAATAAGTGATTCA

TATTTTGTCAAANCTATTTTMAATTTCAATATTTAAMTATTNTTGAATCACTGGGTGT 

CGNTAAGTGGCATCATNNATGAGATTTGATTCCATGTACCATATAATNTTAGATTGGTCC 

TNTCTCACCCCTTTTAMCTCCTTCMGCATTGCTATTACTGGGGTTGCCTTTGGGAAAA 

CTTACTTCTAGATACTACCATATATCTGAAATAGTAGAGGTGGATGTTAATAAAATTCAT 

AAAATNATCATGTATTACTTTTTTTGATTTACCACTGGAAGGAAATACAGNCATGTGCAA 

TATAATGACCGTTTTGGTCATNGAGACCCACATGTGTGACAGTGGTCCCATAAGGATGNG 

GCTGAAAANNTCCTGTTGCNGCCTAGTGACACTGTAGCCATNGNAACNCCATAGCACGAC 

ACGTNACTCACCTNTTCATGGTGATGCTGGTGT 

Sequence 248

CCCCGCGTCCGATTTTGAATGTATTGAGACTAAAGTTCCTTTGAAACATTAAAGAAGATC 

CATCAAACTGGCAACTGTATATAAGGTCTGATATCTTCTGGCACTCAGAGGGAGAGTTTC 

TGGTGGAGGTGGAAGTGACTTAGGAGTCATCCATGAACATAAATGAGATCACTCGTGTAG 

GGAAGAGTTAATGAGATTAGAGAGCCTTGGACAGAGCCTTGCAGCCCAAACCCATGAAAG 

AGCAGTAATAGTGAGTGAATTACCTTCTGAGGCAGTGCAAAGTAGATAAGAAAACAAAAN 

CTNNAAAAAAAAAAAANAANAAAAAAAAA 

Sequence 249 

CCGCGTCCGCAGTGGTNGTGATCTCGTCTCCTGCAACCTCTGCCTCCCAGGTTCAAGCAA 

TTCTCCTGCCTCAGCCTCCCAAGTAGCTGGGATTACAGATTATGTCTTGTGTTANNGAAA 

TCATTCMTTTTTCACCAGAGAATAAGGAAAAGCAAATTAGACTGTGAATAAAATCTCCA 

TGTACCCACTAGGATGACTACAATTCAAAAGGCTGACCTATCCAGGTGAAGATGAGAGTG

TGCAGCATTGGAAATTCTTACACATTGCTGGAGATTTAGAAGCTAAAGAAAGAGGACAAA 

TGATTTGGAAGCAAAGATAAGAAGGAAGAAGGAGATGGGGTCTTGCCATGTTTCCCAGGC 

TGGTCTCAAACTCCTAGATTCAAGTGATCCACCTGCGTTGGCCTCCCAAAGTGCTGGCAT 

TATAGGCATGAGGCACCATGCCAGGCCTGTTTTGAAATTTCATACATATTTATAAAACAT 

ATTTTCAGGATGAGAGAATATACCAAGAAGTTACATAGCATTGTGTATCTGTATAGMTA 

AAA 

Sequence 250 

TCGACCACGCGTCCGGGTGAACGTGGTCACCAAGGCCATGGGTACCCTGGGGGTGAGCTT 

ATCCTCCTGCAGCGTCCCTGGTTCCAAACCCACCTTCGAGCTCTCAGCCGACGAGGTGGA 

GCTGGGCCTGGGGATCCACGGGGAAGCTGGTGTGCGCCGGATAAAGATGGCAACCGCCGA 

TGAGATTGTGAAACTCATGCTCGACCACATGACAAACACCACCAACGCGTCCCATGTGCC 

TGTGCAGCCCGGCTCCTCAGTTGTGATGATGGTCAACAACCTGGGTGGCCTGTCATTCCT 

GGAACTGGGCATCATAGCCGACGCTACCGTCCGNTCCCTGGAGGGCCGCGGGGTGAAGAT 

TGCCCGTGCCCTGGTGGGCACCTTCATGTCAGCACTGGAGATGCCTGGCATTTCTCTCAC 

CCTCCTGCTGGTGGATGAGCCTCTCCTGAAACTGATAGATGCTTGAAACCACTGCAGCAG 

CCTGGCCTAACGTGGCTTGCAGTCTTCATTACTGGGCGGAAAGCGGAGCCGGGTAAGCCC 

CTTGCCGAGCCCCAAGAAG 

Sequence 251 
GTCGCCCCGCGTCCGCAAGAAATGATGTTAGTAAAGATACTTTAATTGCGGGGAGTTCTC

TGTTGGCTGGTGMTAAGGACGTTCTCTTTGAGTTCCACCTTTCTTTGAGAMTTTTCAT 

GTTTCCTGTACTTTTCCTCTATTATATCATGGAAACTTACTAAATCAGCCTGTGCATTCC 

GCCCACAT ATTCC CTCATGGTGCCTTCCGGTCCTAGCAGCTGAGCCAGGGCCATCAATAA 

ATCTTCTTCTTTTCCCCCAAGATATTCCTAGTTGCTTCmACTATTCATTCTTGTM 

CCATTCATCTGTGTGTATCATTTAGGATGCTGGGCAGTTGCATCGCATTTCACCTAAGGT 

CACATCCTGATGGGATTTGGTGGGATCCTAGAGCTGGCTAGGACCACTTGTGAGAGCTGA 

TTGTTAAATTCTCCGAAATGACGTGACTCAAGTTGTTAGACTGCTTAAAGG 

Sequence 252 

CCGGTGMA CCTG GAGCTGMGTGAATTCTCTTAGAGTATATT7TGAAACTGTACTAGGA 

CTTTAAACACTTTTGGAATTTAAAACAGCCATAAAAATTCTTGTTATACTGMGGAGTTC 

CTGAGGCAGTGTGCCTCTCATTTTACCACCTAAAGTTGCCATAGAGGTCCAAGGAGACAC 

TGCTGATAGCAGAAAGTCTTCCAGAAAGAAATTAGGCGACCCACACCAAGCATGTATGGC 

TTTGAGTCTTACAGATGGCTTTTTMTAGTTTAGTCTCTTAACCTMGGAAG 

GTTCCGGTCAGAGAGTCTAAAMTTCACATTTTACCTAATAAATGATAATGAGGCTATTT 

ATCTTGTCTGTCTGGATTTT7TCACTTGACATTTAATGAAATATCCCATATTACCTATAA 

TTTTTATTTGAAG 

Sequence 253 

CCCCCCGCGTCCGAGATAATGCTGTTTGCTTCCGGCCGCTGTAAATCATAGGTGAAAACC 

AGTAG CANG TGCTCACTCAGTGCCTCCCAGAAGCGGTCTGCGGGTCTCAGCTGGGCTGGG 

GGCAGTTTTCATTGGGCAAGGCTTGGGCTTAGCTTCGAAGCANGGGCTGGGAGAGGATGG 

ATGGGGGTGTGAGAGCAAAAGAAAGACCTGGCTTTGCAGTGATGGCANCCCACGTTCAAA 

TNNNAGCTCACCACTGACCNGTCGNNTGACGNGCGCCAGGNGTTAGGAGACTGNAACTGN 

TTNJTGNGTNNNGNNTCCGGNCGTNCATNNNNNCTGCTCAGCATACANANCCTNTTNCTNA 

TCNTAATCCTCATACNCATGNCTGNNNACTNTACACTGTTCTACTTATCAATGACAGGTC 

AAAAGTGTTATCATNTGTGACNTAGAATGAGTGAAGTACACNCCCTCTTGGAAACTATGA 

ATGACTTAAAGAATCACCNTTTGCAAAAAATC 

Sequence 254 

GCCTTCGCCCCTGCCTCCTCTTGGCTGCGGCTGGTCATCTTCCCACCTCCACAGTGGGCC 

GAGCTGCCAAACAAGGGGTACACATGTGCCCCCTGACATAGGCCAGGTGGTCTCTGCCCT 

CTGTAGCTCCCATGAGAGAGGGCTCCTCGGACCGAAACAGGAGGCCACTGCCCTTGCGCA 

CACACCGTGGCCGGGTCCTCCTGGGCGGAGCGCTTTCCGTGTGTGGGAAAGTCAAGGGCA 

GCCCCGAGCCTCGAAGCCCAGGCTCCCAGCCCCGCGCCATGTTGCATTCCCGCCTCTACT 

CCTTGGTAGGCTGGNTGCTTTGAGTGGTTCTTTTTAATTCTTTCTGTTGGTTTCT 

TCCTTTGCCTGGGTTTTGCTTTAACCTCTCTGTTGCAGAGATGCAGAGCACTCAGAGAGC 

CTATTTCTATCATCGCTTTCCTATTCTCCACCTAGAACCAGNTGACTGGCCGCCCGAGTG 

GNGTCTCTTGTGTGTGTGGTGCCGTCAAAGCTGTGGAAAGAAATGCTTCTGCCTAGGTTT 

CTTCGCGCCCCCCCCTTGCTTGGCTTTTCCTGCCTGCTTACACCCCCGGTTCCTGATCTG 

CCCTGGGC 

Sequence 255 

GCCCACGCGTCCGCAAGANAGCTCCTCAGATTTGTCATAGACTATATTTAAAGAAAGGCC 

ACATTTTTCTTATTTAAAATGCATTAAACAATGCANCCAATTAAAAGAACTGAGNTGGAT 

TTGTACAAAAGCAGGGACTAGGTCTGNTTTGTTCACTGCTATATTCCCAATGCCTAGAAC 

CATGTCTGGCAAACATACTGGCATGGGAAGAACATTTCCATAACCCCTGAATGTTCTGTG 

C CCCT TTCCAATTAATCCCTACCCTCAGAAGCAACCACTATTCTCATGCTTATTACATTA 

GTTTTGCCTCTTCTTGACTTTCATATAAATGAAATCATACATCTAANAAAAAANANAAAA 

AAAA 

Sequence 256 

TCGACCCCGCGTCCGATTTGATATAAATAGTTATGTTACTCATATAGAAATCTCTTCCCC 

ATTACACACATACAAACATTTATCTATGAGTGGCTTATAATTGCAAATAAGATGTAAATC 

ATGCTCATGATCATTGTCAAAATTGTGAMGATTTTTTTCTATACCTCTTTTAGGTTTGT 
TTTGTTTTTGTTTTTGATTTCCAGGTGGCATTAMGACAAGAGGGAATMTATTCAT

TACTTCTACTCCCMGTCACTAGTTTGCTGAATTTMTTGAGTTAAAGAATTGTATCAGT 

CTTCTTGGMGTCTMTACAAAACCAGTTCAACACTAGTTATTCATTCTTTGCTAATTCA 

CCAGAATTGAAGGATGGATAAAATGAGAAAGAGAAGTAGTTCTTCATATTATTAAAATAA 

AGAGTTAMTTAGACACTTTGTTGGACTCTTTGGTCTTAATAATTCCTACTC 1 I I I I GAG 

GTCCAMAGTTTTGTCTTTGATMATATAATTTTAATGGG 

Sequence 257 

AAGTTGGGAAAATAATTCATGTGAACTAGACAAGTGTGTTAAGAGTGATAAGTAAAATGC 

ACGTGGAGACAAGTGCATCCCCAGATCTCAGGGACCTCCCCCTGCCTGTCACCTGGGGAG 

TGAGAGGACAGGATAGTGCATGTTCTTTGTCTCTGMTTTTTAGTTATATGTGCTGTAAT 

GTTGCTCTGAGGAAGCCCCTGGAAAGTCTATCCCMCATATCCACATCTTATATTCCACA 

AATTAAGCTGTAGTATGTACCCTAAGACGCTGCTAATTGACTGCCACTTCGCAACTCAGG 

GGCGGCTGCATTTTAGTAATGGGTCMATGATTCACTTTTTATGATGCTTCCAAAGGTGC 

CTTGGCTTCTCTTCCCAACTGACAAATGCCAAAGTTGAGAAAAATGATCATAATTTTAGC 

ATAMCAGAGCAGTCGGCGACACCGATTTTATAMTAAACTGAGCACCTTCTTTTTAAAC 

AMCAMTGCGGGTTTATTTCTCAGATGATGTTCATCCCGTGAATGGTCCAGGGAAGGAC 

CTTTCACCTTGACTATATGGCATTATGTCATCACAAGCTCTGAGGCTTCTTCTTT 

Sequence 258 

GAGTCGACCCCGCGTCCGCTCTGGAGGMGCATAGATTAGMTCATGATTTTTATCTATT 

TTAAGAGAATAGAAGAACAGAAGGGGTTACAATCTTGGAATATTATGCAACTCTTCTGCT 

CTAATATATCAAAAACTTGATGATCCAAGATCATGCAGAACAGCTGAGAAGAAATCAAAG 

TAAACAGTGTACCTTGCAGCCAACAGATCCTGCCAATATGAGATTAGAACTCTCCATCCT 

AGCAAAAAAAAAAAAAAAAAA 

Sequence 259 

CTGGTACCTGCGAGTCGCTGCAGCAGCTGTGGCAATTGTCACCTTCATCCAGGCCCATCC 

CGCTTTGAGGGCCTAGAGAGAGTGGGCCAGAGGTTAACCCCCGATTCATCTGCCTCCCCA 

CGCTGGGCATCTGGGTGTGCCAGGGCATTCCCCCGCTGGTCAGACAGGTTTTTGGGCCAG 

GGCGGGGCTGACCAGGGTTAATTAGAGGGAACTGGCTAGGAGGAGCTGGGGAGGGGGCTG 

GGGAGAGTCCAGGCCTNCAGAGCCCCTGGGACACAGCAGGTGTGTGCTGCCATGGGCCGG 

GGCTTGAACTCTGCCAGACTCAGGCGCCAAAAACGGCGCTTGCGACCTCAGGTCCAGAAG 

CCCCGGCAGCAAGCTG 

Sequence 260 

TCGACCCCGCGTCCGAAACTCTGTCTAGTCTAAACTATTATTCTATACTTCTCATCTCTA 

TATGTTAAGGATTGATCTCCAAGATAAATTG I I I I I I GTTTGNTTTTTAGGGACAGGATC 

ATGCTCTGTTGCCCAGGCTAGAGTGTAGTGGAACAATCATAACTCACTGCAGCCTCGAAC 

TCCTGGGCTTAGGTGATCCACCTGCCTTGGCCTCCTGAGTAGCTGGAATTCCAGATGCAA 

GGCACCATGCCTGGCTAACTTTTTMMTTTTTCATAGAGATGGGGTCTTACCATCCTGC 

CCAGGCTGGCCTCGAGCTCTTCACCTCAAGCAGTCCTCCTGCCTCAGCCTCCCAAGGCAC 

TGGTATTGCAGGAGTGAGCCACCACGCCCAGCCCAAMTAATTNTTTTTTAAAGCAAGAT 

GTAGAMAGTGATTATMTATGTTTCCATTTAGGCAAGAAAAAAAATGGAGAGGACTATA 

CCTGTACTCTCTGNACATAGGATCCACAGAAAACTTCTAATGGATGGTTATCCCTGNGGN 

GGGAAACTGGGGGACAGGGAATGATGMGCAGGMMTTTTTACTGGATAAACTTTAGTT 

CTGGTGGCTCTTTTTCTTCTATCATGNGNATGGTAACTT 

Sequence 261 

gtcccgcmagcctttaaamgagtccgmtttcacttttcaccttttgtagatgtgcac 

gtgtagctgtagagctcatacttacgtttcacatggcatagttgatggatatgtaggtgt 

aaagtttatggtagtggacaggctgagaatggtgtatctgtgacaaaaaatctgatggaa 

gtgatata tttga t atgam gtgmcattttcttagttgggtgtttataac l i i i i l i i g 

gtamttgtttttagtttttatccttatttTacttatgcttgg 

tccccaaat cttc ttctgaattccaaggamcactgttttagcatttatttgattact

ggttcattcttttctccactcccatttatttgttttccattttgtaacttctataaagca 
GATAAAAATCTGGAACTTCTAGATCTGACCTTCATGCCTTGTCTTTTCTATGGTACTTAT
TCTTTCTGTCTCCTTCTCATTTTGGATTGGGCTTATGAGAGMATCTTGGGGTTGATCTT 
CCAGCTC ACTAATTTGATATTCATTTGTGTCTCTTCAGTTACTTAGCTTGCGTGAAAACT 
I I 1 1 1 1 1 I CAGCAATTGTGTACTTAATTTTCATA 
Sequence 262 

CCNCGCGTCCGTGCCCTGGGGCCCCCCTGGGCGAGCATACCCTAGGCGTCAGGCCTGGAG 

GTCTCCTCGGTGCCTCCCAGCTCTTCTGTGCTCCTCACACTCTGCTCTGNGCANATTGGC 

TGTATTTGTAGGTTACTGCCTTTTATTTCTCACAT7TCTTT7TGGTGAGAGTATATCAAT 

CAATCAATCATCCTCAGACCTCTATGATAACACTGTGCCCCACACACAAGAAGCTACTCA 

ATTMTGTTTGTTTGTTGAAATGAGAGAAAATA7TTGTCCTTAGTACAGAAAGAAGATGA 

AGCCAACCTCTGATAGAAGCCCACeCATAGACTAGGGTGTTGAGCTGCCTTCCAA 

Sequence 263 

NCCCCGCGTCCGGTAGTTGCAGAAGCATGTTCTTGAACTTATCAGTCCTGACCTCAGATT 

TCATCTTCTTCTGGTAGATGTAGCATACATCTCTGAGTGTTATTAGAGGACCAGNCTAGA 

GCCTCATCGTACTCCTTCAGTTACTTCATAATCATCCAGCTCTTTATATTAAATGTTTTT 

CTGTTTAAAATGGCAGCMTGTTTTTTATTTTTTGAAATGGTCCCTGACAGCAACAGATC 

TTCTCGTGTTTAATTATTGAGTCTGTGCTGATTACACAGAATTAAGGATATAGTTTCTAA 

AGTACTTCCATTTTTATATATTTTAGCATTATTCTGMAGGCCTGGAAAAAMCTATTTT 

TTATT CGATTTGAAAGTGAAGTGACATAGGTGGGTCGCTATAGCAAGAAATTACCCTGTA 

TTTTTCCATCTCTATCATCACAGGCATCTCACAGAATTAGAAGTCGGACATTATTGATGG 

ATATATTAGTCATGAATAATTAAAATACATTAAATATAAAATGGGTCAGATACGGGCAGA 

TTT 

Sequence 264 

CNCGCGTCCGGGAGAAAAGAGTTTTATACCTAATTCTTNTGCAAGTGATTACATATTTTT 

ATA CCCA GGAT GTTCA GCMGATGMCTTTATTTTTTAGATGTATCTATGGNTTTTCCAC 

CCATTTTATTAATTTTTAGAAATATMAAGTGCCTTTAAAATTTAGCTGGGTTAAGATAG 

TAAGTTTTAGGCTGAGGCAGGAGAATTGCTTGAACCCAGGAGGCGGAGGTTGCAGTGAGC 

CTAGATCGTGCCATTGTACTCCAGCCTGGGCAACAAGAGCGAAACTCAGTTTCAAAAAAA 

AAAAAAAAAAAAAAAAAAAA 

Sequence 265 

GATCCTCTAAGTNCCCAATGATCNGAGAAGAAATATGAAAGGGAATTTTAAAATATTTTG 

AACTGAATGAAGATGAAAATGCCACAGATAAAACT7TGCATGGGGNAGCTACATTAGTTA 

GCTTAGAGGGAAATTTATATTTTTAAATTCTTATATTAGGAAATAAGTCTTAGATAAATA 

ATGTCAGCTTCCACCTAAGAAGTTCAAAATAAACCCACAGTAAGCNGAAGAAAGGAAATA 

ATAAAGCTGA GAAAA AATCAATAAAGTTTAAAAGAGAAAATTAATAGATATCAATGGAAC 

AMAG7TGCCT7TTTAAAAATATCAACAAAATTGATAGACCTTTAGCCAGAATGATCAAA 

AAAAAAAAAAAAAAAA 

Sequence 266 

CCCGCGTCCGCCTGAATTTCAGAGTCNGTTGAGAAGGTAGGGAGCAGGGATCTCTCAGAC 

ACCMCATGTTCAGCTTTATTATAGATAAGGCTGAATTGATTTCCTGGAATNGACCTTAC 

TTCCCCACCTCCAAACTGCAGGCCTTCCCCTTGCAGGTTCAGAACATGTTAAGGTAAGTT 

CACTGCAGAGATAAACAGGGTGAGGACAGGCCAGTCCTAAAAAAAAAAAAAGAAAAAAAA 

A 

Sequence 267 

CCGTTCI I I I I CCNAAACTAAAGAATGCATAGGACATAAGTTAAAAGTTCATACATAACC 
TGGCTTCAAATCCAGTTCTACCACTAGCTGAAAACATCAGTTTATTTCTCATCAATGGGT 
TGTTATAAAGTACCTAGCATAGGGTATTGCTTAAATGTTAATACTCCCAATCCTGACACT 
AATGTTTCAGGGAAGAGTGAAAGAAATCACATTAACTCCACATTATTGAACATCTTCTGT 
GTCAGGCTGMTACATCTTTGTATCCATTGTCTTCTGGTTGTTTAAAGACTAGTGTAGAA 
GCCTGACATGTAAATCGGTGATTATATAAGATAGTACTGTCTTGTAATGTGTTGTGCTAG 
AGAGGTTAAGTATTAATTATTATGGGAGCTCAGGAGAGGCACCCACTGGCTGCTGGGAGA 
AGAGCMAAACCTATATATAAAGCMTTTTTAAGATTTTGGTTCTATAAGGGTAGAATGT
TCGATCCATTAGTGTATTCCCAAAGTCCTATACACATGTCAAGATCGGGAMGCTCACAC 
ACACAATAATGCCCAAG 
Sequence 268 

CGTCCGGCCCCATGCCCAGCCAAGTTATGTTGTTTGGTCMTTTTAGATAATTAAAAAAA 

AAAATCCTATGTAGTAACATCCTACTAGGAAAGAAGATAAACTCAANTATCCACTGAGTG 

GTGACCTGAAACCAAAGAGACATGGAGGGGCAGCCAGTCAAAAGCCACTGCACTCCAGCC 

TGGGCAACAGAGCAGGACTGTGTCTGGAAAAAAAAAAAATCTAGAAACATACTCAAATGT 

AAACATAAGAACTTAGCATATGGTAAAGTTGTGACTGTGAGTCAGTGCAGAAAAGAATGA 

ATCAATCAGTAAATGGTCCTGGGAAAACCGGCTATTTATTTTAAAATAATAACAAAATAT 

GCTGTATATCATCMTGGATTAMGATTTATATACCTTCAGCATATGTCCTTMATGTTC 

TAGMTAAAATATATATGAATTCMTGTAATTTTTGATGTGAGGGAAGGCCTTTTAGACA 

TGACACAAATGTCAAAAAGCATAAAAGGAAAGATTTTGAATAAATAATAAAC 

Sequence 269 

GCGTCCGACAGATACACCATCATTATAGGTGGCCAMTCATTCAAATCGCTGTTTGCTTC 

TTCTCGGGC TCTTA TTCAAGCTCACGATGCTATTCTCATAGTGATTGCGAATCTCACTGC 

ATMAGGTGATTTTGAAAACATCACTTTTCCTCCACACTAATGTGTGAGCTTAGCCAGAG 

TAGGAGATATATCTTACTTAAAAAAAAAAAAAAAAAAAAAAA 

Sequence 270 

CCCTTTTTTNCGGCCNTNCGGGCAGGTACGCGGGGAGGTCATGCCCGTGTGAGCCAGGAA 

AGGG CTGTG TTTATGGGAAGCCAGTAACACTGTGGCCTACTATCTCTTCCGTGGTGCCAT 

CTACATTTTTGGGACTCGGGAATTATGAGGTAGAGGTGGAGGCGGAGCCGGATGTCAGAG 

GTCCTGAAATAAGTCACCATGGGGGAAAATGATCCGCCTGCTGTTGAAGCCCCCTTCTCA 

TTCCGATCGCTTTTTGGCCTTGATGATTTGAAAATAAGTCCTGTTGCACCAGATGCAAGA 

TGCTGTTGCTGCACAGATCCTGTCACTGCTGCCATTGAAGTTTTTTCCAATCATCGGCAT 

TGGGATCATTGCATTGATATTAGCACTGGCCATTGG 

Sequence 271 

ACCGCGGNGGCTTCATGCAAGCTGTGGGCATGGNCAACGATCACGAAAATCATTNTTCCT 

TTAAATAAAATACAATCCTATNNAAAGGAGTNCTTCCATGAGCAACAATCAAATACGTGC 

TTATGCTGCGATGCAAGCAGGTGAACAACTGGTTCCTNATCAANTTGACGCAGGCGATTT 

ACAAGCCCATCAAGACGAAGTCAMGTTGAATATTGTGGATTATGTCATTCAGATATTTC 

AAGTGATTAATAATGATTGGCCGATCATCTACTTATCCTGTAGNCGCAGGTCATGAAATC 

ATTGGAACCATCACTGCTCTTTGGTTCATGAAGCGAAAGGACTCAAAG 

Sequence 272 

CCCTTAGCGTGGTCGCGGCCGAGGTACACCAAGACCAATTGCTAAAATCTTGGATTATGG 

AAMTTTAAGTATGAAAGAAAGAAAAAACAAAANGTTGAAAAAGAAAAACAATCTTTCAC 

AMCAATAGAGAAATTC GTTT ATCTTTTGGMTCAATTTAAATGATATAAAAATCAAAGC 

AAAAAMGCCAAAGAATTTTTATTAGATAACCGACAGAGTAAAAGTGGCTCTTCGTCTTA 

GAGGGCGTGAAAATACAAGACCTGMCAAGGTAAATTMTTTTAMTTCTTTTTTTGATG 

AAG N A AA ATCG ATTGCAAA ATTAAAGTAAAG AAN NGCAATCAGTTTGGTAATTTTTTAAC 

TCTTCATATTGA ACGT GATAAGAAAAMTTACCCAMTTTACTTCTTCAAAACAAATAAA 

GGMTTMTTGATTTTGAAAAAACTATTNAAGGAAGGAAGAMCTAATGCCTAANGCCAA 

AAACAAAATCGCACTTTAAAAAAA 

Sequence 273 

cccttagcgtggtcncgg cctt tgtacgmattatgactgttttagctncaggaacagat 

ttaaaagtMttgaagacgttttaaaatctgtttttgatgctaagaacatcgaaaaaatt 

g aaam cttgamgaacagagttagcatacgaaattmaaaacacamcmggmttttt 

gttttagctmcttaa aatc tgaggamgtttaatcgaagaatttgtcagaagagtaaat 

attctcaaaamcaagttttaagatttttagttattmtctagattctgaaagaggaatg 

cacaamctttcagacctagaamaatgataaacacamtttttctcttctaaaaaacca 

acancttcaacagaagaaggtaaaagttttcaaamccatttgtcaaaaaaccttttg
AAAAAATCAGAAGAAACAGATTCTTCAAAACAAAATGANCAAGACAAAGTGCTAAGAAAA
CCA AAAACTGTAAAA ACCAG CA AA AG ATCCTAAAGTAGCTCACACAG CAAAAAA AA AAAA 
NAAAAAGTCCTGC 
Sequence 274 

CCCTTAGCGTGGTCGCGGCCGAGGTACNAAATTATGACTGTTTTAGCTCCAGGAACAGAT 

TTAAAAGTAATTGAAGACGTTTTAAAATCTGTTTTTGATGCTMGAACATCGAAAAAATT 

GMAAACTTGAMGAACAGAGTTAGCATACNMATTAAAAAACACAMCMGGAATTTTT 

GTTTTAGCTAACTTAMATCTGAGGAAAGTTTMTCGAAGMTTTGTCAGAAGAGTAAAT 

ATTCTCAAAAMCAAGTTTTAAGATTTTTAGTTATTAATCTAGATTCTGAAAGAGGAATG 

CACAAAACT7TCAGACCTAGAAAAAATGATAAACACAAATTTTTCTCTTCTAAAAAACCA 

ACAACTTCAACAGAAGAAGGTAMAGTTTTCAAMACCATTTGTCAAMAACCTTTTGTT 

AAAAAATCAGAAGAAACAGATTCTTCAAAACAAAATGAACAAGGACAAAGTGCTAAGAAA 

ACCCAAAAACTGTAAAACCAGCAAAAGATTCTA 

Sequence 275 

CCCTTAGCGTGGTCGCGGCCGAGGTANTTNCCTGANCAGTCGAAGTGGATGCCCAGACCA 

ATGGCCAGTGCTAATATCAATGCAATGATCCCAATGACGATGATTGGAAAAAACTTCAAT 

GGCAGCAGTGACAGGATCTGTGCAGCAACAGCATCTGCATCTGGTGCAACAGGACTTATT 

TTC AAAT CATCAAGGCCAAAAAGCGATCGGAATGAGAAGGGGGCTTCAACAGCAGGCGGA 

TCATTTTCCCCCATGGTGACTATTTCAGGACCTCTGACATCCGGCTCCGCCTCCACCTCT 

ACCTCATAATTCCCGAGTCCCAAAAATGTAGATGGCACCACGGAAGAGATAGTAGGCCAC 

AAGTGTTACTGGCTTCCCATAAACACAAGCCCTTTCCT 

Sequence 276 

CCCTTTCGAGCGNCCGCCCGGGCAGGTACGCGGGGAAATGCAAAAAAATCAAATCAATTT 

AATAGMTACATCAGAGATGTTAMGATTTCCCAATTGMGGGATTGTATTTAAAGATAT 

TTCACCACTTTTAGCAAATGGAGAAGTGCTAAATTACACAATTAATCAAATGGCTGAGTT 

AGCTMAGATGCAGATGTTATTATAGGTCCAGACGCAAGAGGTTTCTTGTTTGGGACACC 

TACTGCAGCTTTTTTAAAAAAACCTTTTATTATGGTAAGAAAACCTAAAAAATTACCAGG 

AGACGTTATTAGTTTTGAGTATGATTTAGAATATGGTAAATCAACTCTAGAAATCCAAAC 

TAATATGTTGAAMAAGGCCAMAAGTAGCMTTATTGATGATGTTTTAGCTACTGGCGG 

AA CAATG AAAGCGATTATTAACTTAATCGAATCTCAAGGTGCTGGTTGNTCATAAAGTAA 

TCTTTTTACTTGAATTANGATTTTTAAACCGGAATTGAAAAACTTAAAA 

Sequence 277 

CCCTTTCGAGCGGCCGCCCGGGCAGGTNCTTCTAANGTTAAATCCTGAGGTAAGTCAACC 

AAMCAGGATGTGAATATCCTMTAATMTTCTAAAGTCTTATCrrTTAATGTTGCTCTG 

TMCCMCACCTTTGATTTCTAATTCTTTAGAMATCCTTTANAAACTCCGGTTAACATA 

CCTTGTAATGATGAATTTGTAGTTCCGNGTAATTGNTTAATA I I I I I I I CTTCTGATGTT 

CTTTTAGTAAGTAGAGTGTTTTCAACTTTTTCMTTGAMTAMTGATGAMATTCTCTT 

GATAATGTTCCTMTTTACCTTTTATAGNTACTNAAGMTTGTTAATGTTTACTTCAACA 

CCTTCTGNTATTTGTAAAACACGATTTCCGACACGAGACAATATTAATTATCAAATGTAA 

GCAATGGATTTCACCACCTACATTTTTCCCTTTCTTGCNTTGTATAATTAGGTTNAATAA 

ATCCTTT 

Sequence 278 

CCCTTAGCGTGGTCGCGGCCGAGGTACAAGATAGTNNTCTCAGTAAAAGGTCTATTATCT 

AACTTGCCAAACTTGTTTACTGAGAGCCCTAAGGAACTAAMCTGCCATAATGCCGTGCA 

CAGCTTGAAAAGCAATTAGAGTMGCMGATTAGTTTTTCCTCCCTTCCAGTTCCTCAGC 

AGGCCTGGCTGAAGGCCCAGGAGGGAAGGAAATATAAGANCCAACAATAAAAATAGCACT 

AGCAATAANAAGAATGCCATCCCATGGAGCACACCATAAT

Sequence 279 

CCGAGGTACTAMCTCTNTTTAATGNNCTAACGTCATATTTTTTAAGTTTTTCMTTCCG 
TTTAAAAATCCTMTTCAAGTAAAAAGATTACTTTATGAACAACAGCACCTTGAGATTCG 
ATTAAGTTAATMTCGCTTTCATTGTTCCGCCAGTAGCTAAAACATCATCAATAATTGCT 
ACTTTTTGGCCNTTTTTCAACATATTA

CTMATCATACTCAAAACTAATMCGTCTCCTGGGTMTTTTTTAGGGTTTNCTTACCAT

AMTAAAAAGGGGTTTTTTTAAAAAAAAGCTGGCAGTAAGGGTGTCCCAAACAAAGAAAA 

CCCTCTTGCGGTCTTGGGACCTATAATAACAATCTGCATCTTTAAGCTAAACTCAAGCCA 

TTTGGGATTAATTGGTGTAAATTTMGCACTTTCTCCATTTGGCTAAAAAGTGGGGNGAA 

MTATCTTTAMAATACMTCCCTTCAATTGGGGAAMATCTTTTAACATCCTCTGGATG 

GNATTTCTATTAAAAATTGGATTTGGA 

Sequence 280 

TATTTGGTGAAGATCAGCGTTATCAGCATTTTCTACGATTAMCGCTGGCCATGCTTTGA 

CTGATGAAATCCGCCAAGCTATTCAGCAGTTGGCGGATTGGGTCAGAGAAAGTCTCAAGT 

CGAAATTGAGTTAGTCGAAGTCTGCTTTCAATACGATGCGATAACGTGCTTGACCTGAAT 

G AAGTCG CTCAATGGCATCGTTGAGCTGTGACATAGGATAGAGTTCAATTTGCGGGGCGA 

TAi I u lACGTGCTGCAAATTTGAAGAAGTTGACGAAGTGCTAAAGGAGAACCCGTCGGT 

GAACCTGTTACTGATTTGGCACCATCAATCAAAGCACCCGACTGAAACTGGGAATAGGTT 

CTAAAAGTCAGAGCTAGAAAAATG 

Sequence 281 

NGGTACCTCCACCGCGGTGGCGGCCGAGGTACANCCTCTCGGCCCGGCTAAACATCATCG 

TCTTGGTAGGTCATTACCNTACCMCTAACTAATGNTCCGCACCCCCATTTTTAAGTGAA 

GCTGNGAAGCTCCTTTCTATTACTCATCATGCGATAAATAACTATATCCGGTATTAAGCT 

ATTGATTCCAATAGTTATCCCAGNCTTAAAGGTAGGTTAGGTACCTGCCCG 

Sequence 282 

CGANAAGCANGATTTTNAATTNTTGCAGCCCGGGGGATCCGNGGAGNGGGGAGAGCCCAC 

CGCGGGGGAGCGCCAACACGACNNAGAGCGAGNCGNAATACGCGCGCACACNGACCGCNG 

ANNAACAAACGNNGAGACGGGGAAAACCCNGGCGNCCNCAACAAAAACGCCCGNGCAGCA 

CAGNCCCCAAAAGCCAGCNAGGCGGAANANCGAACAGGCCCGCACCNAGCGNCCGNCCCA 

ACAGNAGCACAAGCCCCGAANGGCGAAAAGGG 

Sequence 283 

AAACCCCCTNAAAAAGTNCCAAGGGGCCGCCAANNCGACTANAGCGAGNCGCATTACGCG 

CGCNCACTGTTTGTCGTNNTTAACGCCGCGACCGGGNNGAACCNGGGCGNNACCCAACAG 

GAAGCGCCGCGCANCACAACCCCCNACGCCAGCAGGCCGAnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnAGNGGAGGAAACAGCGCAGCAGAGACCGNNACACANGCCAAC 

GCCCNAGCNCCCGCNCNNNACGCNAGCAACCCCACCNAACANCGNCCACGNACGACNGCN 

NGCCCCCGCNCAANCNCNAAAACGGGGGCCCCCNANNANGGGGACCCGANAAANCGCNAN 

NACGGCACCCNCCGACCCCAAAAAACNNGAAAAAGNGAGAAGGGAGCACGNAAGNGGGGC 

CAANCGCCCCNGAAAAGAACNGGCGNCCCGCCCNNANGAACGAANGGGANGACCCACGNC 

CCNAAAAAANAGGGGGACACCCGAGNNACCCAAACGGGGNAACAACACGCAAACCCNAAG 

GCCGGCCGAAACGNGAANGGAACCAACAAGGGGACACANGCCCGAAAACCGGNCCAAAGG 

GGGAAAAAAAAAAGAAGCCGNAANNCAAACAAAAAANCNNAACGGCGAAAGNNNAAACAA 

AANAAGAAACCNCNAACAACNCAAAGGNGNCACNNNNNCNGGGAAAAAGGGGCCCC 

Sequence 284 

CGGGCAGGTACCTAACCTACCTTTAAGACTGGGATAACTATNGGAAACAATAGCTAATAC 

CGGATATAGNTATTTATCGCATGATGAGNAATAGAAAGGAGCTTCACAGCTTCACTTAAA 

AATGGGGGTGCGGAACATTAAGTTAGTTGGTAGGGTAATGGCCTACCAAGACGATGATGT 

TNAGCCCGGGCCCGAGAGGCTGTACCTCGGCCCGCCACCGCGGTGnnnnnnnnnnnnnnn 

nnnnnnnnnnAGNCGTATTACGCCGCGCTCACTGGCCGGCGnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnACTTMTCCGCCTTGCAGCACAATCCCCCTTTTCGCCAG 

CTGGGCGTAATAGCGAAAGAGGCCCCGCACCCGAAC 

Sequence 285 

CGTCCGTNAGGCCTGTNCTTACGGCTGGGTTTAGAAACCAGCCCATTCAGAAAGACTGAA 
TCAGAACATGGATNAAGTGAACTNATTCTAAGATGACTCGNNTATCCATGTNGATTAATC 
TNCTGGNTCATAATAGGCCTCTTCCCTTTGATTGAAGGGTCACGTNTAAGTATANAAAAC

ATAAAACTGTAAGGTAGAGGAAGCGAAGGATAGCTTNGTATTAATGTTGCGTTAAAGCTT 

CAGAGACAAGAACAAGAACACTCCTCCCACGTGACAGCATTTGAATAGGAGGCGGNGGGT 

GCNGCAGCCTGGGCAGCTTCAGTCCCGATTTACAATAAAGTACCTTGNGNGTNATTAGTT 

CTTAAATGTTTATTTAGAAATGGCATTGATGTT 

Sequence 286 

GCGTCCGGTCACCGCACTGAACTTCGGGACCAAAAGCTTCCATGCCGCGGNCCCCAGACA 

AAGGCAGCTTCCCGCTGGACCACTTCGGTGAATGTAAAAGCTTTAAAGAAAAATTCATGA 

GGTGTCTCCGTGACAAGAACTATGAAAATGCTTTGTGCAGAAATGAATCTAAAGAAGTAT 

TTAATGTGCAGGATGCAAAGGCAGCTGATGGCACCGGAGCCGCTAGAGAAGCTCGGCTTT 

AGAGACCTAATGGAAGGGAAGCCAGAGGCAAAGGATGAATGTTGAGAAGGGAGCCACAGG 

ACCTTGTCCCCAGCCTGGAGCAGAGCTGAGCCCTTCTGCCCACAGNGCAGGGGGACCTGA 

CACTCAGCCCGTGCTGGCCCGTGCAGGGGCTCTCCCTGGG 

Sequence 287 

CGTCCGGTGCACTGCAACTTNTATATNTAACCCTAAACTCCAATAAAACAAATTCCAGGG 

AAAACCAAGGGTGTAATGGGATGTGCGTGTTTATCAGGAGTGTGCTCTCACGTGGATGCT 

GAATGATGGAGGACAGCGGACTGCATAAGCCAGAAACCTGTACGGGTGCTGGCTGTGGAA 

AGACGTGTCTGTCTCTATCTATGTACAATAGTTNATTCTGTCAGGCTGAAAAAGTATGGT 

CTTTAGGACCTTGCCCCTCTAACTATAGAAACTTAAACAGTGTACTGCTATTAGATATAT 

CTGATATTAATAGAACATGCCAAGTGCAGGTCCCAAATGCGTATTTGTGAAGCACACATC 

TGAGTAAATGGCTTAGATGGAAAGCAAGTCATCATGAGTAAAAATTAAGCCTCAAACCTG 

CCGGTGCTCCTCACCTCTTTGTCACCAGGTAAAGGTCACACTGTGTGTTGCTTTTGNTGT 

CTTCTCTTCCTAACCTAG 

Sequence 288 

CNCGCGTCCGGGGCTTGTTTACTATGGCCGATGATCTGGAGCAGCAGCCTCAAGGCTGGC 

TGAGTAGCTGGCTGCCCACGTGGCGCCCCACTTCCATGTCTCAGCTGAAGAATGNGGAAG 

CTAGGATCCTCCAGTGTCTCCAGAATAAGTTCCTGGCCAGATATGTGTCCCTCCCAAACC 

AGAATAAAGAT CTGG ACGGTGACTGTGAGCCCCGAGCAAAAGGACCGCACCCCCCTGGTG 

ATGGTGCACGGTTTTGGGGGCGGCGTGGGCCTCTGGATCCTCAACATGGACTCACTGAGT 

GCCCGCCGCACAGTGCACACCTTCGATCTGCTTGGNTTTGGGCGAAGCTCAAGGCCAGCA 

TTCCCGAGGGACCCGGAGGGGGCTGAGGATGAGTTTG 

Sequence 289 

NGGAAAGCCGGCATAAGTGACATNGTTTGGGCAGTTGCCNGCTGGACTGAAGGGCNAACC 

CANACCACCTTAAGCCATAAAAGCCCGTGACACTGCTANCNAAGGTGCCTTGCCCACCGC 

TTTGCCACCNGTCCCGGAAATGNAAAAAAAGTCGCGTGCCNTAAAAAAGCTGCCGGAAGG 

NCCTGGGTGNACNTTTGGGCCACCCCCACCCCGCTGGCAAGGNCTTGAATTGNGTNACNC 

CAAAAGACGCCCANGCCGGACCTTGGNAAANNATTGTTNTTTNGGGANAAAAAAAAAATG 

GANCCCGNTGGGGGAGGCCCTTGGGGGCATTGNGNAAGCCCCCGGAGGGTTCCCCGNTGT 

TGGCNGGGGTCAAATTCAAAGGCCAGTGGTNGGCCACCCCGGGGAACCTGGNNGCTGTTG 

CAAGNACCNNGGTGGGGAACCGTTTCAAGGAATACCCCAACCAACCCANGTAGCCACCTT 

AAGGTAATTTGGCCCACCTTGCCCACCANAANGNGCCATTGGGGAAGAAACCACCAAAAA 

CGTTCCCCCCGGGT 

Sequence 290 

CCCCGCGCTCAGTATGACTCTTTAGTCCCAGTTTTCATGGGTAGTCTCTAAATTCTTTAC 

CTTTATGTGATTGTGAGTTGGGAGGTGGTGGGCATCATCTTAGTCCATTTACCTTTTTCA 

GTTTTGTAATT ATCG TCTTCATCTACTACCTTTATATAATATAAAGGGAAGGGGTCTTCC 

TTTACAAATAGTTTTATCATCCTTCTCTTTTTGATGTCTATATCTTCTATTTTTTGAGGA 

GAATATATGTTGTATAGACCTACACGTGGGTGGAAGAAGAGTCATGTATGTGTGATTGTG 

TGAGATCCAGMTGTTGGAACTTTTAATTTCTTATTTTGTACTTATAATTACCTGCTGGA 

ATACCTGGTTGCATTCTGTATATTGTACCCTCATTTAAAAGTTCATGGAGGCAAAATAAC 

TCTGTTGCACATAAGGCCGGGGCTTATGCATGTCCTATCGGATGTGGGCTCAGATCACGG 




WO 01/070979 



PCT/US01/09126 



TABLE 1 

53/467 



AAGACCCGAA 
Sequence 291 

TCTTGCAGACTCAAGCTCCGCGCGCAGCCGCTCCTGGTGCGGGCCCACAGCAGCCTGGGC 

CCCGGNTCGGCCCCGGAGCCCCCTGGCCTGCGACGACTGTTCCCTTCGATCGGCCAAATC 

CTCCTTCAGCCTCCTGGCGCCCATCCGCAGCAAGGACGTTCGCAGCAGGAGTTACCTGGA 

GGGCAGCCTNCTGGCCAGNGGGGCCCTGCTGGGGGCAGACGAGCTGGCCCGCTACTTCCC 

AGACCGGTACGTGGGGCTCTTCGNGGCCACCTGGAACATGCAGGGCCAGAA 

Sequence 292 

NGGCCCGCCCGGGCAGGTACTAMCTTTMTTAATGAGCTAACGTCATATTTTTTAAGTT 

TTTCMTTCCGTTTAAAAATCCTAATTCAAGTAAAAAGATTACTTTATGAACAACAGCAC 

CTTGAGATTCGATT AAGTTA ATAATCGCTTTCATTGTTCCGCCCAGNTAGCTAAAACATC 

ATCAATMTTGCTACTTTTTGGGCTTTTTTCMCATATTAAGTTTGGATTTCTAGAGTTG 

ATTTACCATATTCTAAATCATACTCAMACTMTAACGTCTCCTGGTAATTTTTTAGGTT 

TTCTTACCATMTAAMGGTTTTTTTAAAAAAGCCTGCAGTAAGGTGGTCCCAAACAAGA 

AAACCCTCTTTGCGTCTGGGACCTATAATAACCATCTGCATCTTTAGCTAACTCAAGCCC 

ATTTGATTAATTGGNGTAATTTAGCACTTCTCCATTTGCTAAAAGGTGGTGAAATAATCT 

TTAAAATACAATCCCTTCAAATTGGGGAAATCTTTTAAACATCCCCCGGCGGTACCTCGG 

GCCCGCTTCTAGAACTAGTGGGATCCCCCGGGCCTG 

Sequence 293 

GCTCCCCGCGGTGGCGGCCGCCCGGGCTGGTACGCGGGGAAATGCAAAAAATCAAATCAA 

TTTMTACGMTACATCATGAGATGTTAMGATTTCCCAATTGAAGGGATTGNNTTTAAA 

GATATTTCACCACTTTTAGCANATGGAGAAGTGCTAAATTACACAATTAATCAAGTGGCT 

GAGTTAGCTAAAGATGCAGATGTTATTATAGGGCCAGACGCCAAGAGGTTTCTTGTTTGG 

GACACCTACTGCAGCTTTTTTAAAAAAACCTTTTATTATGGTAAGAAAACCTAAAAAATT 

ACCAGGAGACGTTATTAGTTTTGAGTATGATTTAGAATATGGTAAATCAACTCTAGAAAT 

CCAAACTAATATGTTGAAAAAAGGCCAAAAAGTAGCAATTATTGATGATGTTTTAGCTAC 

TGGCGGACAATGAAAG 

Sequence 294 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAATTTCTAATTG 
ATCCTGTTCACATTCAGTGAAATGGCATTGCATATTTATATGTTGCTTACAGCTTATTGA 
TTTAGGTAACTATTGTGTCTTCCTTCACTATCTGACCTGAAAAGCACTCTCTTCTCTATG 
CACTCTTATATTCTGCCTTTCTGCCTGGAGTTTGAAATACATGTCTCTTTAGTTTCTTTT 
GCACATGCTACATTGTGCTTTAGACCGGGAGATAATACAGGNGCCTTACCTTACAAATTN 
ATNTTTNTGGCAACNCNAATTNTNTNGAAATTTTNNTTTAATTTNAAAACCCCAAACCAA 
TTTTCCNNCNCAAAAAT I FT I I I I I IGGGAAAAATTAANTTCTTTAAANNNAACCCCCCN 
AAAAATTATNGGNGNNAAAAGGGNGCCCCNTTTTGGGCCC I 1 1 I I I I I I I 1 1 CNCGGGNG 
GGGNAAAAAATTTNAAAAAAAN I 1 1 1 I I 1 IGGGNNCCCCGGGAGAAAAANNTCCCCTNTT 
1 1 I I I I I ICNGGGGTTTTAAAAANGGGGGGNNAAAAATNTTTNNTGGCCCCCCCCNTTTT 

Sequence 295 

TATAGGGCGA ATTG GAGCTCCCCGCCCGNGGTCCCAAATGGAAGTGTGAAAACCANGGCC 

CATCCCCCNNTTTTNTAGAGGGGTGGTAAAAAATAAACCCANANATCAAGGGGAGAAAGG 

AAAAGGATGAAAGGACAAACTGCCAAAAAAATTTNCCCAAAGTGGCGACTTTTTTAANTN 

TGGGAGCCAGAATTCTGAGGGCT7TGCATTGTCTTTGCAATTCNCTCAAGGAGCCTGAAA 

TTGMMAAMTGCCAACAAGGCCAAATNAACTACTTTTTAGGAGGGGGTTTTGGAGGTC 

TTGGGAAGCCTCATTCCCNTTCAACCNNTCNAATTCTGGGAATGGGGGAAATGGAAAGAA 

TAGAAGATGTTGGGTGCCCACTAGGCTACTGNTTGAAAAGGGGAAGCTTGAAAANTTNCT 

NCACCCAAGGTTGGGTATTCAAAAATATTGTAATGGACTGGGTATTGGCAAAAAGG 

Sequence 296 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTACAGGTGGGTCCCTTTTCAGAGGT 
TGGGCCTTCTAGACCTCACCTGTTCTCACTNCCCTGGTTTAMTTCAACCCCAAGCCATG 
GCCAATGGCCAAATAATAGAAATTGGTTCCCTACCCAGCTGGACCAGGGGAGGGAGGTCT

TGTGCAGTTTCTTGACCACTTTGTTGGTTGGACCATNGGCTTAAATACCAATGGGGTATT 

CGGCTTGAGACCTAMGTTTGTAAGAAAATTNAACCAAAATGGTGCCTGCTTGGGTTAAA 

AATGGGCTACCACCTCMTCTGGACTTCAATTCTTTTAATTCTMTTTTAAGTTTGGG

TTGGTATTCTTTGGCCTAMGGTGGCGGTAGTCCCMCCTCTTTGGGTANTTACCCCTTC 

CTAAATAGGTCAATACCTAGGTAGGTCAATACCTCCCTGGGTGGTAAGGNGGTATTTCTT 

CTTAAAAAAGCCTTTTAAAA 

Sequence 297 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGGCCACACTGGGACTGAGATACGGCCCAGACT 

CCTACGGGAGGCAGCAGTAAGGNNTTTTCCACAATGAGCGAAAGCTTGATGGAGCGACAC 

AGCGTGCAGGATGAAGTTCTTCGGAATGTAAACTGCTGTTATAAGGGAAAAAAAAAAAAA 

AATAANNAAAAAAANGGTACCT 

Sequence 298 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGNCAGGTACGCGGNGAAATGCA 

MAAMTCAAATCAATTTAATAGAATACATCAGAGATGTTAAAGATTTCCCAATTGAAGG 

GGTTGTATTTAAAGATATTTCACCACTTTTAGCAAATGGAGAAGTGCTAAATTACACAAT 

TAATCAAATGGCTGAGTTAGCTAAAGATGCANATGTTATTATAGGTCCAGACGCAAGAGG 

TTTCTTGTTTGGGACACCTACTGCAGCTNTTTTAAAMAACCTTTTATTATGGTAAGAAA 

ACCTAAMAATTACNAGGAGACGTTATTAGTTTNGAGTATGATTTAGAATATGGTAAATC 

AACTCTAGAAATCCAAACTAATATTTTGAAAAAAG 

Sequence 299 

CCGGGCAGGTACGGCCACACTGGGACTGAGATACGGCCCAGACTCCTACGGGAGGCAGCA
GTAAGGAATTTTCCACAATGAGCGAAAGCTTGATGGAGCGACACAGCGTGCAGGATGAAG 
TTCTTCGGAATGTAAACTGCTGTTATAAGGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
GTACCT 
Sequence 300 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTTCTAGGACAATCAGGAAG 
TAATCTTAAAAAAATMTTGAAGATGTTAAMMTACGTTAAMATMAAAACTTGTTTT 
AAACATAGATGCAGTAGAAATTGAAAAACCAGATTTAGATGCAAAATTATTAGCTGAATC
AATTGCMTTAAATTAGAAAACCGTGGATCATACCGTATGGCACAAAAATTTGCAATTCG 
TTTAGCACAAAMGCCGGAGCTAAAGGTATTMAACTAMGTTAGTGGTCGTTTAAATGG 
TGTTGATATGGCTAGATCAGAAGGATATTCTGAAGGTGAAATGAAATTACACACACTTAG 
ACAAGATGTTAGTTATGCAACAGCAACAGCAAGAACAACTTATGGAGCACTTGGAGTTAA 
AGTTTGAGTTTCATTAGGCGAAGTATTT 
Sequence 301 

CCGGCCAGGTACGCGGGGAAATGCAAAAAAATCAAATCANGNTAATAGAATACATCAGAG 

ATGTTAMGATTTCCCAATTGMGGGATTGNATTTAAAGATATTTCACCACTTTTAGCAA 

ATGGAGAAGTGCTAAATNACACAATTAATCAAATGGCTGAGTTAGCTAAAGATGCAGATG 

TTATTAT AGGT CCAGACGCMGAGGTTTCTTGTTTGGGACACCTACTGCAGCTTTTTTAA 

MAAACCTTTTATTATGGTMGAAAACCTAAAAAATTACCAGGANACGTTATTAGTTTNG 

AGTATGATTTACAATATGGTAAATCAACTCTAGAAATCCAAACTAATATGTTGAAAAAAG 

GCCMMAGTAGCAATTATTGATGATGTTTTAGCTACTGGCGGAACAATGAAAGCGATTA 

TT AACTT AATCGAATCTCMGGTGCTGNTGTTCATAMGNAATCTTTTTACTTGAATTAG 

GAI I I I I AAACGGNATTNAAAAACTTAAAAAAATATGACCGTTAGCTCATTAATTAAAAG 

TTTAGTACCTCGGCCCGCTCTAG 

Sequence 302 

AGGTACTTTGATATCTNCGCCCTCTCGTGTGTTCCTTGTGGNGNTAACCAGAGGCAAGAT 

GCCCGAGGAACTTCATGTGTATGTCTACCAGGATTTCAGATGATCTCTAATAATGGAGGA 

CCTGCTATTATTTGTAAAAAGTGCCCAGAAAACATGAAAGGTGTTACAGAAGATGGCTGG 

AACT GCAT TTCTTGCCCTAGTGACTTAACTGCCGAAGGAAAATGTCACTGTCCCATTGGC 

CATATTTTAGTGGAAAGAGACATTNATGGAACATTGNTGNCTCAAGCAACTNGNGAGCTC 




TGNGATGGAAATGMAACTCTTTTATGGTAGTAAATGCTTTAGGAGACAGGNGCGTNCGA 

TGTGAGCCAACATTTGNTAATACCAGCAGGTCCTGTGCATGTTNCGAACCTMCATTTTA 

ACAGGGGGATTATGTTTCAGNAGCACAGGGMTTTTTCCTTGTACGTANMTTTCACCTG 

CACGTTATGGAGAAGTTTGGCAT 

Sequence 303 

CCGGGCAGGTACGCGGGGAAATGCAAAAAAATCAAATCAATTTAATAGAATACATCAGAG 

ATGTTMAGATTTCCCAATTGMGGGATTGTATTTMAGATATTTCACCACTTTTAGCAA 

ATGGAGAAGTGCTAAATTACACAATTAATCAAATGGCTGAGTTAGCTAAAGATGCAGATG 

TTATTAT AGGT CCAGACGCAAGAGGTTTCTTGTTTGGGACACCTACTGCAGCTTTTTTAA 

AAMACCTTTTATTATGGTMGMMCCTAAMAATTACCAGGAGACGTTATTAGTTTTG 

AGTATGATTTAGAATATGGTAAATCAACTCTAGAAATCCAAACTAATATGTTGAAAAAAG 

GCCAAAMGTAGCAATTATTGATGATGTTTTAGCTACTGGCGGAACAATGAAAGCGATTA 

TT AACTT AATCGAATCTCAAGGTGCTGTTGTTCATAAAGTAATCTTTTTACTTGAATTAG 

GATTTTTAAACGGAATTGAAAAACTTAAAMATATGACGTTAGCTCATTAATTMAGTTT 

AGTACCTCGGCCGCTCTA 

Sequence 304 

GCGG TGGCG GCCGAGGTACCTTNTCCGAATGCACCTTNAAAGCGGGTATTAGCCTATACA 

GGCTGTTTTAGTCGAATGCAGACCATCAAGGAAATTCNNGAATATCTATCTCAAAGACTG 

CGCATTAAAGAGGAAGATATGCGCCTGNGGCTANTCCANAAGTGGAGAANTACCTTACTC 

TTTCTGGGNTGATGAGGAATCATAAATCTGGAATATTTNGAAAATCCAGGATGAACAACA 

C 

Sequence 305 

GCNGNCGCGGGGAAATGCAAAAAAATCAAATCAATTTAATAGAATACATCAGAGATGTTA 

MGATTTCCCAATTGNNGGGATTGTATTTAAAGATATTTCACCACTTTTAGCAAATGGAG 

AAGTGCTAAATTACACAATTAATCAAATGGCTGAGTTNAGCTAAAGATGCAGATGTTATT 

ATA GGTC CAGACGCAANGAGGTTTCTTGTTTGGGACACCTACTGCAGCTTTTTTAAAAAA 

ACCTTTTATTATGGTAAGAAAACCTAMAMTTACCAGGAGACGTTATTAGTTTTGAGTA 

TGATTTAGAATATGGTAAATCAACTCTAGAAATCCAAACTAATATGTTGAAAAAAGGCCA 

AMAGTAGCAATTATTGATGATGTTTTAAGCTACTGGCGGAACAATGAAAGCCGATTATT 

MCTTAATCGAATCTCMGGNGCTGGTGNTCATAMGTMTCTTTTTTACTTGGAATTAG 

GGATTTTTNAACCGGAAATTGGAAAAA 

Sequence 306 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGCAATTA 

TGAAATTATTGCAGAAAGAAGATTCACTCTCACCTGATGAATAAGTGTTCATAGGTNAAG 

GCTACAAAATACTMTTTGTTATTATTTTTMTAATAATTTTTGTTTTGCTG 

GATTT ACCAC TTTTTTATTTTTTMTCCAAGGAGGAAMATTAT

AAMATTTTTCACGTTCTAAACCAGTTCMGMCATTGAGTAMCAGAAATATTCCATTT 

GTCAAAGTTTTTCTTATCGGCTCAGATAATGAAAAAATTGGGATAATTGAAACAAGAGAA 

GCTATTGAAATGGCAAAAGAACAAAA 

Sequence 307 

CGAATNGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGCAGCAAGCGGACGTGAGC 

GATAATGGCGGATATGGAGGATCTCTTCGGGAGCGACGCCGACNGCGAAGCTGAGCGTAA 

AGATTCTGATTCTGGATCTGACTCAGATTCTGATCAAGAGAATGCTGCCTCTGGCAGTAA 

TGCCTCTGGAAGTGAAAGTGATCAGGATGAAAGAGGTGATTCAGGACAACCAAGTAATAA 

GGAACTGTTTGGAGATGACAGTGAGGACGAGGGAGCTTCACATCATAGTGGTAGTGATAA 

TCACTCTGAAAGATCAGACAATAGATCAGAAGCTTCTGAGCGTTCTGACCATGAGGACAA 

TGACCCCTCAGATGTAGATCAGCACAGTGGATCAGAAGCCCCTA 

Sequence 308 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTAATGTTATTAATGTGACTGACA 

AGTAATTAGAAAACTGGAAATTAMTTTTACAAACATTTTTAAAATCGCTNCAATTAAAA 

AAATTCAAGATGGTTACATTATGAATATGAATGAAATGTCATTAGCGACTTCGTTAAATG 
TAT ATGTAATTCTATA TTTTCCCCAAMCCCACATTTTATGAAGAATATTTATTTATTTA
TTTATTTTTGi nil iGAGATGGAGTCTCGCTCTGTTGCCAGACTGGAGTGCAATGGTGC 
GATCTCCGCTCACTGCAACCTCCACCTCCTGGGTTCAAACGATTCTCCTGCCTCAGCCTC 
CCGAGNAGCTGGGACTACAGGCACCGNCACCACGCCCGGCTAATT 
Sequence 309 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTGACCCTCCTTGATGGTTTACTTTGCAAGCTA 

TGGTGACCTCCGCAAGTTGTGTCTGGGCCCATCCAGGGCTCTGACTAATTGTATTCAAAT 

CA AGGCA GGAGCGGGCCAGCTGGCGTTGACTTAACCAAGCCATTTTATAAGCCTCCCGAT 

CATTTTTAAGCCACTCTAAGTCGTGTAGTAGGATCTGGTCAGAGTTATGTATACTCTGAT 

GGGCATGTGCTGTGTCGTCTAAAATGTCCAGAAGTTCTGAAACACTTTTAGATCTTCCAG 

AATTTCTTGAGGA AGTCT GCCTAAGTAACTGATGCACATCAAGTTCATCACCGGAGGAAT 

CAAAAGMTTTCCATTTTCTATTTCTCTACAGAAAAGAAAAGGATCTTCCTTTAAGATGG 

AAATATTATTTCTCTTC 

Sequence 310 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCACTTGAATTATCTA 

TTGAAAGAACT ACTACAT CGAGTTTTTGTTCTTTTGCCATTTCAATAGCTTCTNTTGGNN 

NAATTATCCCAATTTTTTCATTATCTGAGCCGATAAGAAAAACTTTGACAAATGGAATAT 

TTCTGTTTACTCAATGTTCTTG AACTG GTTTAG AACGTG AAAAATTTTTAG GATTTG GTT 

TGGAAATAATTTTTCCTCCTTGGATTAAAAAATAAAAAAGTGGTAAATCCACTTTCTCAG 

CAAAACAAAAATTATTATTAAAAATMTAACAAATTAGTATTTTGTAGCCTTTACCTATG 

AACACTTATTCATCAGGTGAGAGTGAATCTTCTTTCTGCAATAATTTCTAATTGCCCCGN 

GTACCTGCCCGGGCGGC 

Sequence 311

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGNCAAGGTACCTGACTGTGGC 

TCANATCTGCGTCGCAGCAGCGAGAGAAGAAATCACTCCATATCCGATGAGAGGAAGGGT 

GGCACAGANATGGTGTCTACAATTAGAGACATTTCTGACTCCACCTTAGCCTAAGCAAAC 

TTTATATACTGAGTMCATTTGMGGTTGTCTTTTMTGGTGGGGGGTGNTTTTTCCTTT 

TTAAACTACAGT 

Sequence 312

CCAAAANGGGNCCTGGGGCGTGGTCACGGCCNAGGTACAAAATTATGACTGTTTTAGCTC 

CAGGAACAGATTTAAMGTMTTGMGACGTTTTMAATCTGTTTTTGATGCTAAGAACA 

TCGAAAA AAATT G AAAA ACTTGAAAGAACAGAGTTAGCATACGAAATTAANNAGCACAAA 

CMGGAATTTTTGTTTTAGCTAACTTAAAATCTGAGGAAAGTTTAATCGAAGAATTTGTC 

AGMGAGTAMTATTCTCAMAMCAAGTTTTAAGATTTTTAGTTATTAATCTAGATTCT 

GAAAGAGGAATGCACAAAACTTTCAGACCTAGAAAAAATGATAAACACAAATTTTTCTCT 

TCTAAAAAACCA 

Sequence 313

AATNGGGCCTNGCGTGGT CACG CCCAGGTACNAAATTATGACTGTTTTAGCTCCAGGAAC 

AGATTAMAGTATTGAGCNTTTTAATCTNTTTTGATCTAAGACATCGAAAATTGAAAACT 

TGAAAGACAGAGTTAGCATACNAATTTAMNGCACAACAGGATTTTTTGTTTTACTAACT 

TAAA TCT GGGGA AGTTTAATCNAAGATTTGTCAGAAGAGTAAATATTCTCAAAAAACAAG 

TTTTMGATTTTTAGTTATTMT CTAGA TTCTGAAAGAGGATTGCNCAAAACTTTCAGAC 

CTAGAMMTTGATAMCACAAATTTTTCTNTTTTAAAAAA 

Sequence 314 

NGGGCCTNGGNGTGGNNACGANCCAGGTACTTTTACCAAAGAATCTACTAGAACTCTCTG 

CTATTCAAAACAAAGAGCTCATACTTGTTGGAGTAGGGAAAAAATTAGAAATTTGACCAA 

AAGATAGATTCAATCAACTACAAAGTCAATTCCAAGATGCTGMTAACATCGAAACTCTTG 

AAAAAGAACTATTANAATCTGGAGTTGAACTfTAATGGATCATATACCTGNTTTGCTTGA 

TCAAGCGGATTGNTCAGCTTAATATTAAAGAAGATGGNATCTATTTAGATCTTACTTTAG 

GACGNGGNGGNCATTCGAGNCAAATTTTAAAAAAACTTACTA 
Sequence 315




CCCTTTCGA GCGGCCGCCCGGGCAGGTACTTGTCCATAATTTGTGAATATATTAACTAAT 

TTTTCTTTTGAGTATtCATTTACTMCTCATMGCTGAAAAGCTTTGAGCTTGATTCATT 

CTCATATCCAAACGAGCTTGTTTGTTGTATGAAAATCCTCTATTGGCTTGATCTAACTGA 

GGTGMGATACTCCTAAATCMGTMTACACCATCTACTTTGTTGATTTGMGTTTTTTT 

AGTTC TTGATCAAAA TCTTTAMATCAGATCAAATAAATTCAATATTTGAAGAAATTTTT 

AAGAGI 1 1 I I CTTTTGTTTGTTCAATTGCTTGTTTGTCTTTATCAAAGACTA 

Sequence 316

CCCTTAGCGTGGTCGCGGCCGAGGTACAGGAACTGCCAAAGGCAACAGAAATTCTTTCTC 

CCTATGTCCCAGCCTACCCCCACTTTACCGAGGCCAACAGCCGCCTCAGAAACCAGATTC 

AGGAGCTAACATGCCCCAGGTCTCACGAGGATCAGAGACTCCAGAGGCCAGGGAAGGAGA 

TCAAGGTAGTCAAGCGGGGGTCGTCTCAGATCTGGTTGTGCTCGAGCTATGCAAATGCCT 

CTCATGGAGATGTGAGGACC TATC TATTATGATGACCAGGGCCACATCCGGAGGGGGCAA 

CAGACTTTCATCTATCAGCCCTTTTCAACCACTGATCTACTAAACTGGAAACACTCTGAA 

Sequence 317 

NGGGCCTTGNGAGCGGCCGCCCGGGCAGGTACGCGGGGGTTTAAAAAAATATTTAAAAAA 

ATGGAGGMTTATG AACTTA AAAAGCAAATTAAGAAGCTTTTTAGGTGCTACTGCACTTG 

TGTTGCCAGTAGCATTTTTTGCTTCTTGCCAAACAAGATTTGNNNAAGTAAATGACCATA 

MTTAGTGATTGCTCACACTTTTAATAGTAGAGAAGGAAGGTTTTTAGCATTAGATCAAA 

TTGTTAAGCTTTGAAATGAAAGTGAAAAAGTTAAAAACAAAGAAGAAGGATTTTATCCAA 

TAACACTAAATCGACAATTTGCGCAAACTTATGCAGAA 

Sequence 318 

CAGGTACTTTAGCTCCAAAATCAGTTTGATGAGATACAGTTGCCCCTATATGAGAATGCA 

CAGGATTCCTCATTGGTGAGTTCAACCATACATTTTGGGTAACTCCTGAAGACATCTGCA 

AATTGTGAGTTAGTTGGTGGGGTCCATTAACATTTGCATATGTTATTCTTTCTACTGAAG 

TGTGTGAGGCCACAACTGTCCCATTATGCATATCANAAACAGAAATTTGTTGAGGATAAT 

TTTGGATATTCAGCAGNGGCTGNGAAACTGGATTTGAATTACCGGGATACATGCATGCTT 

CTTGGTT 

Sequence 319 

CCCTTAGCGTGGTCGCGGCCGAGGTACGCGGGGCAATACGTAGAGATAATAACAGTTTTT 

TAAAAMCTTMTATTTGTTATTGMTGTATATTTTTGAGTATTGCATCTTTTCTATACT 

AATAAGGAGGTGTAATTTGAACGCTTTTAGAAAGAAAAAAGAAGAATTAGTGCAAGAAAT 

TAAAGATTTGATTAATTCTTCTTCTTCATTAGTTATAGCTGAATATCGTGGATTAACAGT 

TGCTGAAATTGAAACTCTTAGAMTGMGCTTAAAGAAGCAGGTGTTTTTGTAAAAGTTT 

ATAAAAATAGACTATTTAAAATAGCATCTAAAGAAGCAGGTTTCGGAGATTTAGAACAAT

CACTAGTTGGTCCAAATTTTTTTGCTTTTGGTTCTACAGATGCANTAGCTCCAGCTAAAA 

TTATTTCAAAATTCGCTAAAAACAMTCCAGTAAGTTGTATTAAAAGGCGGTATTTTTTG 
A 

Sequence 320 

CCCTTAGCGTGGTCGCGGCCGAGGTACCACGGAATTTTAAAAATAGACTNTAAAAMCCN 

TCAATTGCANMCTMTTTTGNTTGATTTACTTCAMTTGAAGGAACTCTTTTAGGTTTT 

GTTGNAATAGTAAANGAAATTCGATNAGTAATTCAATCAAAGAATAATGAATNGACAAAT 

TCTGACATAAAAACAGCCAAAGAATATCATTTAGAACAACTAANAGCGCTTAAAAAAACT 

TCAAATTAACMATATAATTTATAATTTATGGCACTATGAAAAAAAAGNGACTTTATTTA 

ATATCMCTTTANCATTTTCMCAATTTTATTAGCTATTNCTNGTGGTAAAAATCAAACT 

ACTCCTGTAGGCACACAACCANTTGATANTCCAAGCCTGGAGGATTCATCAAGNGAATCA 

ANAGTTCTGGATTTAAATTAATAGAAAAAAAG 

Sequence 321 

CCCTTAGCGTGGTCGCGGCCGAGGTACAATTTCTAATTGATCCTGTTCACATTCAGTGAA 

ATGGCATTGCATATTTATATGTTGCTTACAGCTTATTGATTTAGGTAACTATTGTGTCTT 

CCTTCACTATCTGACCTGAAAAGCACTCTCTTCTCTATGCACTCTTATATTCTGCCTTTC 
TGCCTGGAGTTTGAAATACATGTCTCTTTAGTTTCTTTTGCACATGCTACATTGTGCTTT

AGACCGGAGATAATACAGTGACTTTACCTCACAAATCATATTCTGTCAACACAAATCTAT 

GAATTTAGTTTATTTAMATCAGMCAATTTCCTACAAAATTTT^ 

CTAACAGACCTACCAGAATCATGCTTAAAGGGCTCCCTTGACACTTATTCTATACTGAAG 

GATAAATTTTAAAAAAT 

Sequence 322 

CCCTTTCGAGCGGCCGCCCGGGCAGGTAC I I I 1 I 1 I I H I I M H I I 1 I I H I 1AGTTAA 

ATATAAAT TTATT TATTAACI I I I I CAATACCTTTTGAAGGTCAAAGTTTGTAAGATGAA 

TTAGGTTCTTTTGAAAGTAAATTCCCACAACTGAAAGTGCAGCTAGAATAATAAATACTG 

CATATGCAGCTTCTGTTGATGCTGATTTAATTCCTGCAACTAGAACAGTTGCTATAGAAT 

MAATGCATMCCTMTCCTCANATMGTCCMATTGATATCCMCTTTTTTAGGATTTG 

ATCCTTTGTATTCGGGAGGAAGATTTAAGATAACACCTTGAATTCCTCAAAGCATTACTC 

CCATTAAAAATCCTAAAATATAGAATAATCAAGTTCCATTACCTA 

Sequence 323 

GCGGTGGCGGCCGCCCGGGCAGGTACAAGGTAAAGCAAGAGCTGGCTCTCTACGTTCACC
AAI I I I IGTAGGTGGTGGTCGTGCATTTGGGCCTACAAATAATAAAAATTACAAAATTAA 
ATTAAACAAAMAGTTCGCMATTAGCTTTTGCCTCAGCTTTTAAGTCAACTTGCTCAAA 
ATAATCAAGTACCT 
Sequence 324 

NNCTTAGCGTGGTCGCGGCCGAGGTACGCGGGGAAATGCAAAAAAATCAAATCAATTTAA 

TAGAAT ACAT CAGAGATGTTAAAGATTTCCCAATTGAAGGGATTGTATTTMAGATATTT 

CACCACTTTTAGCAAATGGAGMGTGCTMATTACACMTTANNCAAATGGCTGAGtTAG 

CTAAAGATGCAGATGTTATTATAGGTCCAGACGCAAGAGGTTTCTTGTTTGGGACACCTA 

CTGNAGC i im I AAAAAAACCTTTTATTATGGTAAGAAAACCTAAAAAATTACCAGGAG 

ACGTTATTAGTTTTGAGTATGATTTAG 

Sequence 325 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACTTGTCCATAATTTGTGAATATATTAACTAAT
TTTTCTTTTGAGTATTCATTTACTAACTCATMGCTGMAAGCTTTGAGCT^ 
CTCATATCCMACGAGCTTGTTTGTTGTATGAAAATCCTCTATTGGCTTGATCTAACTGA 
GGTGAAGATACTCCTAAATCMGTAATACACCATCT 

AGTTCTTGATCAAAATCTTTAAAATCAGATCAAATAAATTCAATATTTGAAGAMTTTTT 

MGAGTTTTTCTTTTGTTTGTTCMTTGCTTGTTTGTCTTTATC 

Sequence 326 

CCCTTAGCGTGGTCGCGGCCGAGGTACAGGAACTGCCAAAGGCAACAGAAATTCTTTCTC 

CCTATGTCCCAGCCTACCCCCACTTTACCGAGGCCAACAGCCGCCTCAGAAACCAGATTC 

AGGAGCTAACATGCCCCAGGTCTCACGAGGATCAGAGACTCCAGAGGCCAGGGAAGGAGA 

TCAAGGTAGTCAAGCGGGGGTCGTCTCAGATCTGGTTGTGCTCGAGCTATGCAAATGCCT 

CTCATGGAGATGTGAGGACCTATCTATTATGATGACCAGGGCCACATCCGGAGGGGGCAA 

CAGACTTTCATCTATCAGCCCTTTTCAACCACTGATCTACTAAACTGGAAACACTCTGAA 

Sequence 327 

NGGGCCTTGNGAGCGGCCGCCCGGGCAGGTACGCGGGGGTTTAAAAAAATATTTAAAAAA 

ATGGAGGAATTATGAACTTAAAAAGCAAATTAAGMGCTTTTTAGGTGCTACTGCACTTG 

TGTTGCCAGTAGCATTTTTTGCTTCTTGCCAAACMGATTTGNNNAAGTAAATGACCATA 

AATTAGTGATTGCTCACACTTTTAATAGTAGAGMGGMGGTTTTTAGGATTAGATCAAA 

TTGTTMGCTTTGAMTGAMGTGAAAAAGTTAMMCAAAGMGMGGATTTTATCCAA 

TAACACTAAATCGACAATTTGCGCAAACTTATGCAGAA 

Sequence 328 

CAGGTACTTTAGCTCCAAAATCAGTTTGATGAGATACAGTTGCCCCTATATGAGAATGCA 
CAGGATTCCTCATTGGTGAGTTCMCCATACATTTTGGGTAACTCCTGAAGACATCTGCA 
AATTGTGAGTTAGTTGGTGGGGTCCATTAACATTTGCATATGTTATTCTTTCTACTGAAG 
TGTGTGAGGCCACAACTGTCCCATTATGCATATCANAAACAGAAATTTGTTGAGGATAAT
TTTGGATATTCAGCAGNGGCTGNGAAACTGGATTTGAATTACCGGGATACATGCATGCTT 
CTTGGTT 
Sequence 329 

ATAGGGCGA ATTGGA GC TCCCC GCGGTGGCGGCCGAGGTACTAAACTTTMTTAATGAGC 
TAACGTCATA I I I I I I MGTTTTTCMTTCCGTTTAAAAATCCTAATTCAAGTAAAAAGA 
TTACTTTATGAACAACAGCACCTTGAGATTCGATTAAGTTAATMTCGCTTTCATTGTTC 
CGCCAGTAGCTAAAACATCATCMTMTTGCTACTTTTTGGCCTTTTTTCAACATATTAG 
TTTGGATTTCTAGAGTTGATTTACCATATTCTAAATCATACTCAAAACTAATAACGTCTC
CTGGTAAI I I I I I AGGTTTTCTTACCATAATAAAAAGGGTTTTTTTAAAAAAGCTTGCAG 
TTAGGGTGGTCCCAAACAA 
Sequence 330 

CCGCGGTGGCGGCCGCCCGGGCAGGTACACCTCTGATTCTCACTAGTTGAATGCAAGAAC 

TTGAAAGGTTCAGGTAAGTGTTTTGMAAATTTTGACTTTCCAAACTTTr 

ATCTGAAACTCAGGMTCAAAMATACCGACAGGCACTGTTACTTTCAAAATTCTTTCTA 

TMGTTGAGAATGGGACAGATTTGCAGAGCAAGGGAAACTTGAACAGTTACTTCTAGTGG 

TAGGAAATGAGGTGGCTAGGATATTACCCAGCTGGTGGGTGACTTGGGCAGTGTGTTCCT 

GCTTTCAGTGGTTAGC CTTTAG CMATCTGCTTTAGAGTGAGAGTAGAGGGCAGGCTGTT 

GTATTACAGTGCTCTTGTTTTTGTAMATTTAATTCACTCTACTGNTATTTTGTCTCCTT 

GGGTAAAGNGNTATTTA A 1 I I I I CT 

Sequence 331 

ATTATTGCNGAAAGAAGATTCACTCTCACCTGNTGAATAACGTGTTCATAGGTAAAGGCT

ACAAAATACTAATTTGTTATTATTTTTMTAATAATTTTTGTTTTGCTGANAAAGTGGA 

TTACCACTTTTTTATTT7TT 

ATT TTTCA CGTTCTAAACCAGTTCAAGMCATTGAGTAAACAGAAATATTCCATTTGTCA 

AAGTTTTTCTTATCGGCTCAGATAATGMAAAATNGGGGATA 

Sequence 332 

CCGCGGTGGCGGCCGCCCGGCCMGGTACGCGGGGGCAGAAGAGGAAGATTTCTGAAGAG 

TGCAGCTGCCTGAACCGAGCCCTGCCGAACAGCTGAGAATTGCACTGCAACCATGAGGTA 

AATATTTTCCCTTCGTATTCGGTAGTGCTGTTGAGTCATCTTGTCCAATGCAAATCCTGA 

GMGCTATGTTCCCAAAGAGGGCCAGCTCCATTTAGTGTTTGTTTATAGCCTTACTATGC 

CTCTACCTCTGGGGGTTGTAAATCTGTTNTACCCAATGGGNGGGTTTGTTNCCCTTCCTG

AANCAAA I I I f ICTGCTTNNACACTTGGGCAAACCNTTTCCTAAATTTCATCCTCCCANA 

ACTTTGCNCCNCCNTTGGGGGGAGGTTTGGGGTTTTCAACTCCNGAANAAAAAGAGGGGC 

CCCCCA CCNN AGGGNNTNNT TTNTT TTAMTTNGGGCCNNGGGGGNTTNNANTTAAAAAA 

NNGGGNTTTTNGGGGGACNNTTTTTNTTTTNACCCCCCCCCCCCTTTTTT 

Sequence 333 

GGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAACGTTGACCTTCCTTG 

CCCAAGAATGAATGATTAAAAAGTTGAAAAGCTGTGTTTGTAGATCGTTCAGGCCTTCGT 

CTGCMCTATCAGGAAATATGGGTMCTCTCTTCCAAGAAACTTCCCCACTCTTTTGATA 

AGCATCTCGMAATGTTGTTCGAACATCMTGGTGGTATTTTCTGAAGATGAAGAATTAA 

AGCAGTTCTCAAAGAAAGAAGTTCAGGGAAGTTGAAATACGCATACTCGGCATCCTTGAA 

GAAAACTATGGTGAATTGTCCTCCTTTGCTAATAAGATCCACAAGATAGCCGTTCAACCA 

GATAGAAAAAATGGGAGGTNCCTGCCCCAGGCTTTAAANTGANTNNCNCCNCAGCATCCC 

CTTGGGGATNNGGTAANNNCCNCCCCCCCNANCAMAAAAMAATTTTTCCCCNCCNACC 

NMTTNTGAAAAACCNNGNGGGGGTTTAMMTTTTGNGNTCNNAATTTNAAAAAAAAAA 

AN 

Sequence 334 

CCGGGCAGGTACTAGGAGATATTGATTCTAGTCAATTAGGCATTGTAGACTGTCATGACC 

ACTTAATAAAAAATTATGGACCTGMGCTCACGAGCATCCAGATTTTATTATGATGTCAA 

AAGATGCTGCAATTAAAGAAATGAATGAATATGTAGCAAAAGGAGGAAAAACTGTTGTTA 
CAATGGACCCTCCTAACGTTGGGCGTGATGTTTATCAAATGTTAGATATTGCAAAGAAAT

TAGMGGAAAAGCTAACATTATTATGGCMCTGG7T7TCATAAAGCTGCATTTTATGACA 

AAGGTGCTTCTTGACTTGCTTTGGCTCCAACAGATAAAATTGNAAAAATGGTTGTAGCTG 

AAATCGAAGAAGGAATGGATGAATATAACTACAGCGGACCAGTTGTAAAAAGATCTAAAT 

CCAAAGCCGGAATTATTAAAGC 

Sequence 335 

CTCCCCGCGGTGGCGGCCGAGGTACCGCGGGGAAATGCAAAAAAATCAANNCAGTNNANT 

CNMT ACAT CACAGATGTTNAAGATTTCCCAATTGAAGGGATTGTATfTAAAGATATTTC 

ACCACTTTTAGCAAATGGGAGANGTGCTAAATTACACAATTAATCAAATGGCTGAGTTAG 

CTAAAGATGCAGATGTTATTATAGGTCCAGACGCAAGAGGTTTCTTGTTTGGGACACCTA

CTGCAGCI I I I I lA AAAAAACCTTTTATTATGGTAAGAAAACCTAAAAAATTACCAGGAG 

ACGTTNTTAGTTTTGAGTATGATTTAGAATATGGTAAATCAACTCTAGAAATCCAAACTA 

ATATGTTGAAAAAAGGC 

Sequence 336 

CTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGAAATGCAA 

AAAAATCAAATCAATTTAATAGAATACATCAGAGATGTTAAAGATTTCCCAATTGAAGGG 

ATTGTATTTAMGATATTTCACCACTTTTAGCAAATGGAGAAGTGCTAAATTACACAATT 

AATCAAATGGCTGAGTTAGCTAAAGATGCAGATGTTATTATAGGTCCAGACGCAAGAGGT 

TTCTTGTTTGGGACACCTACTGCAGCTTTTTTAMAAAACCTTTTATTATGGTAAGAAAA 

CCTMAAAAATTACCAGGAGACCGTTATTTAGGTTTTNGGGTNNAGATTTAAAATTGNGG 

AAANCCCCNNT TNTAG AAATCCAAACTAATTNTGTTGNNAAAAGGGCCAAAAAGTNCCAA 

TTATTGGTGATGTTTTAGCTACTGGCGGAACAATGAAAGCGATTATTAACTT 

Sequence 337 

CCGCGGTGGCGGCCGAGGTACCAATAATAGCAACCCTGTGATTTGTCCAAGTGCCCGGGA 

GTGGAGGCCATCCTGACAACAGCTCTATGATTTTCTATGCCAATGACACAGGAGCCCAAC 

AGTTTGAAAAGTGGTGGGATAAGTCCAGGACAGTCCCCTTTTATCTTGTAGGGCTCCTCC 

TCCCACTGCTCMTTTCAAGTCTCCTTCATTTTTTTCAAAATTTAATATCCTAGGCACAG 

TGTCTGTCCTTTATTTGATTTTCCTTGTCACCTTTAAGGCTGTTCGCTTGGGATTTCATT 

TGGMTTTCATTGGTTTATACCAACAGAATTTTTTGTACCTGCCCG 

Sequence 338 

" CCGGGCAGGTACCTGGAAGACTTCTCCACCTCGGGGGCCTGGCTGCCTCACAGGTATGAA 
GACAACCACCATAACTGCTACTCTTACGCACTCACGTTCATTAACTGCGTTCTGATGGCA 
GAAGGTAGACAGCAACTGGACAAGGGTGAATTTACGGAGAAGTACCT 
Sequence 339 

ATAAACTGCGGGATCTCAATGGCTTCTATGATCGTATTGAGGCAGTAGTTCCCACACTCT 

GCCCGGTGCCAGCATGAAAGAGAACAGGGAGAGTCAGCCTTTCACAGTCCTTGTCAMTC 

CCAATTACTCTGGTTGCAGATCACTTGAAGCCTGCCTTGTTTCTCTCACAAAGCTCTGCC 

CGAGAGTCCAGCCCCGCGTACCTGCCCG 

Sequence 340 

GCGAATTGGAGCTCCCCGCGGNGGCGGCCGAGGTACAAGATAGTCATNTCAGTAAAAGGT 

CTATTATCTAACTTGCCAAACTTGTTTANNGAGAGCCCTAAGGAACTAAAACTGCCATAA 

TGCClvrrGCACAGCTTGAMAGCAATTAGAGTMGCMGATTAGTTTTTCCTCCCTTCCAG 

TTCCTCAGCAGGCCTGGCTGAAGGCGCAGGAGGGAAGGAAATATAAGAACCAACAATAAA 

AATAGCAATAGCAATAAGAAGAATGCCATCCCATGGAGCACACCATAATTCTGGAACCAC 

CTNTCCCGGATCAGGCTTCCATTGCTCACGATGCTCACGCTGGGCAG 

Sequence 341 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTTTCTAGGACAA 

TCAGGAAGTAATCTTAAAAAAATAATTGAAGATGTTAAAAAATACGTTAAAAATAAAAAA 

CTTGTTTTAAACATAGATGCAGTAGAAATTGAAAAACCAGATTTAGATGCAAAATTATTA 

GCTGAATCAATTGCAATTAAATTAGAAAACCGTGGATCATACCGTATGGCACAAAAATTT 

GCAATTCGTTTAGCACAAAAAGCCGGAGCTAAAGGTATTAAAACTAAAGTTAGCGGTCGT 







TTAAATGGTGTTGATATGGCTAGATCAGAAGGATATTCTGAAGGTGAAATGAAATTACAC 
ACACTTAGACAAGATGTTAGTTATGCAACAGCAACAGCAAGAACAACTTATGGAGCACTT 
GGAGTTAMGTTTGAGTTTCATTAGGCGAAAGTATTTGCAAAGCAAAATCAAGCATATAA 
TGAAGAAGAACCAACNCACAAAAAAGGGCCAAAAAGAGCAGCAAGAGTTAAAAAAGAA 
Sequence 342 

CCGGGCAGGTACCGCTGTGTCCGGGTGGGTGGTCAGAATGCCGTGCTCCAGGTGTTCACA 

GCTGCTTCGTGGAAGACCATGTGCTCCGATGACTGGAAGGGTCACTACGCAAATGTTGCC 

TGTGCCCAACTGGGTTTCCCAAGCTATGTGAGTTCAGATAACCTCAGAGTGAGCTCGCTG 

GAGGGGCAGTTCCGGGAGGAGTTTGTGTCCATCGATCACCTCTTGCCAGATGACAAGGTG 

ACTGCATTACACCACTCAGTATATGTGAGGGAGGGATGTGCCTCTGGCCACGTGGTTACC 

TTGCAGTGCACAGCCTGTGGTCATAGAAGGGGCTACAGCTCACGCATCGTGGGTGGA 

Sequence 343

CCGGGCAGGTACCGCTGTGTCCGGGTGGGTGGTCATGAATGCCGTGCTCCAGGTGTTCAC 

AGCTGC7TCGTGGAAGACCATGTGCTCCGATGACTGGAAGGGTCACTACGCAAATGTTGC 

CTGTGCCCAACTGGGTTTCCCAAGCTATGTGAGTTCAGATAACCTCAGAGTGAGCTCGCT 

GGAGGGGCAGTTCCGGGAGGAGTTTGTGTCCATCGATCACCTNTTGCCAGATGACAAGGN 

GACTGCATTACACCACTCAGTATATGTGAGGGAGGGGATGTGCCTCTGGCCAC 

Sequence 344 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTAAACTTTAATTAATGA 

GCTMCGTCATATTTTTTAAGTTTTTCAATTCCGTTTAAAAATCCTAATTCAAGTAAAAA 

GATTACTTTATGAACAACAGCACCTTGAGATTCGATTAAGTTAATAATCGCTTTCATTGT 

TCCGCCAGTAGCTAAAACATCATCAATAATTGCTACTTTTTGGCCTTTTTTCAACATAAT 

TAAGTTTGGATTTCTAGAGNTTGATTTACCATATTTNTAAATCATTCCTCAAAACTAATA 

ACCGCCNTCCTGGGAMTTTTTTTAGGGTTTTNTTACCCCTAAATAAAAAAGGGNTTNTT 

TT 

Sequence 345 

CCGCGGTGGCGGCCGCCCGGGCACGGTACCACTTGAATTATCTATTGAAAGAACTACTAC 

ATCGAGI 1 1 I IGTTCTTTTGCCATTTCAATAGCTTCTCTTGTTTCAATTATCCCAATTTT 

TTCATTATCTGAGCCGATAAGAAAAACTTTGACAAATGGAATATTTCTGTTTACTCAATG 

TTCTTGAACTGGTTTAGAACGTGAAAAA I I I 1 1 AGGATTTGGTTTGGAAATAA I I I I ICC 

TCCTTGGATTAAAAAATAAAAAAGTGGTAAATCCACTTTCTCAGCAAAACAAAAATTATT 

ATTMAAATAATMCAAATTAGTATTTTGTAGCCTTTACCTATGAACACTTATTCATCAG 

GTGAGAGTGAATCTTCTTTCTGCAATAATTTCTAATTGCCCCGCGTCCTTGGCCGCTCTA 

GAACTAGGTGGG 

Sequence 346 

CACTACTATAGGGNGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTAGGA 

GATATTGATCCTAGTCAATTAGGCATTGTAGACTGTCATGACCACTTAATAAAAAATTAT 

GGACCTGAAGCTCACGAGCATCCAGATTTTATTATGATGTCAAAAGATGCTGCAATTAAA 

GAAATGAATGAATATGTAGCAAAAGGAGGAAAAACTGTTGTTACAATGGACCCTCCTAAC 

GTTGGGCGTGATGTTTATCAAATGTTAGATATTGCAAAGAAATTAGAAGGAAAAGCTAAC 

ATTATTATGGCMCTGGTTTTCATAMGCTGCATTTTATGACAAAGGTGCTTCTTGACTT 

GCTTTGGCTCCAACAGATAAAATTGTAAAAATGGTTGTAGCTGAAATCGAAGAAGGAATG 

Sequence 347 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCGTAGAAGAAGAAGG 

AATACCTAAAGAAACAGACATAGAAATCATCCCAGAAATCCCGGAAACTCTAGAGCCACT 

GTCCCTTCCAGATGTGCTGAGGATCTCGGCAGTTCTGGAGGACACCACAGGCCAGCTCTC 

TATTCTGAACTACATCATGCCCGTTCAGTACCT 

Sequence 348 

TNCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTAAACTTTAATTAATG 
AGCTAACGTCATATTTTTTAAGTTTTTCAATTCCGTTTAAAAATCCTAATTCAAGTAAAA 
AGATTACTTTATGAACAACAGCACCTTGAGATTCGATTAAGTTAATAATCGCTTTCATTG

TTCCGCCAGTAGCTAAMCATCATCMTMTTGCTACTTTTTGGCCTTTTTTCAACATAT 

TAGTTTGGATTTCTAGAGTTGATTTACCATATTCTAAATCATACTCAAAACTAATAACGT

CTCCTGGTMTTTTTTAGGTTTTCfTACCATAATAAMGGTTTTTTTAAAAAAGCTGCAG 

TAGGTGTCCCAAACAAGAAACCTCTTGCGTCTGGACCTATAATAACATCTGCATCTTTAG 

CTMCTCAGCCCATTTGATTAATTGTGTAATTTAGCACTTCTCCATTTGCTAAAAGGTGG 

Sequence 349 

TCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTAAAACAGGT 

GCTCCTGTTAAAATAGAAGATCTTGCAGCTACTTCTAGAGATTTAAATTCTAAACGATCA 

ATAGCAGCGTATCCTGTTCCGGCTTTMTMTTCCGGCTTTGGATTTAGATCTTTTTACA 

ACTGGTCCGCTGTAGTTATATTCATCCATTCCTTCTTCGATTTCAGCTACAACCATTTTT

ACAATTTTATCTGTTGGAGCCAAAGCAAGTCAAGAAGCACCTTTGTCATAAAATGCAGCT 

TTATGAAAACCAGTTGCCATAATMTGTTAGCTTTTCCTTCTAATTTCTTTGCAATATCT 

MCATTTGATAAACATCACGCGCAACGTTAGGGAGGGTCCATTGTAACAACAGTTTTTCC 

TCCTTTTGCTACATATTCATTCATTTCrrTTAATTGCAGCATCTTTTGACATCATAATAAA 
ATCTGG 

Sequence 350 

TAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATTCTTCACTATCA 

CTGTCCTGTAAATTTAGTAGCCTTGGCTGGAAACACTGTAGTCGACATGATCTGATATTG 

CTTAATATTTCAGAAAGAGACAGTCTATTTTCACAATGTTTACTGGAAGCATTGGTCCGA 

GAGAAATTAGAAGAAAAGTCTATAGTTTGGGAAGAGCTTGAAAAACTATTCAGCATTTCA 

GGGTCTATCTGTTTCAGGACTGGGTCATGTTCTGTGGATATTCGGTCCATTATGACCCTT 

CCACCTCTGCCAATTCGCCTCCTTGCAAATCCTATACATCTTCTTGGGACTGTAAGTGTT 
GTAAGGC 

Sequence 351 

CTNCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGCTTGGTGACTGTA 
ATTMCAACMTGTATTTTGAMGTCACTGAGTAMTTTTMGTGGTTTTTCCAAAAAAG 
CACATAAGGTAATGCACCGTTAATTAGCTATATTGAGCCATTCCACAATGTATAGATATT 
TC AAAACATGTTAC ACAT GATAM TCCAGTTTTTCTACGTCATTTTTTAAAATTATATTT 
AATI 1 1 1 I I ATTTTGAAAGTTTTTTCACAGATC I I I I I I I I AGTATTATTACCTTCTGAT 

ATATGTGTCATTATTGAAGAACCATACCTTTTAAAGGTATTATTTTGTAATTAAGGTATG 

TCAACAGTAAAAAATAACCAGTGGCCCAGGCCATNGGGGCTCATGCCTGTAAATNCCAGC 

ACTTTTTTCGGAGGCCCG 

Sequence 352 

CCGCGGTGGCGGCCGAGGTACCTGTGAAGACAGCTACACCTGGTTTCCTCCCTCATGCCT 

TGATCCCCAGAACTGCTACCTTCACACGGCTGGAGCACTCCCAAGCTGTGAATGTCATCT 

TCAAACAACCTCAGCCAGAGTGTCAATTTCTGTGAGAGAACAAAGATTTGGGGCACTTTC 

MAATTMTGAAAGGTTTACAAATGACCTTTTGAATTCATCTTCTGCTATATACTCCAAA 

TATGCAAATGGAATTGAAATTCAACTTAAAAAAGCATATGAAAGAATTCAAGGTTTTGAG 

GTCGGTTCAGGTCACCCAATTTCGAAATGGAAGNCATCGTTGCTGGGTATTGAAAGGTTT 
GT 

Sequence 353 

CCTATAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTTTGTTTTATATTCT 

TCAAATCCTATTGATGAAAAACTTAAAGATGAATTAGAAAAATCTTATGTAATTAGACAA 

AAACCAAAAAAACAAAAACTTATTTTAGAAAGACTTGGATAATGATTGATTATAAAAAAA 

TGATTGACCATACTTTATTAAMCCAGAAGCTACATCTAAAGATATTTTAAAACTTATTT 

CACAAGCTAAAGAACATGGATTTAGAGGAGTTTGCATTAACTCTTCTTGAGTTAAATTAG 

CAAAAGAAAAACTTGCAAATACAGATTTAGATATCGTTTCAGTAGTTGGCTTTCCATTAG 

GTGCATCAAACACACAAACCAAGGTTTTTGAAGCAAAATTAGCAGTTGAACATGGAGCTA 

CTGAAATAGATATGGTTATAAATGTGGGTAAATTCAAA 
Sequence 354

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGTGCATGAAACCACACA 

TACGGGCAAAAGCAGGAAGAGACTTTGCAGACATAAAAACCAGGTAAGTTTTGAGAGTCA

GGTGATTGCA AGA I I I I I I I I I CTTTCATTGACACTTAGCTAATTTTGAGCCATATTTTT 

MATGTGTGTCTT TTCAG TTACTTGAAACCTCTTCTCCCACCTCCCATCCTTTCCAAACC 

TCCTCCMGCCMTTTTCCAGTTGGTTTAAAAGAAATCATTAGGGTGACCCCCACTCCCA 

CTTTCTGGAACCTCAGGCAGAGCTCCTGGGGTTCCTCC 

Sequence 355 

NNI 1 1 1 1 I I I I I I I I I I I I I 1 1 IGCGTGGAATGCTTCACCTCTTGCTAAAGGAACAGTAA 

TGTCATATCCTGTTAATCTTGCAGGAGGCGCTAATAAGAATTCAAATGCTTTCTCATTTC 

TCTTGTAATAATTTCAGTGAACTGAAAATGATTTAACAGCTTCGTGAACAACTAAAAGTC

TTCCAGI I I I I I IAACAGAGTTAATAATTGTATCTGTATCTAAAGGAGAAATTGTTCTTA 

MTCMTTMTTCTACAGAGTATTCTCCAT7TAATTGTTTTAAAGCAGCTAGTGCTTCGT 

GAACTTGTGCTCCATATGTTACTAATGTTAAATCAGAACCTTCTACTAATACATTTGCTT 

TACCMTTTCAACTTCATAAATTCCTGCTGGAGCTTCTTGTTTGAATGAACGATAAATTT 

TCTTAGGTTCTAAGAAAATAACTGGATCTGGGTCGTTAATAGCTGCGATTAATAATCCTT 

TTGTATCATAAGGAGTTGAAGGCATAACAACTT 

Sequence 356 

AGGGCGA ATTGGA GCTCCCCGCGGTGGGCGGCCGAGGTCTAAACTTTAATTAATGAGCTA 

ACGTCATATTTTTTAAGTTTTTCAATTCCGTTTAAAAATCCTAATTCAAGTAAAAAGATT 

ACTTTATGAACAACAGCACCTTGAGATTCGATTAAGTTAATAATCGCTTTCATTGTTCCG 

CAGAGCTAAAACATCATCAATAATTGCTACTTTTTGGCCTTTTTTCAACATATTAGTTTG 

GATTTCTAGAGTTGATTTACCATATTCTAAATCATACTCAAAACTAATAACGTCTCCTGG

TAAI I 1 1 I I AGGTTTTCTTACCATAATAAAAGGTTTTTTTAAAAAAGCTGCAGTAGGTGT 

CCCAAACAAGAAACCTCTTGCGTCTGGACCTATAATAACATCTGCATCTTTAGCTAACTC 

AGCCATTTGATTAATTGTGTAATTTAGCACTTCTCCATTTGCTAAAAGTGGTGAAATATC 

TTTAMT ACAATCC CTTCAATTGGGAAATCTTTAACATCTCTGATGTATTCTATTAAATT 

GATTTGATTTTTTTGCATTTCCCCGCGTACCTGCCCGGGCGGC 

Sequence 357 

GAATGGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAGATAGTCATCTCAGTAAAAGGTCT 
ATTATCTAACTTGCCAAACTTGTTTACTGAGAGCCCTAAGGAACTAAAACTGCCATAATG
CCGTGCACAGCTTGAAAAGCAATTAGAGTAAGCAAGATTAGTTTTTCCTCCCTTCCAGTT 
CCTCAGCAGGCCTGGCTGAAGGCCCAGGAGGGAAG 
Sequence 358 

AGGTACANGATAGTCATCTCAGTAAAAGGTCTATTATCTAACTTGCCAAACTTGTTTACT 

GAGAGCCCTAAGGAACTAAAACTGCCATAATGCCGTGCACAGCTTGAAAAGCAATTAAGA

GTAAGCAAGATTAG I I I I I CCTCCCTTCCAGTTCCTCAGCAGGCCTGGCTGAAGGCCCAG 

GAGGGGAAGGGAAATATAACGGAACCCAACAATTAANAAATAGGCAAATAGCCAATTAAA 

GTAAGGAATGGNCATCCCATGGGAGGCANCAACCATTAATTTCTTGGGAACCCACTNTNT 

CCCNGGGATTCAGGGCTTCCATTTGCTTCACNGATGGCTTCACGTCTGGNGCAGCCCGCC 

AACTCTTACTTTGCCAGGAAACCCTCACCTCAACTTTGCCCAGGGTATTTCTNCCCCGGG 

TCTTGGAMNGAAAATGGGCTTCNTCCACCTGAAAAAGGGTTNGAATCCTTTCTTCCCAT 

TACCCAGGCTTTCCNTTTAMGCCAMMGGCAAATTCCCTCCTTTTTTGGCTTTTCCT 

Sequence 359 

CTAATTGATCCTGNTCACATTCAGTGAAATGGCATTGCATATTTATATGTTGCTNACAGC 

TTATTGATTTAGGTAACTATTGTGTCTTCCTTCACTATCTGACCTGAAAAGCACTCTCTT 

CTCTATGCACTCTTATATTCTGCCTTTCTGCCTGGAGTTTGAAATACATGTCTCTTTAGT 

TTCTTTTGCACATGCTACATTGGGCTTTAGACCGGAGATAATACAGTGACTTTACCTCAC 

MATCATA TTCTG TCAACACAAATCTATGAATTTAGTTTATTTAAAATCAGAACAATTTC 

CTACAAAATTTTTCTGGAAAATAGACTCCTAACAGACCTACCAGAATCATGCTTAAAGTG 

CTCCCTTGACACTTATTCTATACTGAAGGATAAATTTTAAA 



Sequence 360

CCGCGGTGGCGGCCGAGGTACAAGATAGTCATCTCAGTAAAAGGTCTATTATCTAACTTG 

CCAAACTTGTTTACTGAGAGCCCTAAGGAACTAAAACTGCCATAATGTCGTGCACAGCTT 

GAAAAGCAATTAGAGTAAGCAAGATTAGTTTTTCCTCCCTTCCAGTTCCTCAGCAGGCCT 

GGCTGAAGGCCCAGGAGGGAAGGAAATATAAGAACCAACAATAAAAATAGCAATAGCAAT 

AAGAAGAATGCCATCCCATGGAGCACACCATAATTCTGGAACC 

Sequence 361 

TNCTATAGGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTTTGTTTTATATT 

CTTCAMTCCTATTGATGAAAAACTTAAAGATGAATTAGAAAAATCTTATGTAATTAGAC 

AAAAACCAAAAAMCAAAAACTTATTTTAGAAAGACTTGGATAATGATTGATTATAAAAA 

MTGATTGACCATACTTTATTAAAACCAGAAGCTACATCTAAAGATATTTTAAAACTTAT 

TTCACAAGCTMAGAACATGGATTTAGAGGAGTTTGCATTAACTCTTCTTGAGTTAAATT 

AGCAAAAGAAAAACTTGCAAATACAGATTTAGATATCGTTTCAGTAGTTGGCTTTCCATT 

AGGTGCATCAAACACACAAACCAAGGTTTTTGAAGCAAAATTAGCAGTTGAACNTGGAGC 

TACTGAAATAGATNATGGGTATAAATGGTGGGTAAAT7TCAAA 

Sequence 362 

CCGCGGTGGCGGCCGCCCGGGCATGGTACAGCCTCTCGGCCCGGCTAAACATCATCGTCT 

TGGTAGGCCATTACCATACCAACTAACTAATGTTCCGCACCCCCATTTTTAAGTGAAGCT 

GTGAAGCTCCTTTCTATTACTCATCATGCGATAAATAACTATATCCGGTATTAGCTATTG 

TTTCCAATAGTTATCCCAGTCTTAAAGGTAGGTTAGGTACCT 

Sequence 363 

CACTACTATAGGGNTNATTGGAGCTCNCCGCGGTGGCGGCCGTGCTGTTGCTTGGCCGCG 

CGCCAGGCTGGCCAGAGGTGCTGTTCCACTGGGGCGTGGCCGTGATGGTGTGCGCCGATC 

GACGTTCTGTTTTAATGAAAGGATGGACATGCAACTGACACTCGCACAACTCTTGGCAAT 

CATGCCCAATGCCCGCTCCAAAGCGGGTATTTTTTTGCCTGCGCTAAACGTGGCCATGGC 

GGAATTCGGTATCAACACGTCGGCGCGCCAGGCCGCGTGGCTGGCCACCATCGGTGTCGA 

GTCCGGTAGCCTGCAGCGGGTAGAGGAAAACTTGAACTACCGCGCGGATCGCCTNCTCGT

TATTTTCGGAAAATACTTCACGCCGGCGTTGGCCGCAGCTTATTGCCGGCAAGCCGGAAA 

TGATCGNCAACCGTGTTTACGCCAACCCGCATGGGGGAAACG 

Sequence 364 

AGGTACTAAACTTTAATTAATGAGCTMCGTCATATTTTTTAAGTTTTTCAATTCCGTTT 

AAAAATCCTAATTCAAGTAAAAAGATTACTTTATGAACAACAGCACCTTGAGATTCGATT 

AAGTTAATAATCGTTCATTGTTCCGCCAGTAGCTAAAACATCATCAATAATTGCTACTTT

TTGGCCI I I I I ICAACATATTAGTTTGGATTTCTAGAGTTGATTTACCATATTCTAAATC 

ATACTCA AAACTMTAACGTCTCCTGGTAATTTTTTAGGTTTTCTTACCATAATAAAAGG 

i i I M i I AAAAAAGCTGCAGTAGGTGTCCCAAACAAGAAACCTCTTGCGTCTGGACCTAT 

AATAACATCTGCATCTTTAGCTAACTCAGCCATTTGGATTAATTGTGTAAATTTAAGCAC 

TTCTCCATTTGCTAAAAGTGGTGAAATATCTTTAAATACAATCCCCTTCAATTGGGGAAA 

TCTTTAAACATCTCNGGATGGTATTCTATTMMTTGAATTGMTTTTTTTGGC 

Sequence 365 

CCGCGGTGGCGGCCCGAGGTACCAAAATAAAGGGTATTTGCTACCTTTAATACTTGCCAG 

TTCAGGTTGGAGGCACAGGCAGCAGCAAGAATGGAAAGAAATGTTCTTACAACATTTTCA 

CAGGAAATGTCCCAGTTAATTTTGAATGAAATGCAAAAGCTGAATATTGCAGTTTATTCA

ATGATTTTGTTGAATCTGMTTTTTTTTGATTGATGGGGATTCATTACTTATCACATGTA 

TCTGTGAGATATCATTTAAGCCTGGGCAGAACCTCCATTTCTTCTATCTGGTTGAACGCT 

ATCTTGTGGATCTTATTAGCAAAGGAGGACAATTCACCATAGTTTTCTTCAAGGATGCCG 

AGTATGCNTATTCMCTTCCCTGGACTTCTTTCTTTGAGAACTGCTTTAATTCTTCATCT 

NCAGAAAGAATACCCCATTTGATGTTCGAACAACATTTTCGAGATGCTTATCAAAAAGAG 

TGGGGAAAGTTTCTTTGGAAGANGAGTTACCCCATATTTNCTGATTGTTGGCAGACGAAA 

NGCCTTGAACGATCTACAMACCNCAGCTTTTTAAACTTTTTAAATCNTTCAATTCCTTG 

GGGCAAAGGGAANGNTNAACNTTTGGTACCTTGCCCGGG 
Sequence 366

CCGCGGTGGCGGCCCGCCCGGGCAGGTACGCGGGGAAATGCAAAAAAATCAAATCAATTT 

AATAGAATACATCAGAGATGTTAAAGATTTCCCAATTGAAGGGATTGTATTTAAAGATAT

TTCACCACTTTTAGCAAATGGAGAAGTGCTAAATTCACAATAATCAAATGGCTGAGTTAG 

CTAAAGATGCAGATGTTATTATAGGTCCAGACCCAAGAGGTTTCTTGTTTGGGACACCTA

CTGCAGCTTTTTTAAAAAAACCTTTTATTATGGTAAGAAAACCTAA 

Sequence 367 

AGGTACAGCCTCTCGGCCCGGCTAAACATCATCGTCTTGGTAGGCCATTACCCTACCAAC

TAACTAATGTTCCGCACCCCCATTTTTAAGTGAAGCTGTGAAGCTCCTTTCTATTACTCA 

TCATGCGATAAATAACTATATCCGGTATTAGCTATTGTTTCCAATAGTTATCCCAGTCTT 

AAAGGTAGGTTAGGTACCTGCCCG 

Sequence 368 

ACCGCGGTGGCGGCCGAGGTACGGCCACACTGGGACTGAGATACGGCCCAGACTCCTACG 

GGAGGCAGCAGTAAGGAATTTTCCACAATGAGCGAAAGCTTGATGGAGCGACACAGNGTG 

CAGGATGAAGTTNTTCGGAATGTAAACTGCTGTTATAAGGGAAAAAAANAAAAAAAAAAA 

AAAAAAAAGGTNCCTGCCCG 

Sequence 369 

GGCGGCCGAGGNACAATATAGNCATCGCNTTAAACNGCCNANTNTTAANCNCGCCAAACT 

TGGTTACTGAGAGCCCTAAGGAACTAAAACCGCCATAATGCCGGGCACAGCTTGAAAAGC 

AATTAGAGGAAGCAAGANNAGNNNTTCCTCCCTTCCAGNNCCTCAGCAGGCCTGGCTGM 

GGCCCAGGAGGGAAGGAAATATAAGAACCAACAATAAAAATAGCAATAGCAATAAGAAGA 

ATGCCATCCCANGGAGCACACCAAAATTCNGGAACCACCNCTCCCGGANCAGGCNTCCAT 

TGNTCACGAAGCTCACGCNGGGCAGNCCGCAACTTTACTTTGNAGNAACCTCCCCACTTG 

GC CAAGG GAATTCNCCCCCGGGCCTGGAAGAAAAGGGNTCTCCCACCCGGAAAGGGGCGN 

ACCTTTTCCCAAAACCAGCCTTTCCTTAAAGCNAAAAGCAAACCCNCTCTTTTGGGTTTC 

NCAAAGGGGGCNGNACAAAAGGGAAGGGTTTTGGGGCNGGGGGGGGGAAACAAAANCCCC 

NCATTNGGAAGNTTGCCCCCGGCCGAGGGGAAGGGGAAAAGGTTGGNCCCCGGTTGGGGG 
GGG 

Sequence 370 

ATTGTTGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACANGGAACTGCCAAAGGCAACA 

GAAATTCTTTCTCCCTATGTCCCAGCCTACCCCCACTTTACCGAGGCCAACAGCCGCCTC 

AGAAACCAGATTCAGGAGCTAACATGCCCCAGGTCTCACGAGGATCAGAGACTCCAGAGG 

CCAGGGAAGGAGANNAAGGTAGNCAAGCGGGGGTGGTCTCAAATCTGGTTGNGCTCGAGC 

TATGCAAATGCCTCTCATGGAGATGCGAGGACCTATCTATTATGATGACCAGGGCCACAT 

CCGGAGGGGGCAACAGACTTTCATNTATCAAGCCCTT 

Sequence 371 

AGGTACTATTGACTAAAGTCAGTTGGGGGAGAGAGAGGCGGAAGTATATTACTTTTATGC 
TTGGTTATACTAGAGAACAAATAGAAACTGACTAAAGAAACATTAGATCAGTGGTTCTCA 
GAGTATTGATATCTGGGAGTCCCAGCAACAGTCTGAGGAGGTTCATGAGTTCAGAATATT 
TTGATAATAACACTAAGATGGTATTTACTCTTCTAACTGGGTAGATATTTGCACTGGT 
Sequence 372 

CCGGGCAGGTACCGGGTCTAGGGAAGATGCAGAACACTTCAGCCTGGCAGAAGGCTCTAA 

AGATCGTCAACCTTTCCTCTGTTCATTTTGAAATTTTTAAAAATATGCTAACTTAGCTGA 

AAATCTCATGGAAACCAGGCTCCTCATCAGACTTGAAAGTCAAACCGGTTTCTCAACAAC 

TTCTCTTTATGGTTCTGTATGGCTCCACAGAAAACCAGAAAAACATTTGGGGCAAGAAGC 

TATGACTCTGTGAGGCCACATGGGAGCAGGCAGTCAATTATTACACTAAGGAACACCCAG 

TTAGCATGAAGTATCCCATCACCTCGGTATTAAGCCCTGCATGCATTAGCTATTACCT 

Sequence 373 

AC I I I I I I I I I I I I I I IGCTCAATAGAAGTATGGAATAATTCCAGGTAATTTAAAGCATA 
1 1 l I iCAATTGGTGTAAGCTGCTCCATGAAGTCAGCTAGCTCCTCTAATTGGGATGGTTC 
TTCATCACACGGCATGTTCTCAGAGTCTGATGACAGAGCATCAGTGTGTGGTCCCAGCAC 
CCCCTCCTGTGCGGACTTCTGGGCATCCTCCTCCAGATACTCAATACTCTTGAGGGCCTG

AGGMAGTCTCTATGAAAGGTCTTGCMTTTTGGGTGCAATGGTTTCCGTGACAGAAGGT 

TCCTGAGAAAGCACCACAAACTCCTCAGCTTTGACCGGGAAGCCAGCATCACGGACGCGT 

GGGTCGAAGCTTGACCT 

Sequence 374 

CCGGGCAGGTACCGCTACTGAAATTATTAAACATACACTACAGATCAATTATATAANTAT 

GTTAATATCTTTAGAAATCAAGAGTTGCAGCATAAGAGAAAGGGATACAAAAACAAAACA 

AGCAAAGAAGTTACATAAAAACGTAACGTTGTATTGGAAAAACCAGTATGAACTTATGAT

TTAGTTTTCTTTCTAAAAACGGACGCGTGGGTCGAAGCTTGACCT 

Sequence 375 

NGCGNTGCGCTCACCTGCCCNCTTTCCANTCGAGGNAAACCTGGTCGTGCNAGGGTGCNA 

NTMTG MTTC GNCCAAACNCCNCCGGNNGAGNAGGCGGTTTTGCGTNATTGGGGCCGCT 

ATTCCNCTTTTCTCGCTCACCTGACTTCGCTGCCGCTCGGTCGNTCGGCTTGCCGGCCGA 

AGCCGGGTAATTCAAGCCTCCACTCAAAAAGGGCCGGGTAAATTACCGGGTTTNTTCCAC 

CMNGAAATTTCAAGGGGGGGATTAAACCNCNAAGGGAAAAANGAAACATTGTNNGANNC 

AAANAAAGGGCCCNNNCAAAAANNGGGCCCANGNNGAACNCNCCGTAAAAAAAAAG 

Sequence 376 

CCGGGCAGGTACACTCCAGCTCCTCTATCCTTTGTTCCAGGTGAGCCTTNTCATTAAATG 

CTGTCTCTTGAGCMTCTTTAGCCTCTCCCTTAGACTATCTCGTTCTGTGGTCATTCTTC 

GTAAATCAGTAAAGGCTACATCTCTTTCAGTCTCCACTCGCCGGAGAATAGCATGTGCCG 

TTGTTGATTTAGGACTCTTACAGCTTTTCATCATTTCTCGTCGAAGTCGGGTAATTTCTT 

CCTGTGCCTGTTCATAAAGAAGGAAGATCTTGTCTCTCTCAGATTTAAGAACCTTGACAT 

TACCCTGAATTTCTGCCAAATCGGACGCGTGGGTCGAAGCTTGTACACCTnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnAAAATNGCNCNC 

TGGGGCNAAATCNNGGGAAAATANGCTGTTTTCNTGGGAAAMMTTGTTTTNCCCCGCA 

ANAAAAATCCCCCACAAAAATTNTANNGNNNGGGNGAAAAAAAAAAGNAAAAAANNGNGG 

GGGGGGGCCCCCA ANNGG GGGGNNGCCCNCNCCCCTNATANTTNATGGNGGGGGGGGGCC 

CTCNANNNGGCCCCTTTTTNNNNMNNGGGAAAMCCTTNGGCGGGNCCCNTTTNTATNA 

AN AAAAA ACC CNCCC CCCCCCCCNNGGGGGGGGNGGGGNGGTTTNGNTTATTGGGGGGCG 

CNi I M l CCNNTTTTCCNAANAAAAANAAAANANNCGGNGNCCNGGAGGGGGAAANGNGG 

GGGGGGGG 

Sequence 377 

CGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCGTCCGTGATG 

CTGGCTTCCCGGTCAAAGCTGAGGAGTTTGTGGTGCTTTCTCAGGAACCTTCTGTCACGG 

AAACCATTGCACCCAAAATTGCAAGACCTTTCATAGAGACTTTCCTCAGGCCCTCAAGAG 

TATTGAGTATCTGGAGGAGGATGCCCAGAAGTCCGCACAGGAGGGGGTGCTGGGACCACA 

CACTGATGCTCTGTCATCAGACTCTGAGAAPATGCCGTGTGATGAAGAACCATCCCAATT 

AGAGGAGCTAGCTGACTTCATGGAGCAGCTTACACCAATTGAAAAATATGCTTTAAATTA 

CCTGGAAATATTNCATACTTTNTATTTGNGGCNAAAAAAAA 

Sequence 378 

GCCGGCCCGAGGACTCCAGCCTGGGCAACAGAGCAAGACTCCATCTCAAAAAAAAAAAAA 

AAAAAAAAAGTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnGGGGGCCCCGGGTACCCAAGCTTTTG 

TTCCCTTTAGTGGAGGGGGTTAATTTGCCGCCGCNTTGGGCGTAAATCAATGGGTCATTA 

GCTGTTTCCTGNGNTGGAAAAATTGTTATCCCGCTCACCAAATTCCACACCAAACCATTA 

CCGGAAGCCCCGGGNGNAGCCATTAAAAAAGGTGTNANANAGCCTTGGGGGGGGTGCCCT 

ANATN GGM GGTGGAGCCTAAACCTCACCATTNAATTTGGCCGTTGGCGCCTCACCTTGC 

CCCGCTTTTCCAAGGTCCGGGGGGAAAAACCTGGTCCGNGGCCAAGCTTGCAATTAAATT 

GGAAATTCGGGCCCA 

Sequence 379 
GAATTGNAGCTCCCCGCGGTGGCGGCCNNCCGGNCAGGTGGAAAGGTGGGTGGGGAGAGG

GAGGCTTATTTGTTGCTGCAGTGTAACTAAGTGAAACCTAATTCATATGACTCAAACTAA 

GGTATATTTGGTTAGATCTAGGTGAGTTCTACTTTAGAGGAAATCCTGGTAACTGTTGTT 

TGTTTGTAAGTTATAGCTGTAATTAATTTTCCCTGTATTCAAAGCCCCCAAACCCTGCAT 

TCAGATACTATGCATTTAGACTTCCTTAGGCAAAGTCAAGGCAACAAGCTGATGATTCTA 

AGCTATTATTCAAGGAGTATCTACCATCATAAAGGTGGTTTAAGTCATATAGGATAATAT 

CMTCAATAATACAGGGAGATGGCAAAMT7TTTTGGGNAAANCCCAATATTANCTTGGG 

TTTNATGACCCCCNAATCTCACACTTTGGGGNCNTATGGGAAAGGCTTTTTTTAAGACCC 

GGGAGTTCAAGACCNGCCCTGGGCAACATTAAAAAACCTCCTTTNNNCCAAAANCTTTAA 
AAAA 

Sequence 380 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTTACMGTCGACCCAC 

GCGTCCGCTTCAGMTATCCAATTCATGTGAACTACAGGAAATTATAGTTTAGATATTTT 

TAAATGATTTGCCTGTCACCGTATAACACAAGGGTGTCATGACCAAGCTAGATCTCTTTA 

CCATATCATTAATAMAGTCMATTTTAAATTTGTGCCCAATTTGGCTGGGTGTGGTGGC 

TCATTCCTGTGATTCCAGCACTTTGGGAGACCT 

Sequence 381 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCC 

GCATTTATTAAGGCTTGTATATGTTCAAGATCCAGTGAAGACTGTCTTGGGCGTGTATAA 

TTGATCTTAACCACAAGGCTGAGAAGTTATGTGCAGGGCTTATGATGCTACTTCCAAAGT 

ATTAAATCCTCCAGAGAAGCCTGTAGTGTGGGATGCAAACTATTTTAAGTGTGACCATGA

GGTG I I I I I I I GTGGACCATTTTAAAGCCAATGATAGGTTCTAAAGCAATCTCAACCTGA 

GTTAGGTAGAATGGGTTGGTTATCTGCACTCTAGCGGCCCTTCATAGCTATTGTATTCTG 

GATTTCAATTCGGCACTTTATGTATTAGCTAAAAATTTCATGACCAGATCTTTTGAAGTA 

TACAAAGTAAATCTTCAAGGTGGATAGTTTATCCAAGTGTAAAATGTGTTGCACTAGGTC

AGCTTGGMTTTTGAGATGACTTTTGGCATCATTGCATACATCTGGTTTGTGTACCTGCC 

CGGGCCGGCCGCTCTAGAACNGTGGATCCCC 

Sequence 382 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTATTGACTAAAGTCA 

GTTGGGGGAGAGAGAGGCGGAAGTATATTACTTTTATGCTTGGTTATACTAGAGAACAAA 

TNGAAACTGACTAAAGAAACATTAGATCAGTGGTTCTCAGAGTATTGATATCTGGGAGTC 

CCAGCAACAGTCTGAGGAGGTTCATGAGTTCAGAATATTTTGATAATAACACTAAGATGG 

TATTTACTCTTCTAACTGGGTAGATATTTGCACTGGT 

Sequence 383 

TTAGGGCGAA7TGGAGCTCCCCGCGGTGGCGGCCGAGGTACATATCTGCATATACACCAT 

TAATTTTACATCGTTGAGGTAGCCAAAACTGCTCGTAAGTGGGCTTTTATTAAATAATAT 

AATGTTCTTAATAGAGGAAAAAGGAATTGAATACATTTTTAAAAACAAAATAACAAAACC 

AATCCATTGTCCCACAAAAGAAAATCAGTGGAGACAAAAGCAGTTTAATTTGCTGGATTC

TTTTGTGGCTTATTTTITGAGTATTATTTACAAAATGTTAGACTAATTTTTAAGCAATAT 

TAATAATAAGCAACATACAACTCCAAGAATAATATAATAAATAATAAACTGCGGACGCGT 

GGTTCGAAGCTTGCTCGNNGGGGGCCGGGNCGCTTCNAGGCCCNCCCGGGCAGGTACCCA 

GTNATCACATAAATTCTGCMTCATNTGG^ATTNAGCTTNACNTGNTTTTTTTATTTGN 

NGAANTTGTTGTTGTATTCAG 

Sequence 384 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTTACAAGTCGACC 

CACGCGTCCGCTTCAGAATATCCAATTCATGTGAACTACAGGAAATTATAGTTTAGATAT 

TTTTAAATGATTTGCCTGTCACCGTATAACACAAGGGTGTCATGACCAAGCTAGATCTCT 

TTACCATATCATTMTAMAGTCAMTTTTAAATTTGTGCCCAATTTGGCTGGGTGTGGT 

GGCTCATTCCTGTGATTCCAGCACTTTGGGAGACCT 

Sequence 385 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCAC I I I I I I I I 1 1 I II I I I I I 
TTAAGAAATAGGGTCTCACTCTGTCCCCCAGGCTGGAGCCATTATAGCTCACTACAGCTT

CTGACTCCTAGGCTCAAGGGATCCTGCCACCTCAGCCTCCCTGGTAGCTGGGACTATAGG 

CAGGAGATCGCTTGAACCGGGAGGCGGAGGTTGCTGTAAGCTGAGATCGCGCCATTGCTT 

TCCAGCCTGGGTGCCAGAGCAAAACTCTGTCTCAAAAAAAAAAAAAATAATAATAATAAA 

TAAATAAAAAGGCAAGGAATATAGGGAAAAGTCAAAAGAGATGGACTGTGAGAAGACTGG 

GAAAGCCAGAAGAATGGNGGAAAATGTAGCATGGAGTAAGACAATAAAAATATAAGAGGA 

CTCATTTCGGACGCGTGGGTCGACTCACCTCGGCCGCTCTAGAACTAGTG 

Sequence 386 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTTACAAGCTTCGACC

CACGCGTCCGGGAAATTTTAATTAAAAATAGGTGAACATTTTAAATGACCTAATACATAT 

TTAGTCCACATTGAMCTTTGGCATTTTGTCATTGCCATTAAAATTTTGATGGCATTAAA 

ATTTG ATGCCATTAAAATTTTGATCAGTAGGTAGCATTTTTTTCTTAGCTACMTTGTTT 

I 1 1 I IAATTATMGTATTAAAAATTCATGAAGATGATTCTTTTTGTAAAACAGTTTTGCA 

TAAAAAGTAAGTCTCATTTTAAAGCAACTACCAACTTACTGGCCACCT 

Sequence 387 

AGGTCTTCGACCCACGCGTCCGATGGTTTTTGCAAAAATTGAAAATGCATCGATATTACA

GTTAATTTTTTCAGTGTGTATGTGGTATTAGGCTTAGAACTATAACACAGGAAGTTTTTA 

GAGTATGTCCACTCTGGTTTACTCCTTTGTAAGTATTAATACCTGATAATTTACATCCTA

CAGCCCTGCCI I I I I I I I I I 1 1 I I I CAAGTTTGTCCCAGCAAGTCTTGGCCCTTTGCATT 

TTCTTAATACATTTTAGTACCTGCCCG 

Sequence 388 

CCGCGGTGGCGGCCGAGGTACAAAGAACAAAGGGAAGCTAAGGAAGAAAAGATAGTCAAT 

AAAAGATGTCTCATCTGGGCTTAGTGGCTCATGCCTGTAATCCCAACACTTTGGGAGGCT 

GAGGCTCGAGGACTGCTTGAGTCCAGGAATTTGGGCAAGTAGGAAATTACTGAACAGCTG 

CTATCACAGACAAATGCCTAACATTGTGAAGTGCTACACAGGGGAAGGAGACCCACGCTA 

AGAGGAGAGCATGCACCCAGACACAGAACTCAGAGGACACAGTTCAAAACACACATACAA 

GAGGCTTAGGCACCTGTGGGCGTGTGTGTGCTCACAGCCAGCAAAATGAAAAAAAATCCC 

AGCTCTGAAGGAGAGGCAAGTGCATGGCTTCCGTACCTGCCCG 

Sequence 389 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCCGA

TGGTTTTTGCAAAAATTGAAMTGCATCGATATTACAGTTMTTTTTTCAGTGNGTATGT 

GGTATTAGGCTTAGMCTATMCACAGGAAGTTTTTAGAGTATGTCCACTCTGGTTTACT 

CCTTTGTAAGTATTAATACCTGATAATTTACATCCTACAGCCCTGCC I I I I I I I I I I I I I 

TCAAGTTTGTCCCAGCANGTCTTGGCCCTTTGCATTTTCTTAATACATTTTAGTACCTGC 

CCGGGCGGCCGGCCGCCCGGGCAGGTACNACTACCTCTTTAAAGTTGTCCTTATTGGAGA 

TTCTGGTGTTGGAAANAGNAATCTCCTGTCTCGATTTACTANGAAATGAGTTTAATCTGG 

AAAGCAAGAGCACCATTGGAGTAGAGTTTGCAACANNGANGCATCCAGGTTGATGGAAAA 

Sequence 390 

TCCCCGNGGTGGCGGCCGAGGTACTATTGACTAAAGTCAAGTTGGGGGAGAGAGAGGCGG 

AAGTATATTACTTTTNTGCTTGGTTATACTAGAGAACAAATAGAAACTGACTAAAGAAAC 

ATTNNATCATTGGTTCTCAGAGTATTGATATCTGGGAGTCCCAGCAACAGTNTGAGGAGG 

TTCATGAGTTCAGMTATTTTGATAATAACACTAAGATGGTATTTACTCTTCTAACTGGG 

TAGATATTTGCACTGGT 

Sequence 391 

AGGG CGAATTGGAGCTNNCCGCGGTGGCGGCCGAGGTCTTCNACCCACGCGTCCGATGGT 
TTTTGCMAAATTGAMATGCATCGATATTACAGTTAATTTTTTCAGTGTGTATGTGGTA 
TTAGGCTTAGMCTATAACACAGGAAGTTTTTAGAGTATGTCCACTCTGGTTTACTCCTT 
TGTAAGTATTAATACCTGATAATTTACATCCTACAGCCCTGCC I I I I I I I I I I 1 1 I I I CA 
AGTTTGTCCCANCAAGTCTTGGCCCTTTGCATTTTCTTAATACATTTTAGTACCTGCCCG 
Sequence 392

CCGCGGTGGCGGCCGCAC 1 1 1 I I I I I II I I I I I I I IACCTGAAAATGCTTATTCTAGC7T 

CACATTTGATTGTTTGGCTMGMGAAMTTATTTATTAGACTTAATTTTCCTCACGAGT 

TTAAAGATTGCTTCAGATCTTAAACTTCTAATGAGGAAAGCTGAGAAGTCCAATGCCATT 

CTGATTCTTGCMCTTACAAGTAGTCTTTTTTTGTCTAGACGCTTTCAGGACCTTCTTTT 

TTCCTCAGTCAGTGTATCCAAACCTTCACAGTGATATCTTTTGGGTACCT 

Sequence 393 

CCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCCGCATTTATTAAGGCTTGTATATGT 

TCAAGATCCAGTGAAGACTGTCTTGGGCGTGTATAATTGATCTTAACCACAAGGCTGAGA 

AGTTATGTGCAGGGCTTATGATGCTACTTCCAAAGTATTAAATCCTCCAGAGAAGCCTGT

AGTGTGGGATGCAAACTATTTTAAGTGTGACCATGAGGTGTTTTTTTGTGGACCATTTTA 

AAGCCAATGATAG 

Sequence 394 

GGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTCTTCTACCCACGCGTCC 

GCACATTTTGATGGTCAGTCAATAACTTAAGCAGTTACCAAAATACTAGGTATCCAAGGA 

GCGAGAGGTGGGCGAGCATAAGAAACACATTTCTCATGGCACAGCTCTGCCAAAGCCCTG 

CAGAATCATTTACACATAGGTCTTTGGTTAGTAGCCCCTGGCACAGAATTCTGATCTTAA 

ACAAATATTGTCTATAATCAAGTAGAGCAATGCAATTAAAAAAAAAAAGCACAGGTTTTG 

GGGCCATGCTGAAATCCCAGCCTTGCTATTTGCTGGCTGTGTGACCGTGGTTCCTTGGTC 

TCATTATGCTTTGGTTCCCGTATCTATAAAACGGACGTAATAATGTCTCCCTCTCATTAT 

TGTGAAGTCGAAATGATGTCTGTAAAGTGCCCAACACAGTACTAAAGGGCTATT 

Sequence 395 

CCGCGGTGGCGGCCGCCCGGGCAGGTCAAGCTTCGACCCACGCGTCCGTTTAAGTAACAT 

TCAGATTTGTGTGTGTGGAGAGGTTGTAGGGAACAGAATTGTAGGAAGGTGCTCACACCT

G I I l I I GTTTGTTTGTTTTATGTATATATGGTGGGTAGAAAATAAGGATTAATGAATGCA 

GTAAGGTATTTGAGCACTCTTGTTTATCTTGTGTAGGTGCCAACCAAATATTTTTATAGA 

GATGTGGTTAAGCCTCTTGGCATGTTCAACTGTGTACCT 

Sequence 396 

CCGGGCAGGTCTCTTGTCTAGTATACTCAAGGCAGCCTAGTAAATTATTATTTATCTATA 
CAATACTGGAAAAACTTGTAGACAAAAACATGACTTGAATTGCTAAAAAAAAAAAAAAAA 
NGANGGAGAATGAAAACTTCCGGACGCNTGGGTCGAAGCTTGACCT 
Sequence 397 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTCGACCCGCGCGTCCGCATTT 

ATTAAGGCTTGTATATGTTCAAGATCCAGTGAAGACTGTCTTGGGCGTGTATAATTGATC 

TTAACCACAAGGCTGAGAAGTTATGTGCAGGGCTTATGATGCTACTTCCAMGTATTAAA 

TCCTCCAGAGAAGCCTGTAGTGTGGGATGCAAACTATTTTAAGTGTGACCATGAGGTGTT

l 1 1 I I GTGGACCATTTTAAAGCCAATGATAGGTTCTAAAGCAATCTCAACCTGAGTTAGG 

TAGAATGGGTTGGTTATCTGCACTCTAGCGGCCCTTCATAGCTATTGTATTCTGGATTTC 

MTTCGGCACmATGTATTAGCTAAAMTTTCATGACCAGATCTTTTGAAGTATACAAA 

GTAAATCTTCAAGGTGATAGTTTATCCAAGTGTAAATGTGTTGCACTAGGTCAGCTTGGA 

ATTTTGAGAT 

Sequence 398 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCGCCATGTTGGCTGGGCTGGT 

CTCGAACTCCTGACCTCAAGTGATCTGTCCTGGCCTCCCAAAGTGTTGGGATTACAGGCT

ACTAAAGCTTTTTATTTATTTCTGTGGATTTGAGTTATTGTGTAGTGTCATTTATTTTTC

A TATG MGGA TTCCT TTTGGTATTTTTTGAAGAACTTCTTTATCAGAATTAATTATCTCA 

CTTTTGMAAT TmA ACMT CTAGA AACGTCTTTTTTGTTTTTTGMG 

GGATGTAAMTTTTTGGTTAATTTTGTTTTTCTTTGMCACTGAATATTTAATCTTATGC 

CTTCTGGCTTTCAGTACCTGCCCG 

Sequence 399 

CCGCGGTGGCGGCCGAGGTAGCTTGAGTCGACCCACGCGTCCGTTCAGATCCGTTTCAGA 







AACGTGAGTCTCTAGCTCAGGAGATTTCCACAACTGTCCTTAGTAACCTGATCTTATTCT 

CATGTTTAACCTTGGCAGTGGGAAGTTCTTCCTGGTATCCTGCCTAATTTACTGGAGTTG 

GCATTAATGCCATTTCCCCCTAAGGCGTGGCTCTTGGACCAGTATCACCTGAGAATTTGA

TAGACATAGACCCAGAGTTACTGAGGCAGGTGCTCTGTTTTGGGGACCAGCAATCGGTGC 

TTTAGCAAGTTCTTTGGGTGATAGGGTTTGGAAACTACTGCTCTAAAGCATCATCTGTTT 

TGACTTTGCCATGCACAATCTGAACTCACTCCCGTGAGGCCCTGCTCCTGATACTTTAAA 

TCGTCCTGTCTC I I I I I CTGCCTCTCTGTGGAG 

Sequence 400 

CCGGGCAGGT ACAGGC ACCTATA GAAT TTMAGGGGAGATTTCTTTATTTTGTATTCAAT 
GTATTAATAAGAl 1 1 I I A AAACATATTT TGGAGAAATTGCTAATTAGTGTATAATCCTGA 
TGCCAATTCTAAAAAACC I I I I 1 1 I I I I I I GTAGAGACAGGGTCTTATTCTGTCACCCGG 
GCTGGAGTGCTCTGGTATGATCCTAGTTCACTGCAACCTCAAATACCTGGTCTCAAGCAA
TCCTCCCACCTCAGCCTCCCCAGTAGCTGTCTCTATAAGCATGCACCACCACACCTGGCT
MCCTTCTTATTATTTTTGGTAGAGACAGTCTCACTATGTTGCCCAGGCTGGTCTTGAAC 
TCCTAACCTCAAGCAAACATCCCTCCTCGTGCTCCCAAAATGCTNGGATTACCAGCATTA 
AGCCTTACMGCATAAGCTACCATGGACTGGCTTTCNAAAAAATATTTGGTTTAAAATTC 

Sequence 401 

CCGCGGTGGCGGCCGCCCGGGCAGGTGGAAAGGTGGGTGGGGAGAGGGAGGCTTATTTGT 

TGCTGCAGTGTAACTAAGTGAAACCTAATNNATATGACTCAAACTAAGGTATATTTGGTT 

AGATCTAGGTGAGTTCTACTTTAGAGGAAATCCTGGNAACTGTTGTTTGTTTGTAAGTTA

TAGCTGTAATTAATTTTCCCTGTATTCAAAGCCCCCAAACCCTGCATTCAGATACTATGC 

ATTTAGACTTCCTTAGGCAAAGTCAAGGCAACAAGCTGATGATTCTAAGCTATTATTCAA 

GGAGTATCTACCATCATAAAGGTGGTTTAGTCATATAGATAATATCAATCAATAATACAG

GAGATGGCAAAAATTTTTTGTGAAGAGCCAGATAGTANCTGAGTATGATGACCCCTAATC 

TCAGCACTTTGGGAGGCTGATGGGAGAGGGTCATTTAAGACCAGGAGTTCAAAGACCAGC 

CTGGGCAACATTAAAAAACTCCATTTCTACCAAAAACTTTAAAAAAAATTAGC 

Sequence 402 

GCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCGTCCGGGGAGGACCTAGGCAACGGCC 

TGAGACTCCGAGACTCTATGTTGAAGATGCCTGGACTAACCTACTGAAGATACGTGGTTT 

TACCAACAGCCAGCACCAATAGGAAGATATGAATGAAGCCATCTGAGACCAGCCATCTGG 

CAGCCAAACTGCCAACTGACTGCAAATGCATGAATGATCCCACTGACACCACGTAGAGCA 

CAAATGAGTTGCCTCCACTGAGCCCAGCCCAAATTGTTATCCTATAAAATCATAAAAACA

TAAACAGTTGTTTTAAGTCAAAAAAAAAAAAAAAAAAAGTGCGACCTGCCCG 

Sequence 403 

TACTATAGGGCGAATNGNAGCTNCCCGCGGTGGCGGNCGAGGTATTCAACAAGGGCCCTG 

AGAGAGGGACAGGCAGCCCCTGTGAATCTTGCTGTTCAGCAGAGACAGGAGTCAGCACGT 

GTGAGGGCAGCAGGGAAGTCTTCCTGGAGGAGTGAGACCTGGCGATGAGGAGGCACGGCA 

GGGAGGTGGAACAGGCAGGAGAGACTCTTCAGGAATTGAGGAGATAGAATAGAGGACACT 

AAAGCCTTAGAGAGGCCAGGGGTGGTGGCTTGGCAGGATCATCGCTTGAGGCTAGGAGTT 

TAAAAGCAGCCTGGGCAACATAGCGAGACCCCATCTCTAAACACAAAAAATAAAAACCTG 
CCCG 

Sequence 404 

CCGCGGTGGCGGCCGAGGTCAAAGCTTCGACCCACGCGTCCGTGATGCTGGCTTCCCGGT 

CAAAGCTGAGGAGTTTGTGGTGCTTTCTCAGGNACCTTCTGTCACGGAAACCATTGCACC 

CAAAATTGCAAGACCTTTCATAGAGACTTTCCTCAGGCCCTCAAGAGTATTGAGTATCTG 

GAGGAGGATGCCCAGAAGTCCGCACAGGAGGGGGTGCTGGGACCACACACTGATGCTCTG 

TCATCAGAGTCTGAGAACATGCCGTGTGATGAAGAACCATCCCAATTAGAGGAGCTAGCT 

GACTTCATGGAGCAGCTTACACCAATTGAAAAATATGCTTTAAATTACCTGGAATTATTC 

CATACTTCTATTGAGCAAAAAAAAAAAAAAAAAGTGCGGCCGCTCTAGAACTAGTG 

Sequence 405 
TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCGTCC
GGAAGTTTTC ATTCT CCCTC I I I I III I I I I I 1 1 1 I I 1 1 I AGCAATTCAAGNCATGTTTT 
TGTCTACAAGTTTTTCCAGTATTGTATAGATAAATAATAATTTACTAGGCTGCCTTGAGT 
ATACTAGACAAGAGACCTGCCCG 
Sequence 406 

TNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCAC I I I I I I I 1 1 I I I I I I I AC 

CTGAAMTGCTTATTCTAGCTTCACATTTGATTGTTTGGCTAAGAAGAAAATTATTTATT 

AGACTTMTTTTCCTCACGAGTTTAAAGATTGCTTCAGATCTTAAACTTCTAATGAGGAA 

AGCTGAGMGTCCMTG CCATTCT GATTCTTGCMCTTACMGTAGTCTTTTTTTGTCTA 

GACGCTTTCAGGACCTTCTTTTTTCCTCAGTCAGTGTATCCAAACCTTCACAGTGATATC 

TTTTGGGTACCT 

Sequence 407 

CCGCGGTGG CGGCC GCAC I I I I I II I I I I I I I I I I I I CTCTTATATTGAAGTAAAATTTA 
AAATTTAATACI I I I I ATTTTTTAAAAG C ATGTATGG CATCATTTCAGTCTTATTAAATT 
CTCTCTGCATCCATTCACCCATCCTTCTTTTTGTGTGTGTGTGTAGTGGTCTCTGTGAGA 
GGGTTCATTAATGTCMTCCTGATCATTTCTTCCTCAAGAGATGTCAGTAGATTTGTTTT
TTTTGCTTTGGACTTTTATGAATTGATTGAATTTTTATGCCAATTATTTTTAAAGTATTA 
CATAGAAGAACAAATGGACAGAAAAATTTAAATGCAATCAAATCTTGTTGATTTTGAAGT 
ATAGGAAATAATC I I I I I I I I ATTATACTTTAAGTTTTAGGGTACCTGCCCGGGCGGCCG 
CTCTAGAACTAG 
Sequence 408 

CCGCGGTGGCGGCCGAGGTACTATTGACTAAAGTCAGTTGGGGGAGAGAGAGGCGGAAGT

ATATTACTTTTATGCTTGGTTATACTAGAGAACAAATAGAAACTGACTAAAGAAACATTA 

GATCAGTGGTTCTCAGAGTATTGATATCTGGGAGTCCCAGCAACAGTCTGAGGAGGTTCA 

TGAGTTCAGAATATTTTGATAATAACACTAAGATGGTATTTACTCTTCTAACTGGGTAGA 

TATTTGCACTGGT 

Sequence 409 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTTACAAGTCGACCCAC 

GCGTCCGCTTCAGAATATCCAATTCATGTGAACTACAGGAAATTATAGTTTAGATATTTT 

TAAATGATTTGCCTGTCACCGTATAACACAAGGGTGTCATGACCAAGCTAGATCTCTTTA 

CCATATCATTMTAMAGTCAMTTTTAAATTTGTGCCCAATTTGGCTGGGTGTGGTGGC 

TCATTCCTGTGATTCCAGCACTTTGGGAGACCT 

Sequence 410 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGACCAGTGCAAATATCTACCCAGT 

TAGAAGAGTAAATACCATCTTAGTGTTATTATCAAAATATTCTGAACTCATGAACCTCCT 

CAGACTGTTGCTGGGACTCCCAGATATCAATACTCTGAGAACCACTGATCTAATGTTTCT 

TTAGTCAGTTTCTATTTGTTCTCTAGTATAACCAAGCATAAAAGTAATATACTTCCGCCT 

CTCTCTCCCCCAACTGACTTTAGTCAATAGTACCT 

Sequence 411

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACGCGTCCGA

TCACi I I I I ICATTGATACCTTATTAGATAAAACATTAGCCCCCTAGAGTGNNNTGTGAA 

GGAAATATGCCTAATAAGAGATGATAGTTTTAGCAATAAATGAGCATTAGAACTATTATT 

TATTAATGAAATGAACTGGTGGTCTGAAAGTGATGATAAACAGACAACTGTGGAAAATGA 

ATTATTAAAATTCCATGGAATTCCTTTTGAAGTTTATGAAGTACCTGCCCG 

Sequence 412 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCCGC 

ATTTATTAAGGCTTGTATATGTTCAAGATCCAGTGAAGACTGTCTTGGG 

Sequence 413 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAACTTGTGATGCTT 

TTGGCAGGAATTACAGAACAACCAATGCCATTCAAGTTGTGGAGATTATACTNGCAGGTG 

AACTCGTAAAGAGAAGATTCTGGAATGCCTATATCTGAAAGCTTGAGTCGACACCTN 
Sequence 414

CCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCCGGAATGAGTGAGTGCAGAACTGGC 

AACCAGAAGACAGGAACAAGGCCTGGGAATGGAGCGGAAAGGTAGCTGCTATATATAGTT 

CCTTCAGCCAGTAACGATTAGAGCCAATAGCCATCTGGATGATGAATGGCTCCTAATTGC 

CTTAAATTACGGCAGTTAGCTAAGGGTTTCTGTTGCTACATGGGTTACCGTAGGCCGCTG

CACCCTGCATAACTGTCCTCAGGCCTGCGTCCCCTGAGTCTCAGCACTTGGGCCTCCACC

TGCCCG 

Sequence 415

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTCCCAAAGTGCTGGAA 
TCACAGGAATGAGCCACCACACCCAGCCAMTTGGGCACAAATTTAAAATTTGACTTTTA 
TTAATGATATGGTAAAGAGATCTAGCTTGGTCATGACACCCTTGTGTTATACGGTGACAG 
GCAMTCATTTAAAAATATCTAAACTATAATTTCCTGTAGTTCACATGAATTGGATATTC 

TGAAGCGGCCCCNTGGGTCGACTTTGTMCCTGCCCGGGCGGCCGNTCTAGAACTAGTGG 

GATCCC 

Sequence 416

TATGGCGAATTGGAGCTCCCCGCGGTGGCGGGNCGAGGTNAAGCTTCGACCCACGCGTCC 
GATTATTCTCTCCATTTAGGCTATAAATCTTTCAGTGTAGGGTGTTTCTAATGTCNTATT 
CTTCCAAAAAAAAAAAAAAAAAAGT 
Sequence 417

CCGCGGTGGCGGCCGAGGTACTCTTGATGTCATAAGATTAGAAAATGTGGTTAATTGTCA 

TCAACCCATTAAGTTCTTAAATGTCATTGAATGGAGTCCTTGTCATGTTACAGAGGAGCG 

TAAAATTGTGGTTAAACATTTTTTTAAAGATTACATGGTAGAGCCACAGTTTGTTATGCA 

GAAGGAAAATTTAGCAAATATTATTTTGCTTAATAGCCTTTAAAAAATCGTATAAATTTG 

ATTTGTAGTTTTATCCCCAGAGTCATTAGATTTTTCCAAAAAAAAAAAAAAAAAAAGGT 

Sequence 418 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTGCCAAAGCCT 

TM CTTAG ATTGTTACTTATGTTTCTAAATCTGNGGAAGCACATTTCCTTTTNTTNTNNT 

TTTCTTTTACTGTTMTATCCTTATTCTCTATTTTACCAGTGGAGAATGNTTAGTATTAA 

TTTCCATTTANCTCANGATTCMGMATGCAAAGTGCTATTTTTATCAAATTTCTGAAAG 

CCTACTGTCTTCTGNTTTGGAAGTCCCACAACAGCTCTTTAATTTCCTTAAGCCCCACTT 

TCCTCATCAGCAAGTTGGTGTGGCAATGGATCATAATAGGTTGCTGGGAGGATGAAGTGA 

GCGGACCGCGTGGGTCGAAGCTTGTACCTN 

Sequence 419

CCGCGGTGGCGGCCGCACTTTTTTrGTATTACTTCAACTTTTAAAAATTCTAAAGAAAAC 

CATCATCTCAGACCAGCATTTCCGGACGCGTGGGTCGAAGCTTGACCT 

Sequence 420 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTGTCAAACATCT 

CCCTCGTCCGGATCCTTCTAACGCAGGAGTCTCAGACGCAAATGCCGGCAAGGGCCAGGC 

AGGTGATGTAAGATGCGTGGAGCAGATGCCAAGCCACAGGGAGTGGTGGAGACTGGGGTG 

AACTGGAAAGCACCT 

Sequence 421 

CCGCGGTGGCGGCCGCCCGGGCAGGTACACAAACCAGATGTATGCAATGATGCCAAAAGT 

CATCTCAAAATTCCAAGCTGACCTAGTGCAACACATTTACACTTGGATAAACTATCACCT 

TGMGATTTACTTTGTATACTTCAAAAGATCTGGTCATGAAATTTTTAGCTAATACATAA 

AGTGCCGAATTGAAATCCAGAATACAATAGCTATGAAGGGCCGCTAGAGTGCAGATAACC 

MCCCATTCTACCTAACTCAGGTTGAGATTGCTTTAGAACCTATCATTGGCTTTAAAATG 

GTCCACAAAAAAACACCTNATGGTCACACTTAAAATAGTTTGCATCCCACACTACAGGCT 

TCTCTGGAGGGATTTAATACTTTGG 

Sequence 422 

GGTGGCGGCCGCCCGGGCAGGTGTCAAACATCTCCCTCGTCCGGATCCTTCTAACGCAGG 
AGTCTCAGACGCAAATGCCGGCAAGGGCCAGGCAGGTGATGTAAGATGCGTGGAGCAGAT 
GCCAAGCCACAGGGAGTGGTGGAGACTGGGGTGAACTGGAAAGCACCT
Sequence 423 

TNCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATAATATAC 

AGAGGTATAATCTGTAACATCAATAATGTAAAGTGGGGAAGGGCAAGGTGGAAAAGGAGT 

AGAATGCTTGTATGTGACTAAAATTATGTTGGTATCAGTTTAAAATATATTATTATAACT 

TTAGAATGCTATACCCATTCCCACAGTAATTCCCATAGTAACCAAAAAGAAAATATCTGT 

AGGATACACACAAAAGAAAATCAGAAGTAGATGCAAACTTGTCACTACAGGAAAAAAAAA 

GCTATCAAAATAGAAAACAATAATGGAGAAAATAAGACACCAAAAGCTATAAGACTCACA 

GAAAATAAATAATAAAATGGCAAAAAGAAGCGGACGCGTGGGTCGAAGACCT 

Sequence 424 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTGTACAAAGCTTCGAC 

CCACGCGTCCGATTTGGCAGAAATTCAGGGTAATGTCAAGGTTCTTAAATCTGAGAGAGA 

CAAGATCTTCCTTCTTTATGAACAGGCACAGGAAGAAATTACCCGACTTCGACGAGAAAT 

GATGAAAAGCTGTAAGAGTCCTAAATCAACAACGGCACATGCTATTCTCCGGCGAGTGGA 

GACTGAAAGAGATGTANCCTTTACTGATTTACGAAGAATGACCACAGAACGAGATAGTCT 

AAGGGAGAGGCTAAAGATTGCTCAAGAGACAGCATTTAATGAGAAGGCTCACCTGGAACA 

AAGGATAGAGGAGCTGGANGNCCNTCCCGGGGGCCGGGGNCNGCCCGCCCCNNGCAGGGT 

CANATGATTGCAGAATTTATGTGATTCCTGGGGT 

Sequence 425 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTACTGAGCCACTTA 

CAATTATTTCTGAAAAATCTCAGAGAACTGAGGATAGATCAGAAAATTTAAAGAAAGCAA

ATACCAMTTTTCAAACCAGGATAGGAAGTGAATACCTTGTAAATACACTTTGTTAAGTG 

ATGATMTTCTGAGTMMATTTAGMGATTTTGAGAAAAGCATTTGAACTTCTAGGGGC 

CMTAAAATACCATGCAGAAGAATGTTTAAAAAGTCATGCCAAATTTGAATCCATTTGAT 

CCTCAACCTCATCAGATGTTATATGCCAAACTACTTATTTTGGCTTAGATAATAATCATA 

TAGAATGAAACTTTCCACAAATAGACTGTGGTCAGTGGCTG 

Sequence 426 

CCGCGGTGGCGGCCGCCCGGGCAGGTCAAGCTTCGACCCACCGTCCGGCAATGATGAGCA 

AMACMGTTTGGTCCCCCTGTTATAGNGCCTGGTAMGGTTTTTGTTGTTGTTTTGCAG 

GGGTGGGGGAACCAGGAAATCAGATCATCACAACAATATATACTTATCTGTAACTATGGT 

AACTGCTACAGCAAAGGGGCGTATCATACTATTAGCATACTAAGTTTCACTTAAAGAGGT 
CGGA 

Sequence 427 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTGTCAAACATCTCCCTC 

GTCCGGATCCTTCTAACGCAGGAGTCTCAGACGCAAATGCCGGCAAGGGCCAGGNAGGTG 

ATGTAAGATGCGTGGAGCAGATGCCAAGCCACAGGGAGTGGTGGAGACTGGGGTGAACTG 
GAAAGCACCT 

Sequence 428 

TANGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCAAGCTTCGACCCAC 

GCGTCCGTGATAACTTCTCCTAAGTGCCAGGCATTGTATTACATGTTGGGAGCACAAAGA 

TGAATAATAACAATAGGTTCACAGAAAAGATGAATTGATTGAGAGAAAAAGAACCCTCCA 

GGAGCCCTCAGCGTAGTAGGGGGTTGGTGTTGGAGGGTGGAGGAATGGAAAAGGCCCTGA 

AATGCAGGCAGAGAAATGATGAAACAATTCAGGGGCTGTGGTGAGGTTAAATGAATATCT 

TTACAGCAGCCTCNAAGACTGATCAGGTTACTATACCCTCTCTTNTGTCCACNGTGCATT 
TNAA 

Sequence 429 

CCGGGCAGGTCAAGCTTCGACCCACCGTCCGGCAATGATGAGCAAAAACAAGTTTGGTCC 

CCCTGTTATAGAGNCTGGTAAAGGTTTTTGTTGTTGTTTTGCAGGGGTGGGGGAACCAGG 

AAATCAGATCATCACAACAATATATACTTATCTGTAACTATGGTAACTGCTACAGCAAAG 

GGGCGTATCATACTATTAGCATACTAAGTTTCACTTAAAGAGGTCGGA 
Sequence 430 







CCGGGCAGGTACACTCCAGCTCCTCTATCCCTTGTTCCAGGTGAGCCTTCTCATTAAATG 

CTGTGTCTTGAGCNATCTTTAGCCTCTCCCTTAGACTATCTCGTTCTGTGGTCATTCTTC

GTAAATCAGTAAAGGCTACATCTCTTTCAGTCTCCACTCGCCGGAGAATAGCATGTGCCG

TTGTTGATTTAGGACTCTTACAGCTTTTCATCATTTCTCGTCGAAGTCGGGTAATTTCTT 

CCTGTGCCTGTTCATAAAGAAGGAAGATCTTGTCTCTCTCAGATTTAAGAACCTTGACAT 

TACCCTGAATTTCTGCCAAATCGGACGCGTAGGTCGAAGCTTGTACACCTCGGCCGCTCT 

AGAACTAGTGGGATCCCCCGG 

Sequence 431 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGCCCGGGCAGGTACGATTTAAAATCTC 

CTCCTCCTACAGCGGTCGAGTATTGAAGCAGGTCTTTGAGGATGGGCNGGAATTAGAGTC 

ACCAAAGGAGGAATACCCTCACAGTTTTCTGCAAGAGTCTCTTGAAACAATGGATGGTGT 

TTATGGGTCTGGGGAAGACCCCNCGCCCCAMTGTTGCTCCCCT 

Sequence 432 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACACAAACACACACACAA

AGTTTAATATAC AAAAAATTTTATTGTATTGTTTTCTTGAAATAGGGTCTTGCTA 

TGTTGCCTAGGCTGGTCTTGAACTCCTGGGATTAAGCAATCCTCCCAACTAAGCCTTCCA 

AAATGCTGGCATTACAGGTGTGAGCTACCACAATCAGTCTCTTAGATTTTGTTTTTTAAG

AACAATTCGAAGTTTACTGCAAAATTGTGAAGAACGAACAGACTGTTCCCACATATCCCT

TTTTCTTTACACACCGGACGCGTGGGTCGAAGCTTGACCTGCCCG

Sequence 433 

CCGCGGTGGCGGCCGCCCGGGCAGGTCAAGCTTCGACCCACGCGTCCGGCTTGAGGTGGG 

TTTAGGAAACATTTGGTATCTNTGGCAGGGACAGATGTTGACCTGGCCGGTCGGCAGCTT 

TTACAAACCTAAGGACTTCAGGGTCCGGTTGCGCATGAGGACCGGGGAGGACAGAGCTGT 

TTGCAATAGGTGTGGGCTTTTATAGCATTGTGAGCATTTCACGTTAGCGTAAGTGTTGCT 

GCTGTGCAGGTGGTCTCTGGGGCTTACAATCTTCCCCAATGTTCTTCCCCACCCGTCCCA 

GCATTCTGGTGAACAAGCCTCTTGGGATTCTTTGAAAAAAAAAAAAAAAAAAACCT 

Sequence 434 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATAATATACAGAG 

GTATAATCTGTAACATCAATAATGTAAAGTGGGGAAGGGCAAGGTGGAAAAGGAGNNGAA 

TGCTTGTATGTGACTAAAATTATGTTGGTATCAGTTTAAAATATATTATTATAACTTTAG 

AATGCTATACCCATTCCCACAGTAATTCCCATAGTAACCAAAAAGAAAATATCTGTAGGA 

TACACACAAAAGAAAATCAGAAGTAGATGCAAACTTGTCACTACAGGAAAAAAAAAGCTA 

TCAAAATAGAAAACAATAATGGAGAAAATAAGACACCAAAAGCTATAAGACTCACAGAAA 

ATAAATAATAAAATGGCAAAAAGAAGCGGACGCGTGGGTCGAAGACCT 

Sequence 435 

CAGGTACAGGCACCTATATG MTTTA MCGGGGAAGATTTCTTTATTTTGTATTCAATGT 

ATNMTAAGATTNTTAAAACATATTTTGGAGAAATNGCTAATTAGTGTATAATCCTGATG 

CCAATTCTAAAAAACC I I I I I 1 1 I I I I I GNAGAGACAGGGTNTTATTCTGTCACCCGGGC 

TGGAGTGCTCTGGTATGATCCTAGTTCACTGCAACCTCAAATACCTGGTCTCAAGCAATC 

CTCCCACCTCAGCCTCCCCAGTAGCTGTCTCTATAAGCATGCACCACCACACCTGGCTAA 

CCTTCTTATTATTTTTGGTAGAGACAGTCTCACTATGTTGCCCAGGCT 

Sequence 436 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTATTCAACAAGGGCCCTGAGAG 

AGGGACAGGCAGCCCCTGTGAATCTTGCTGTTCAGCAGAGACAGGAGTCAGCACGTGTGA 

GGGCAGCAGGGAAGTCTTCCTGGAGGAGTGAGACCTGGCGATGAGGAGGCACGGCAGGGA 

GGTGGAACAGGCAGGAGAGACTCTTCAGGAATTGAGGAGATAGAATAGAGGACACTAAAG 

CCTTAGAGAGGCCAGGGGTGGTGGCTTGGCAGGATCATCGCTTGAGGCTAGGAGTTTAAA 

AGCAGCCTGGGCAACATAGCGAGACCCCATCTCTAAACACAAAAAATAAAAACCTGCCCG 



Sequence 437 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCACGGTACCACGTAGCAAC 
ATATGAGTATTTCTCTAGATAACI I I I I I I I I GACAAGGTCTCACTCTGTTGCCCAGGCT

GGAGTGCAATGGTGCAATCTTGGCTCACTGCAGCCTTGACCTTCCCTAGCTCAGCTGAAC 

CTCCCATCTCAGGACACCATTGCCTCCACTGCCCATCCTGCATCTGCCTGCCTACCCCAA 

AAGTGTTGAGAATACAAGCATGAGCCAGAGCCACGGAACCTGGCCTCTAGAGAGACTTTC 

TATTTTAGTTTT7TCTTCTCTTATTTGTGMGCCTTGAAAAMCTACTGTGGT 

GATTCTGGTTTGTGACTTTTTTAAATAAAC I II I I ATTTTGGAATMATTTATGTTTAGA 

GMTAGTTGCAAACATAATAMGTGAGTTTTCATAAACGCCTTACCAGTTTCCCCTGNTG 

GTTAACATTTTACATCACCATGCTGTTGCATTGGTCAAAACTA 

Sequence 438 

CCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCCGCATTTATTAAGGCTTGTATATGT 

TCAAGATCCAGTGAAGACTGTCTTGGGCGTGTATAATTGATCTTAACCACAAGGCTGAGA 

AGTTATGTGCAGGGCTTATGATGCTACTTCCAAAGTATTAAATCCTCCAGAGAAGCCTGT 

AGTGTGGGATGCAMCTATTTTMGTGTGACCATGAGGTGTTTTTTTGTGGACCATTTTA 

AAGCCAATGATAGGTTCTAAAGCAATCTCMCCTGAGTTAGGTAGAATGGGTTGGTTATC 

TGCACTCTAGCGGCCCTTCATAGCTATTGTATTCTGGATTTCAATTCGGCACTTTATGTA 

TTAGCTAAAAATT 

Sequence 439 

TCGAGGCCGCCCGGGCAGGTACACAAACCAGATGTATGCANTGATGCCAAAAGTCATCTC 

AAAATNGCAAGCNGACCTAGTGCAACACATTTACACTTGGATAAACTATCACCTTGAAGA 

TTTACTTTGTATACTTCAAAAGATCTGGTCATGAAATTTTNAGCTAATACATAAAGTGCC 

GAATTGAAATCCAGAATACAATAGCTATGAAGGGCCGCTAGAGTGCAGATAACCAACCCA 

TTCTACCTAACTCANGTTGAGATTGCTTTAGAACCTATCATTGGCTTTAAAATGGNCCAC 

AAAAAAACACCTTATGGTCACACTTAAAA 

Sequence 440 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCCTAAAACTTAAAGTA 

TAATAGTAATMTAMAMAGAGGTGCTTTTCTCCTAAGTCMCATTTTAGAGGAAAAGA 

GTCAATTCAAGCAATTATCACATATGTGTAACTGAAGCACATATGTGTAACTTTTCAAGA 

GTGATTAGATGGTCTGTTGTCTTTGAAGTGATAGTCAAATATCAGGTGTGTTCTAGGGAG

GTTGTGTAAGACTTTTGCTTGTATTCTCCCGGACGCGTGAGTCGACTCAAGACCTGCCCG 

Sequence 441 

CTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACG 
CGTCCGATTATTCTCTCCATTTAGGCTATAAATCTTTCAGTGTAGGGTGTNTCTAATGTC 
ATATTCTTCCAAAAAAAAAAAAAAAAAAGT 
Sequence 442 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTTACAAGTCGACCCACG 

CGTCCGCTTCAGAATATCCAATTCATGTGMCTACAGGMATTATAGTTTAGATATTTTT 

AAATGATTTGCCTGTCACCGTATAACACAAGGGTGTCATGACCAAGCTAGATCTCTTTCA 

TATCATTMTMAAGTCAAATTTTAAATTTGTGCCCAATTTGGCTGGGTGTGGTGGCTCA 

TTCCTGTGATTCCAGCACTTTGGGAGACCT 

Sequence 443 

CGCCCGGGCAGGTACACAAACCAGATGTATGCAATGATGCCAAAAGTCATCTCAAAATTC 

CMGCTGACCTAGTGCAACACATTTACACTTGGATMACTATCACCTTGAAGATTTACTT 

TGTATACTTCAMAGATCTGGTCATGMATTTTTAGCTMTACATAMGTGCCGAATTGA 

AATCCAGAATACAATAGCTATGAAGGGCCGNTAGAGTGCAGATAACCAACCCATTCTACC 

TMCTCAGGTTGAGATTGCTTTANAACCTATCATTGGCTTTAAAATGGTCCACAAAAAAA 

CACCTCATGGTCACACTTAAA 

Sequence 444 

ACNGNCAGGTACCAAGATTAAGGACAGAGTTCCCTCCATTGGTCATTGATTTGNAAACCA 

AAATGTATCTGTGACAGGTATTAATCCGGACGCGTGGTCGAAGACGAAAGGACACGAGAA 

ATANGGACCTANNCCGCTCTANAACTAGGNATCCCCNNNNCTGCAGGAATTCGATATCA 
Sequence 445

CCGCGGTGGCGGCCCGCCCGGGCAGGTGCCAAAGCCTTAACTTAGATTGTTACTTATGTT 

TCTMATCTGTGGMGCACATTTCCTTTTCTTCTTCTTTTCTTTTACTGTTAATATCCTT 

ATTCTCTATTTTACC AGTGG AGAATGTTTAGTATMTTTCCATTTANCTCAAGATTCAAG 

AAATGCAAAGTGCTATTTTTATCAAATTTCTGAAAGCCTACTGTCTTCTGCTTTGGAAGT 

CCCACMCAGCTCTTTAATTTTCTTAAGCCCCACTTTCCTCATCANCAAGTNGGTGTGGC 

AATGGATCATAATAGGTTGC 

Sequence 446 

CGGGCAGGTACCCTTATTTTCCCTGATACAGCACAACTCTGCCTATTCTAATCATGACCT 
AGACACATTCAATGAACTACAAGTTCTCCTTTACACGGACGCGTGGGTCGACTCCGGACG 
CGTGGGTCGAAGACCTCGGCCGCCT 
Sequence 447 

CGAGGTCTTCGACCCACGCGTCCGCTTTCTTTTGTTGTTAAGGACCCATGACCTGCAGTT 

TCCCTMCATTCATTTTTATACAGGGCAGAGGTATGTGTGCGAGCTCAGATACCTTAAAT 

TCATATGCCTTTAATACAATCAGGCAGATTTCTAAATGAGGGATGCTTCCCCACAAATGG 

AGAGTGAAAGTGGGCCAGCCTAAAAGGACCTCCATAGCACTGTGCATGGCCAGCTGTTTG 

TGGCTGTACCTGCCCGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnATCATGNTCATAGCTGNTTNCTGGG 

TNAAAATNGNTANTCCGNTCANAATTTTCACACANTCNTANNAACCGTTNCCTTNAATTG 

TTNANNCCNGGGGTNCCNAATTGGGNGNNCTNACTNAAATNAANTNGGGTNGCTTNT^^T 

GNNNNNI I I I IANNTNNGTTTANCCTTNTNNNCANNTTNNTTNTTNAAATTNGNNAAANN 

CCNNGNGGGGAGGGNNNNTTTNNNNNNTNNNGNNANNNTNNNNGNNNNNNNGGNNNNNNN 

TTNATTNNNNTCTNTNNNNNTNNTNNNNNNTANNNNANTTTNNNNTTNNNNNNNNNNGGN 

NNNNNNNNNNNNNTNNANNTTTANNNNAANNNNGGNNNNTNNNNNTTNNANTNNNNTTNG 

NGTNGTTNTNTNNNNNNNANNNTNTNTTNNNNNNNNNNNNTNTNNNNTTTNNTTTNNNNA 

NGNNNTTNTTN I I I I I I I I 

Sequence 448 

GCGGCCGCCGGGCAGGTTACAAGTCGACCCACGCGTCCGCTTCAGAATATCCAATTCATG 

TGAACTACAGGAAATTATAGTTTAGATATTTTTAAATGATTTGCCTGTCACCGTATAACA 

CAAGGGTGTCATGACCAAGCTAGATCTCTTTCATATCATTAATAAAAGTCAAATTTTAAA 

TTTGTGCCCAATTTGGCTGGGTGTGGTGGCTCATTCCTGTGATTCCAGCACTTTGGGAGA 

CCT 

Sequence 449 

CGCGGTGGCGGCCCGACCAGTGCAAATATCTACCCAGTTAGAAGAGTAAATACCATCTTA 

GTGTTATTATCAAAATATTCTGAACTCATGAACCTCCTCAGACTGTTGCTGGGACTCCCA 

GATATCMTACTCTGAGAACCACTGATCTMTGTCTTTAGTCAGTTTCTATTTGTTCTCT 

AGTATAACCAAGCATAAAAGTAATATACTTCCGCCTCTCTCTCCCCCAACTGACTTTAGT 

CAATAGTACCTCGGCCG 

Sequence 450 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTGTCAAGCTTCGACCCACG 

CGTCCGAAAGAGACCCCTGGGCTAGGGAGCCATTTGCAAGTCCCTCAGATGGCCTCAGGT 

GTGGGTCATGCTCCACTGCAGCCATGGTTACTGGGAAGCCAAGTGGACTCCATGGGGCTC 

CAAGCAGGGGAGGGAGCTCCTGGGAGACACAGTGTGGCCCAGATGCCCTGGTCATGGGGA 

CATCCAAGCACAGGTGAGAAACTTGGGCAGGGAGGGTGCAGAGGAAATTTGGAAATGGCT 

GAGAGGCTGGACTCAAGGTCTCTTGGGCTTGAACTGAGCTCAAGCAGTCCACCCACTTCA 

GCTTCCAGGGTGCTGGGATTACAGTCGTGAGCCATGGCATCCGGCTGGACTCAAGGTCTC 

TTGTTCCCTGATCCTGGGCTTGGCATGGAGAAGGGAGGAAGCTGAAGGGGGCAGGTAAAA 

AGAT 

Sequence 451 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAGCTTCGACCCACGCG 
TCCGATTATACCCTAAAAAAGTAGAAAGATGGAATAATTAGGAGTAAAATTAGTGAAATG
GAAAGATATGTTATAGAAAGGACCAAGAAACGCAAAAGTTGGTATTTGAAAAGACTGAAA 
AAATTCATGAACCTGTGAAAACAGTGGTCAAGAAGAAAAGAGAGGCATATTTTGATCTCT
GTTTTACATGTTACTCAATGTTCATTGCTGCCTCCCTTGTCCATAAAGTGCCTTTAGTGT 
GTATGTTACTTTAGATTATCTTGGTGTCATCAAGCTTTACTCAGCAAAGAACCACTTTGT
TGTCTACTTTAAAACATAAGTTATCTTTAAAAGAATGGGTATCTTTTATAGTTCCATATT 
AATGGCGAAGAAACTGCAGGTAACAGTGCCTTACCAGCTGGGTTTTGCTAACTTTTCTC 
Sequence 452 

CCGCGGTGGCGGCCCGCCCGGGCAGGTTTTATATTTTTTTCCTCTTTAAAAAATAATTTG

GTTTTGAATATTAAATTTACATATTTCTAAGTTAAATCAACATTCGTAGAGGAATTATCA 

AAAAAAACTAGTAAGTCTGAAAAAAAAACCATATTTTATATTCTGAGGTCCCGGACGCGT 

GGGTCGAAGCTTGACCT 

Sequence 453 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCGGCAGGTACCCAACACAAACTA 

TTCAATAAAGTAATCTGCTTTAAAAATAAAACACACTGAAAGGCCGAGGCAGGTGGATCA 

CCTGACATCATTAGTTCAAGACCAGTGTGGCCAAACTGGTGAAAATTAGTCTCGACTAAA 

AATCAACATTAGCTGGGCGTGGTGGCAGGCGCCTCTAATTCCAGCTACTCAGGAGGATGA 

GGCAGGAGAATCACTTGAAGCAAGGAGGTGGAAGTTGCAGTGAGCTGAGATCGTGCCATT 

GCACTGCAGCCTGGGCAACAGAGTGAGACTCCGTCTCAAAAACACCACCACCAACAAAAT 

AAACACAACAGAATTATTCTGCAAATACAGATATTGGAGTAGCTGAGTTCCATCTCAAAT 

TTGACTATGCAGGTTGACAGGTGATCTTGGCAAACTACTTATCCTTTCTGAAGTTCAACT

TTTTCACCAAATGGTATTGGGATACAACACTTGCTCTTGCCTATCTCACATGAATTATCC

ATTTTGGACAACTTGGTAAACTATA 

Sequence 454 

CCGCGGTGGCGGCCCGAGGTCCCAAAAGATATCACTGTGAAGGTTTGGATACACTGACTG 

AGGAAAAAAGAAGGTCCTGAAAGCGTCTAGACAAAAAAAGACTACTTGTAAGTTGCAAGA 

ATCAGAATGGCATTGGACTTCTCAGCTTTCCTCATTAGAAGTTAAGATCTGAAGCAATCT 

TTAAACTCGTGAGGAAAATTAAGTCTAATAAATAATTTTCTTCTTAGCCAAACAATCAAA 

TGTGAAGCTAGAATAAGCATTTTCAGGTAAAAAAAAAAAAAAAAAAAGT 

Sequence 455 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCTACAGCATCCTGATAA 

CAGCCTCTGCCCTGGGAAACAGACTGTGACCATGCATTTCTAGTCCAGCATATCCTATCA 

GAAGACCAAATGGCTTCATCAAAAACAGAGTGACAACCTCTTCTCTTTGCCTCTTCTGTG 

CTTGTTAACAGGCAGCATTGGGGCAGGAGAGCCTGCAGGCCTTTCACGGCTGCTTGAGTT 

CTCACCTGTTTGTCTGAGCTCTGATTCCTCTGCCCTGTAAGCGTAAAGGAGATGTGCTGA 

GTGGAAAGACCTCTAAACAGGCAGCCAGGAAGCCAGATTTCAGGTCCATCTCTGCCTCTA 

ACTGGCAGCTTTGCCTTGGGTAAATCATCAAGTGGGCAATAGTTTCTCTCCTGTAAAAGG 

AAAAGATTGGGTTTAAGATTGTTTCTGAAGTTCTCTCTAGATTTAACCTGGAAGGAGTTG 

AAATTGCTAACC 

Sequence 456 

CCGCGGTGGCGGCCGCCCGGCAGGTACAACATTTTACATTTCCAGGGACTGCAAAAATGT 

TAGTTCCTTCCCCCATCATTTAGTTTGMAATTCTTAGATAATTCTTTGCTGGTAAATTC 

CAACAGAATAGTTAGCACACAGGTTCCACACACACAAGTTCTAGATAGGAATCTGAAGCA 

CCACMTGAAMGAACATTTAACATCTTTTAAAAATGTTTAATGTTATCAGAAAGATGTT 

TGGTATATGTGTTCCATGCATGCTCCTGCTGGTTCTATTTGAAAAAGAAGTTTTTACAGT 

TATCTGTTGTCACCATATTGTAAACGGACGCGTGGGTCGAAGACCT 

Sequence 457 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCGGGCAGGTACAATAAATAGCA 
TTCCCACGGTGACCACAAGTCTTGGAATCAGTTCCAGGTGTCGTCGTGGCCGTTGACACC 
GCTGCCTTCTGACGGTAAATGTATTGTAGAATTCATGTTGTATCAGGCTTCAGTTTCCTC 
ATTCTAAAATGAGAGGATTGGATAAGTTAGTAGTTTCTAATTTTTACTTTTAATCAGTGG 
CATC TCCCATTTA I I I I ICAmGAMTAAMCTTTTGAATTTTATCTTCTACCTAAATA

ACATATTTTGTTTTATGTTTCAAGATGAAGCTCACACTGAGTTGGAAAAAAGGAAAAAGC 

AAAGGATCAAAGCTGGGGGAAAATACTGGACCATGTGCTTCACCTCATGGTGCCAAATAA 

AGAGAAAATGGGGAGAAGATAGGGACAGATAAAGATCTATTTGCTCGGATTGNGCTCTCA 

TCCTTGGCAACATGTTGACAATGCCCTGGAAATA 

Sequence 458 

CCGCGGTGGCGGCCCGAGAGCTCCAGGACGAAGGTATAAACACAGCAGAGGGCAGAGCCT 

GATTTCAATCAGGGGCTACTCTAAGAAAGGCAGGAAACTAGATAAATACATTTAAAAGAA 

ATTCCTCAGTGGCAGGGACAGTAGAGCAGCAGGGGGAGATCCCAGCACGGACAGGTAACA 

GTGTGATGTGGCAGAAAGGCTTTGGTTGCAAGTGGAGAACAGATGTCTCTGGCTGCCTCT 

GGCAGCTGCCTCCTTCTGGGCCTTGACTTTTCAAAGCCAGGCCAGGCCTCCCCCACCCTG 

GACCACCTGTAGCTGGTTCAGAAGGCCCCAGGCTGGGCTTCATAGATGAAGACACAGCTG 

ACTCAAGTCTCTGGCTCTGTCGGCTCTTGGCCACCTTGCCGCTCCCATAACGGTGTTTCT 

CAGGTCAACCCCTCTTCTCCATTCCTACTTCAATGACCTCAGGTCAGGCCCTTGCCACTT 

CTCTTCTGGACAAAGATGACAGCCCTTCACTGGTATCCTCGTCTNCAACCTAATTTATNC

TTCACAGTGCTGGCAGAAGTGACATCTTTTAAACACACAACGACCCCN 

Sequence 459 

GCGGCCGAGGTCCGCAC I I 1 1 I I I I I I I I I I I I I I ATTTTTACTCCAGAATTTTCCTTTA 

ATATTTAGGACTCCAATCTTTACTTACAAAATAGCTTTTATTTACGTGCACATGATCGTG 

GTTTCMAATTTTCTAAGCACTATGCTAAATTTGTCATCAAAACATAACAGATTCCCATC 

TTACAAACATAGTTGCTAGTTGAATGAGTAAAAGAGATTTCAAATTTCAATTCAAGGAGG 

CATGTCTAAAAGACCAGACCATTCATTTGATGAAATTGTAAATGCCGATCATCCAACTTA 

ACAGGAACTGCACATTTGTTCCTTTCTAGTTAGAAAAAAATAA 

Sequence 460 

CCGCGGTGGCGGCCCGCCCGGGCAGGTCTTCGACCCACGCGTCCGTGATTGCCTATTGTT 

TGTTGATTGACTGATTTATGCCTCTAAGAGGAACTATCTTTTGATAATATTAAATAAGAT 

GTCCTAATACAAAACTGATAGAGTTCAGAAATAATAAGAATCTCCTGGCCAGGCGTGGTG 

GCTCACGCCTTTAATCCCAGCACTTTGGAAGGCTGAGGTGGGCGGATCACGAGATCAGGA 

GATTGAGACCATCCTGGCTAGCATGGTGAAACCCTGTCTCTACTAAAAATACAAAAAAAA 

TTAGCCCGGGTGTGATGGCGACCT 

Sequence 461 

CGCGGTGGCGGCCCGCCCGGGCAGGTACAGAAAGGACAAATACATCAGTAGAAAAGAAGA 

CAATATAAGGGCAGATTGAAATATATACGTGAACGTCACAAAGACCAATTACTGCCATTT 

CAATTCAATGAGGAAATAATGATGTATTTAATAAATAGTGCTAGAATGCTGCATTATCTG 

TCTAGGATGAAAAAAAAAAAAAAAAAAAAGT 

Sequence 462 

CCGCGGTGGCGGCCCGAGGTGGMTGTCTGTTTTCACAAAATTTTGTATTTTCTCCTAAT 

AGTATGAGGTNGAAGAAATCTACATCTTCTCAAGTGAGCTTATGATTAACTCGATGAGTT 

TTCTTGCTATTCTCAAATCGGAATNTCCAGACCTGGCTAGAAACTAAAGTCTAAGCCCAT 

TCATTAAAGTCTTGAATTTATTTACTTTNGCCAAGAACAGCTATATAAAATTAGATTCCT 

CCTGGTATAAAATTGGGTGTTTTCTTAGATATTNGCTATCAAMGTCATTTTTCTTGGM 

ATCGGACGCGTGGGTNGAAGCTTGTACACCTGCCCGGGCGGGCGGCCGCAC I I I I I I I I I 

I I I I I I I I I I 

Sequence 463 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGAAAG 

CATACAACTGAGGGTTCACGGATTTCTAAGAAATGCATTTCCCTTGTCTATGTTCATCAG 

CCTTTAATACTTTGGCTACAAGGCATATCAGAGAAAGGGAGGTTAAATTGGGTAATGACA 

AAAGAACATATGTAACTCTGGAAATAGGAAAAATGTTCCAGAAATGGGATCAATGTGCCA 

GCAATAAGCATAGTTCATTTCATTTGAAAATTCAGTTAAAGAGCCCAATAAACAGTTCCA 

AACCGGACGCGTGGGTCGAAGACCT 

Sequence 464 
AGTATNGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTGTCGACTCAAGCTTTCAG

ATATAGGGCATTCCAGAATCTTCTCTTTACGAGTTCACCTGCTAGTATAATCTCCACAAC 

TTGAATGGCATTGGTTGTTCTGTAATTCCTGCCAAAAGCATCACAAGTTGTACCTGCCCG 

Sequence 465 

CCGCGGTGGCGGCCGAGGTAATAGCTAATGCATGCAGGGCTTAATACCGAGGTGATGGGA 

TACTTCATGCTAACTGGGTGTTCCTTAGTGTAATAATTGACTGCCTGCTCCCATGTGGCC 

TCACAGAGTCATAGCTTCTTGCCCCAMTGTTTTTCTGGTTTTCTGTGGAGCCATACAGA 

ACCATAAAGAGAAGTTGTTGAGAAACCGGTTTGACTTTCAAGTCTGATGAGGAGCCTGGT

TTCCATGAGATTTTCAGCTAAGTTAGCATATTTTTAAAAATTTCAAAATGAACAGAGGAA 

AGGTTGACGATCTTTAGAGCCTTCTGCCAGGCTGAAGTGTTCTGCATCTTCCCTAGACCC 
GGTACCTGCCCG 

Sequence 466 

CCGCGGTGGCGGCCGCCCGGNCAGGTTACAAGCTTCGACCCACGCGTCCGGGAAATTTTA 

ATTA AAMT AGGTGAACATTTTAAATGACCTAATACATATTTAGTCCACATTGAAACTTT 

GGCATTTTGTCATTGCCATTAAAATTTTGATGGCATTAAMTTTGATGCCATTAAAATTT 
TGAT 

Sequence 467 

CCGCGGTGGCGGCCGCCCGGGCAGGTCAAGCTTCGACCCACGCGTCCGTGATAACTTCTC 

CTAAGTGCCAGGCATTGTATTACATGCTGGGAGCACAAAGATGAATAATAACAATAGGTT 

CACAGAAAAGATGAATTGATTGAGAGAAAAAGAACCCTCCAGGAGCCCTCAGCGTAGTAG 

GGGGTTGGTGTTGGAGGGTTGGAGGAATGGAAAAGGCCCTGAAATGCAGGCAGAGAAATG 

ATGAAACAATTCAGGGGCTGTGGTGAGGTTAAATGAATATCTTTACAGCAGCCTCGAAGA 

CTGATCAGGTTACTATACCCTCTCTTCTGTCCACGTGCATTTC 

Sequence 468 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCACAACATTCCCCCTT 

CCCCAAACAGTAATATGGACACTGATTTAACAAGACTTATAAAAAAATAAGGCACATTTA 

TTTTGATATGGTAATTTTAAAATAGAAACCCCTTCTCAGAACACCTGTATTCAAATGAGC 

TGTGTAAAAAGACACCTTGTGGTACCTAAAATAGGTTTATGGTACCTATGGAATTGCTTC

TATTTTAGTGAAGATGGAATAAATTGCACCCATCCCACATTGTCAAGTAATGAAAATATG 

CGGACGCGTGGGTCGAAGCTTGACCTGCCCG 

Sequence 469 

GACCTCTTTAAGTGAAACTTAGTATGCTAATAGTATGATACGCCCTTTTGCTGTAGCAGT 

TACCATAGTTACAGATAAGTATATATTGTTGTGATGATCTGATTTCCTGGTTCCCCCACC 

CCTGCAAAACAACAACAAAAACCTTTACCAGGCTCTATAACAGGGGGACCAAACTTGTTT 

TTGCTCATCATTGCCGGACGGTGGGTCGAAGCTTGACCTGCCCG 

Sequence 470 

GCTCCCCGCGGTGGCGGCCGCCCGGGCANGGTTACAAGCTTCGACCCACGCGTCCGGGAA 

ATTTTMTTAAAMTAGGTGAACATTTTAAATGACCTAATACATATTTAGTCCACATTGA 

MCTTTGGCATTTTGTCATTGCCATTAAAATT7TGATGGCATTAAAATTTGATGCCATTA 
AAATTTTGAT 

Sequence 471 

GCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCAAGCTTCGACCCACGCGT 

CCGTGATAACTTCTCCTAAGTGCCAGGCATTGTATTACATGCTGGGAGCACAAAGATGAA 

TAATAACAATAGGTTCACAGAAAAGATGAATTGATTGAGAGAAAAAGAACCCTCCAGGAG 

CCCTCAGCGTAGTAGGGGGTTGGTGTTGGAGGGTNGGAGGAATGGANAAAGGCCCTGAAA 

TGCAGGCAGAGAAATGATGAAACAATTCAGGGGCTGTGGTGAGGTTAAATGAATATCTT
Sequence 472 

ATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAGCTTCGACCCACGCGTCCG 

GAGCGTTGCTTGGATTTCTAATTACTTCTMGNGTAGTTTTATTTAATTTCAGTCCTTTA 

GAAAAAANAAAATAAAANNAAATGTGCGGGCCCGGCCTGCCCGGGCAGGTNCCACNCGTT 
CG AAA AAAG AAAG AAAAAA AAAAACTTTCTCTTTG CCANTTCTTCTTCTTCTTTNTT
Sequence 473 

CTA CTTA GGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCA CI I I I I I I I I I I I I I I I I I 
TCCTTTTGAGACACAGTCTCACTCTTGCCCAGGTTGGTCTAAAACTCCTGGGCTCAAGCA 
ATCCTCCCGCTTTCAGCCTCCCAAAGTGCTGGGGTTACAGCCGTGTGCCACTGTGTCTGG
CCCTTTTCI I I I ICATAGGAGAAGGGTTGTTGACTCCCAGGAAACGTCACCTGGAACCAA 
GAATGTGAACTCAAGGACCCCCGCCTGTTGGCAGCTGCATTTACTTGACTCCTGTTCACT 

GTTTCTTAGCCTTGTCCTTTCTCTCCTGCCAGTTCTAGGGGAGACTGCTTCTCCTGGTTG 

ACCTCATCAATGCCCAACC 

Sequence 474 

CCGCGGTGGCGGCCGCCCGGGCAGGTACACAAACCAGATGTATGCAATGATGCCAAAAGT 

CATCTCAAAATTCCAAGCTGACCTAGTGCAACACATTTACACTTGGATAAACTATCACCT 

TGMGATTTACTTTGTATACTTCAAAAGATCTGGTCATGAAATTTTTAGCTAATACATAA 

AGTGCCGAATTGAAATCCAGAATACAATAGCTATGAAGGGCCGCTAGAGTGCAGATAACC 

AACCCATTCTACCTAACTCAGGTTGAGATTGCTTTAGAACCTATCATTGGCTTTAAAATG 

GTCCACAAAAAAACACCTCATGGTCACACTTAAAATAGTTTGCATCCCACACTACAGGCT 

TCTCTGGAGGATTTAATACTTTGGAAGTAGCATCATAAG 

Sequence 475 

CTNCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGACCNGTGCAAATATCTACCCA 

GTTAGAAGAGTAAATACCATCTTAGTGTTATTATCAAAATATTCTGAACTCATGAACCTC

CTCAGACTGTTGCTGGGACTCCCAGATATCAATACTCTGAGAACCACTGATCTAATGTTT 

CTTTAGTCAGTTTCTATTTGTTCTCTAGTATAACCAAGCATAAAAGTAATATACTTCCGC 

CTCTCTCTCCCCCAACTGACTTTAGTCAATAGTACCT 

Sequence 476 

TTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAACGCTTCGACCCACGCGTCC 

GGTTTGGGTGGMTTATAATATTTTAGATAAGATTTAAGAGGATTGCTAGATNGGAATGC 

GAATGATGATAAGGCTTTTAGAGTTAGATAAGAGAGAGGGCGCTCTAGAACTAGTGGNTC 

Sequence 477 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCACAACATTCCCCCTTCCC 

CAAACAGTAATATGGACACTGATTTAACAAGACTTATAAAAAAATAAGGCACATTTATTT

TGATATGGTAATTTTAAAATAGAAACCCCTTCTCAGAACACCTGTATTCAAATGAGCTGT 

GTAAAAAGACACCTTGTGGTACCTAAAATAGGTTTATGGTACCTATGGAATTGCTTCTAT 

TTTAGTGAAGATGGAATAAATTGCACCCATCCCACATTGTCAAGTAATGAAAATATGCGG 

ACGCGTGGGTCGAAGCTTGACCTGCCCG 

Sequence 478 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTATACTAGAA 

GATGCTCCAAGGTTTCAGAAAGGAATTMTTACTTTCAATTTGCACAATTTAGAACAAAT 

ATCTGGCTTTTCCCTAAGCTTAATGATTTTCCATTTCACACAACTAAAATATAATAGCAT 

TATTTTATAATCAAGTTTAACTGATGGTCTATGATAGTAGAGCGATTTAGTATTTTGACA 

AAAATCTTATGAGACATGAAGTCATTCAATTTGCCGGACGCGTGGGTCGACTCAAGCTAG 
ACCTN 

Sequence 479 

TTAGGGCAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCCGCTTT

CTTTTGTTGTTAAGGGCCCATGACCTGCAGTTTCCCTAACATTCATTTTTATACAGGGCA 

GAGGTATGTGTGCGAGCTCAGATACCTTAAATTCATATGCCTTTAATACAATCCAGGCAG 

ATTTCTAAATGAGGGATGCTTCCCCACAAATGGAGAGTGAAAGTGGGCCAGCCTAAAAGG 

ACCTCCATAGCACTGTGCATGGCCAGCTGTTTGTGGCTGTACCTGCCCG 
Sequence 480

ACTACTATAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTC
CGCTTTCTTTTGTTGTTAAGGGCCCATGACCTGCAGTTTCCCTAACATTCATTTTTATAC
AGGGCAGAGGTATGTGTGCGAGCTCAGATACCTTAAATTCATATGCCTTTAATACAATCC
AGGCAGATTTCTAAATGAGGGATGCTTCCCCACAAATGGAGAGTGAAAGTGGGCCAGCCT 
AAAAGGACCTCCATAGCACTGTGCATGGCCAGCTGTTTGTGGCTGTACC 
Sequence 481 

GACCTCTTTAAGTGAAACTTAGTATGCTAATAGTATGATACGCCCTTTTGCTGTAGCAGT 

TACCATAGTTACAGATAAGTATATATTGTTGTGATGATCTGATTTCCTGGTTCCCCCACC 

CCTGCAAAACAACAACAAAAACCTTTACCAGGCTCTATAACAGGGGGACCAAACTTGTTT 

TTGCTCATCATTGCCGGACGGTGGGTCGAAGCTTGACCTGCCCG 

Sequence 482 

ACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTATTGACTAAAGT 

CAGTTGGGGGAGAGAGAGGCGGAAGTATATTACTTTTATGCTTGGTTATACTAGAGAACA 

AATAGAAACTGACTAAAGAAACATTAGATCAGTGGTTCTCAGAGTATTGATATCTGGGAG 

TCCCAGCAACAGTCTGAGGAGGTTCATGAGTTCAGAATATTTTGATAATAACACTAAGAT 

GGTATTTACTCTTCTAACTGGGTAGATATTTGCACTGGT 

Sequence 483 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCAAGCTTCGA 

CCCACGCGTCCGTGATAACTTCTCCTAAGTGCCAGGCATTGTATTACATGCTGGGAGCAC 

AAAGATGAATAATAACAATAGGTTCACAGAAAAGATGAATTGATTGAGAGAAAAAGAACC 

CTCCAGGAGCCCTCAGCGTAGTAGGGGGTTGGTGTTGGAGGGTGGAGGAATGGAAAAGGC 

CCTGAAATGCAIMGCAGAGAAATGATGAAACAATTCAGGGGCTGTGGTGAGGTTAAATGAA 

TATCTTTACAGCAGCCTCGAAGACTGATCAGGTTACTATACCCTCTNTTCTGTCCACGTG 

CATTTNAAAAACNTTGGCCGNTCTAGAACTAGTG 

Sequence 484 

CCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCCGATGGTTTTTGCAAAAATTGAAAA

TGCATCGAT ATTACA GTTMTTTTTTCAGTGTGTATGTGGTATTAGGCTTAGAACTATAA 

CACAGGMGTTTTTAGAGTATGTCCACTCTGGTTTACTCCTTTGTAAGTATTAATACCTG 

ATAATTTACATCCTACAGCCCTGCCTTTTTTTTTTTTTCAAGTTTGTCCCAGCAAGTCTT

GGCCCTTTGCATTTTCTTAATACATTTTAGTACCTGCCCG 

Sequence 485 

CCGCGGTGGCGGCCGCACTTTTTTTTTTTTTTTTTTACCTGAAAATGCTTATTCTAGCTT

CACATTTGATTGTTTGGCTAAGAAGAAAATTATTTATTAGACTTAATTTTCCTCACGAGT 

TTAAAGATTGCTTCAGATCTTAAACTTCTAATGAGGAAAGCTGAGAAGTCCAATGCCATT 

CTGATTCTTGCMCTTACAAGTAGTCTTTTTTTGTCTAGACGCTTTCAGGACCTTCTTTT 

TTCCTCAGTCAGTGTATCCAAACCTTCACAGTGATATCTTTTGGGTACCT 

Sequence 486 

ACTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTGGAGGCCCAAG 

TGCTGAGACTCAGGGGACGCAGGCCTGAGGACAGTTATGCAGGGTGCAGCGGCCTACGGT 

AACCCATGTAGCAACAGAAACCCTTAGCTAACTGCCGTAATTTAAGGCAATTAGGAGCCA 

TTCATCATCCAGATGGCTATTGGCTCTAATCGTTACTGGCTGAAGGAACTATATATAGCA 

GCTACCTTTCCGCTCCATTCCCAGGCCTTGTTCCTGTCTTCTGGTTGCCAGTTCTGCACT 

CACTCATTCCGGACGCGTGGGTCGAAGACCT 

Sequence 487 

CTATNGGGCGAATTNTTNCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACGCGTCCGA

TCACTTTTTTCATTNATACCTTATTAGATAAAACATTAGCCCCCTAGAGTGTTTTGTGAA

GGAAATATGCCTAATAAGAGATGATAGTTTTAGCAATAAATGAGCATTAGAACTATTATT 

TATTAATGAAATGAACTGGTGGTCTGAAAGTGATGATAAACAGACAACTGTGGAAAATGA 

ATTATTAAAATTCCATGGAATTCCTTTTGAAGTTTATGAAGT 

Sequence 488 

CCGGGCAGGTACAAATCAAGTCATTAACATTTTCAATGTCAAAAATACAGCACGCTGTTA 

AGAGTTCTGTCAGTGCTCATTATCCCACTAGATCCCACAAAGGGCAAACTCAAAAGATGA 

AACAAAGGCAACGCCATCAATAACCACCATATTCCACAGGCTTTCTCCCCTAGGACGTAC
CTN

Sequence 489 

CCGCGGTGGCGGCCGCCCGGGCAGGTGGAAAGGTGGGTGGGGAGAGGGAGGCTTATTTGT 

TGCTGCAGTGTAACTAAGTGAAACCTAATTCATATGACTCAAACTAAGGTATATTTGGTT 

AGATCTAGGTGAGTTCTACTTTAGAGGAAATCCTGGTAACTGTTGTTTGTTTGTAAGTTA

TAGCTGTAATTMTTTTCCCTGTATTCAAAGCCCCCAAACCCTGCATTCAGATACTATGC 

ATTTAGACTTCCTTAGGCAAAGTCAAGGCAACAAGCTGATGATTCTAAAGCTATTNTTCA 

AGGGAGNTNTTTACCCATCATAAAGGNGGTTTTTAGTCATTATAGATAATATTCAATCAA 

TTAMTACCGGGGGATGGCAAAAA 

Sequence 490 

CCGCGGTGGCGGCCGAGGTGTAATTTGGAGAATATTTAAAGCAAAAGAGCAAACAACAAA 

AACTAAGTTAACACTTACCCAGTGCAGTAAGGGAATTGTAAGATACAGCCTGCTTAAGGA 

GGTCTGCAGACAGATGCACCTMGATTTCAGCTGTTTTAGGTCACTTTTCTCAAAATATT 

TATTATCTGGCAATGGGGATGGGAGTGGGGAACACCTNTCTGTGAGGCAAATGGTATCTC 

MCAAATACCGACTTTTCAAGGAAGAAAGCTCTCCACTTCTCTCATAAACTTATATACTA 

CCTTAAACAGTATGCAGTATTCGCGGACGCGTGGGTCGAAGCTTGACCTGCCCGGGCGGC 

CCGCTCTAGAACTAGGTG 

Sequence 491 

CCGGGCAGGTACAGCCTCACATACACAGATGCAGGTGAAGTCACCAAAGCTGATCTCTCA 

TTCGTTCTGGGGACAGTTAGCAGCGTAGTGGTCCCACTGCAGCAAAAGTTTGAAATTCAT 

TTTCTTCAGGAAAATACCCAGCCAGTCCCTCTCAGTGGAAACCCTGGTTATGTCGTGGGG 

CTCCCATTAGCTGCTGGATTCCAGCCTCATAAGGGTGGAGCTCTCCCGTGTCAGCTCGTA 

GCACAGMGGTGAAGAGCCTGCTGTGGGGCCAGTGCTTCCCAGATTACGTGGCCCCTTTT 

GGAAATTCCCAGGCCCAGGGACATGCTGGACTGGGTGCCCATCCACTTNATCACCCAGTC 

ATTCAACAGGGA 

Sequence 492 

CCGCGGTGGCGGCCGAGGTACTATTGACTAAAGTCAGNTGGGGGAGAGAGAGGCGGAAGT 

ATATTACTTTTATGCTTGGTTATACTAGAGAACAAATAGAAACTGACTAAAGAAACATTA 

NATCAGTGGNTCTCAGAGTATTGATATCTGGGAGTCCCAGCAACAGTCTGAGGAGGTTCA

TGAGTTCAGAATATTTTGATAATMCACTAANATGGTATTTACTCTTCTAACTGGGTAGA 

TATTTGCACTGGT 

Sequence 493 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGTTAAAGGAATAATCTGCAGA 

ACATCTTGATTTACAAGGGACAAAATGATGCAAATTATATGCTGTCCAACCTACTGGTGA 

ACTGGATCAGAATGGTCCAAGGACTGTTMACAGAGGMGTATTTACATTTTGAAAACTT 

GCGGACGCGTGGGTCGAAGCTTGTACACCT 

Sequence 494 

CCGCGGTGGCGGCCGTTAAAGGAATAATCTGCAGAACATCTTGATTTACAAGGGACAAAA 
TGATGCAAATTATATGCTGTCCAACCTACTGGTGAACTGGATCAGAATGGTCCAAGGACT 

GTTAAACAGAGGAAGTATTTACATTTTGAAAACTTGCGGACGCGTGGGTCGAAGCTTGTA 
CACCT 

Sequence 495 

CTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCCTAAAA 

CTTAMGTATMTAMAAAAAGATTATTTCCTATACTTCAAMTCAACAAGATTTGATTG 

CATTTAMTTTTTCTGTCCATTTGTTCTTCTATGTAATACTTTMAAATAATTGGCATAA 

AAATTCAATCAATTCATAAAAGTCCAAAGCAAAAAAAACAAATCTACTGACATNTCTTGA 

GGAAGAAATGATCAGGATTGACATTAATGAACCCTCTCACAGAGACCACTACACACACAC 

ACAAAAAGAAGGATGGGTGAATGGATGCAGAGAGAATTTAATAAGACTGAAATGATGCCA

TACATGCTTTTAMAMTAAAAAGTATTAAATTTTAMTTTTACTTCAATATAAGAGAAA 
AAAAAAAA 

Sequence 496
CGACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCTTCGA

CCCACGCGTCCGTAGTAATAGGGAATTAACTGACCCCTTTTGGATGGGGGAGAGCATCAG 

GCTGGGGTCAGGTAAGTGTAAATGGCCTTCTGAGCATGCTCTTCTAGGCTGACTCCCAGC 

CCTGACTTGAAACCATTAGCGCTAACTTGCTCTGTTTTGAGAAAAACTTTCCAAACTTTT 

GCATGAGAAACTAGAAAAAGGAATGTATGCCACGTAACTGGATTACAGAAATGAGTTAAT 

TGTC TCTG TGATAAAMAAAAATGAAATATTTTCTTATTGAATTAATATTTTTGTCTTGA 

AGCATTTTCTAGTGATAGAATGTATTTGTCTTTTTTCCTGGGGGGNACCTCGGCCGCTCT 

AGAACTAGTG 

Sequence 497 

CTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCCCCAGGTC 

AGCACTTTGAACACTTGAACATCTATGAAATCACATAGTAAAGTGATAGGAGATGGGGCT

AAGCTTTTAATGGCCTTTAGACATAGCATTAGACATAACCTAAGCTGAAAGGCTTTGGGA 

AGTTGTTGTGTTAMTCCCCAACACACTCTCGTGTTTTCTTAGGACTTGCCTNTTATTTA 

AAAAAAAAAAAAAAAAGTTGCGGCCCGCTCTAGAACTAGTTGGATCCCCCGGGCTTGCAG 

GN 

Sequence 498 

CCGCGGTGGCGGCCGAGGTACCCTTTTATAAGGGTGTATCCCCTTTTGGTAACTTACTGT 
TTGTTAATTTGTAGTGTTCCCTGCCAGTAAGCTTGTAACACTCTAGTGACTCACCTTCGG 
GTGGGAGGGTAGGAAAGGGAGAGGCCTGCCTCCTAAACCTGGGAAGATGGGGAGAGAGTG 
GTAAAGCTGAGAGCCCAAAAAACAAACCAAAACAAAAAAAAAAAAAAAAAAAAAGT 
Sequence 499 

AGGTACCCAAAAGATATCACTGTGAAGGTTTGGATACACTGACTGAGGAAAAAAGAAGGT 

CCTGAAAGCGTCTAGACAAAAAAAGACTACTTGTAAGTTGCAAGAATCAGAATGGCATTG 

GACTTCTCAGCTTTCCTCATTAGAAGTTTAAGATCTGAAGCAATCTTTAAACTCGTGAGG 

AAAATTAAGTCTAATAAATAATTTTCTTCTTAGCCAAACAATCAAATGTGAAGCTAGAAT 

AAGCATTTTCAGGTAAAAAAAAAAAAAAAAAAAGT

Sequence 500 

CCGCGGTGGCGGCCGAGGTACTATTGACTAAAGTCAGTTGGGGGAGAGAGAGGCGGAAGT

ATATTACTTTTATGCTTGGTTATACTAGAGAACAAATAGAAACTGACTAAAGAAACATTA 

GATCAGTGGTTCTCAGAGTATTGATATCTGGGAGTCCCAGCAACAGTCTGAGGAGGTTCA

TGAGTTCAGAATATTTTGATAATAACACTAAGATGGTATTTACTCTTCTAACTGGGTAGA 

TATTTGCACTGGT 

Sequence 501 

CCGCGGTGGCGGCCGCCCGGGCAGGTCAGGAGTGTCCCAAAGATTTCCCAAGTCCAGCCC 

AGAGAAGCTGAAAGCCTTTCCCCCAGGTGTGGGGCTGAGTTAGATGTGGGTCATAAAGGA 

TGTGGCCTCGAGGCTGGGAGGCAGCTGGGCAAAGTGGGAAGCCTCCCTACTCCTGAGACA 

GTGATGGCTCAAATCCAGGCCAACCTGGAACATGATCCTCAACTTCTCTAAGTTCACCTT 

TCCCAGGTGTGAAATGGGTTGTTCTGGGAATTGAGTGAGCTAATGATACACTCCCTGGCA 

CACAGCGAGCCTNAAAACGCTTGTGTCCCCTCCCTACCTCACAGCCCATTTTAGAAGTTT 

GCTGTCACTTA 

Sequence 502 

GACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTTTTCTTA 

TGAGTGGGAGGTGACTGATCGTGGAGGTGGATTTCTTATGAGTGGGAGGTGACTGATCGT 

GGAGGTGGATTTCTTATGAGTGGGAGGTGACTGATCGTGGAGGTGGATTTCTTATGATTG 

GCTTATCACCATCCCTCCTTGGTGCTGTTTTTGCAACAGTGAGTGATTTCTTGTGAGATC 

CGGTTGTTTAAATCCANAGGCACCTNCCCCTACCCTCTAGCTCCCATTCCTGCCATGTAA 

GACACCTGCTCCCCCTTTTTCTTACCCCATGATTGGAAGCTTTTTTGAGGCCTCCCCAGA 

AGCTGATGCCAGCCCTATGCTTCCTGCACAGCCTG 

Sequence 503 

CTACTATAGGGCGATTGGAGCTCNCCGCGGTGGCGGCCGAGGTTTTTGAAATGCACGTGG 
ACACGAAGAGAGGGTATAGTAACCTGATCAGTCTTCGAGGCTGCTGTAAAGATATTCATT
TAACCTCACCACAGCCCCTGAATTGTTTCATCATTTCTCTGCCTGCATTTCAGGGCCTTT

TCCATTCCTCCACCCTCCAACACCAACCCCCTACTACGCTGAGGGCTCCTGGAGGGTTCT

TTTTCTCTCMTCAATTCATCTTTTCTGTGAACCTATTGTTATTATTCATCTTTGTGCTC 

CCAGCATGTAATACAATGCCTGGCACTTAGGAGAAGTTATCACGGACGCGTGGGTCGAAG 

CTTGACCTGCCCGGGCGGC 

Sequence 504 

CTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAGCTTCGACCCACGCGTC 

CGAAATTCTTGGTTAAGGATTATTATAAAATAGAATGGTATTTCAGTAAATCCCTGAGGC 

TTAGGAGTCCAGGTACAATGTTGGTTCTCAATTAAAATATAAATCATGTCTAGGGACACT 

TAGGAACACAGAACATATATTTAGAGCTAGAAAATATACAGCTTCAGACCAGGCAAAGTG 

CTGGGATTACAGGCGTGAGCCACCACGCTCGTCCTCACATGGGGTTTTATTATTAGGATG 

GTAAGAGTATTATAAGGGATTNGGTACMGGCATAATGAGTCCTTTTGCTTTTTAGGCTT 

TTGACTTNT GGTTTTM GACTTTTIsrTTTAGCTTTTTGTTN 

AGGCTTNGGTTTTTTAATAAAGTTTGCTTGGGATNAMCNTGACCTTAATGGAAATTGTC 
CCCTNCCCCCAAAA 

Sequence 505 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTACTGATACAAATAGCATGGATGAAACTCAAA 

ATCATTATTCTAAGAGCCAGATACTATAGCCTGTATTTTATGATTCACTTTCAATGAAAT 

TCTACAATAGACAGAACTATCTATCAACAGAAAGCAGATCAGTGGTTTTCTGCAGCCAGA 

GGTATGA AAGGTTTG AAACATGTGGCACCAGTAGGACATATGGMACTTTTTTGGTGTGA 

TGGAAGTATTTTTTTATCTTGATTGTGTGGTGTTTGTTATACAGTGGTATACATTTGACC 

T 

Sequence 506 

GGGGCGGCCGCCCGTTCAGGTACACGTNTTTNCCAACCAATTTTATANGNATATATATAT 

TCTACTTCCAACACCCNTNTTCATCCTGGTNCAATCAAAGCCTGGTTNTGGCCAACAANA 

AACTCGTCAGGAGATCGAAGGNTGTAGATGTCTGCACGTGGCTTCCTTGGAGGTCCAGNG 

GNGACTCCCTCTTCCAAAATCCATTCTGTACCCGCTGGCTGCTCTAACGGGCAGGACAAC 

AGCGTATGAAGCCTGACTGCAACTAGGAGAAGTACCACACTCCCGGACGCGTGGGTCGAA 

GCTTGTACACCT 

Sequence 507 

GGCCGAGGNCAAGCTTTTACCCACGCGTTTGAANCCATCTGTTTGGNACCCNGAAAGGGG 

GCAGGAAAGGCTGGGGTCCCAGNCCACCCTAAGGGNATCTGAGTGGCCCAGGGCTNCAAG 

NNNNCCCACCTGNCCAATGGGACCCTTTCTGNCCTCACCCTACAAGGGGCACAAAGGGAA 

GACACCAAACCTGGCAGGAACTTTTCACGCAATCAAGGGAAGGAAAGGCANTCCTGGCAG 

AGGGAACAGCANGCCAAGCGGGAGAAGGCTCAAAGTAAGGAGGGTAAG 

Sequence 508 

CCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCGTCCGCTTTTCAATTTTATTGTAT

AGTTTTTGATAATGTTAATGTCTGAGATCTTTATGGGTGAGTCTGCTGTCATTTCTGCTA 

mCTCGTAGTGATTTGCTTGTATGGTTTATGATTTTTTAMMCTGMTGTGT^ 

ATTGTGTCTGGTAATTCTTTAGGGACCCATTGTAGATGTATTTCTTCAAAGAGCATTTGT 

GGTTATTATATTTGGGTGCTTGGGGCACTGCCAGTACCTGCCCG 

Sequence 509 

CCGCGGTGGCGGCCGAGGTATTGAACCAGGTCAAAACATTGTTGAATATCAAACCCAATC 

TATTTAATCTGTAAGAAACAAGGACCCTGAGAAAGATTCTGACCAAGGGTATGTGATCGG 

AAACTTGACAGATAAATGTAGTATACTTGTAAAGCCATACTGTGAAAAACTTGGGGATTA 

TTTGAACACAAATTATCACCTGGAAAAAGACAGAAAACAAGGCAGAAGACTGTGCAAAGA 

GGTTGGAATATTCAAAACTTCAGATTAGAAG 

Sequence 510 

CCGCGGTGGCGGCCGAGGTACCCAANNGATATCACTGTGAAGGTTTGGATACACTGACTG 

AGGAAAAAAGAAGGTCCTGAAAGCGTCTAGACAAAAAAAGACTACTTGTAAGTTGCAAGA 

ATCAGAATGGCATTGGACTTCTCAGCTTTCCTCATTAGAAGTTTAAGATCTGAAGCAATC
TTTAAACTCGTGAGGAAAATTAAGTCTAATAAATAATTTTCTTCTTAGCCAAACAATCAA
ATGTGAAGCTAGAATAAGCATTTTCAGGTAAAAAAAAAAAAAAAAAAAGT 
Sequence 511 

CCGCGGTGGCGGCCGCGACCACGCTTGGCCGCCCGGGCAGGTCAAGCTTCGACCCACGCG 

TCCGAMTTAATGAAATGTTTTACATTCTTTTAAAAACCTTTGAAATATGGTGTGTATTT 

TATGCTTTAGCAAATCTCAGTTTGGACCATTTCAGGTGGTCAGCAATTACACATGGCTAG 

MCTAAGAGCAATCAGTTTTNTTCCACAGTTTTTCTMMTTTTCTTGTCAAAAATCTTG 

ATGGTATGA ATfAC TCTTTTAAAAAGTGCACTTNACCAGCAACAGAAAANAACCCTGGAG 

GGGTATGGGTTTTAAAGCTGGTACCTNGGCCCGNTCTAGAACTAGGTG 

Sequence 512 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTCCCAAAGTGCTGGAATCA 

CAGGAATGAGCCACCACACCCAGCCAAATTGGGCACAMTTTAAAATTTGACTTTTATTA 

ATGATATGGTAAAGAGATCTAGCTTGGTCATGACACCCTTGTGTTATACGGTGACAGGCA 

AATCATTTAAAAATATCTAAACTATAATTTCCTGTAGTTCACATGAATTGGATATTCTGA 

AGCGGACGCGTGGGTCGACTTGTAACCTGCCCGGGCGGCN 

Sequence 513

CCGCGGTGGCGGCCGCCCGGGCAGGTACAGAAAGGACAAATACATCAGTAGAAAAGAAGA 

CAATATAAGGGCAGATTGAAATATATACGTGAACGTCACAAAGACCAATTACTGCCATTT 

CAATTCAATGAGGAAATAATGATGTATTTAATAAATAGTGCTAGAATGCTGCATTATCTG 

TCTAGGATGAAAAAAAAAAAAAAAAAAAAGT 

Sequence 514 

CACTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTCACTATAGGGATCT 
AGATCACGAGCGGCCGGCCGCACTTTTTTTTTTTTTTTTTTTATTTCTGCCACCTCTTTC
ACTTGGGAATCTATTTTCACTGCTCTCCAAAGTTTTGAGAAGGCAATAGTCCTGGAAAAT 
GGGTCCTGAGCTCCTCTCAGCAGTCCTGCTTTCTTTCCACCTGCACTGTAAGGNGACCCT 
AACTGGGTCTAAGACAAAAAGTCGGACGCGTGGGTCGAAGCTTGTACACCTT 
Sequence 515

CACTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTGTTAGTTACCACTTCA 

TTACTGGAGGGCACTGTCACAAACTTCTGACTATCCAGACTTGAAGCTGGAAGCAAATAC 

AAGTCTGAGGGGCTCTAAGCTGGGAGGTTCTGGCGTCTCCCTAGCTCTCTATGGCTCTAC 

CTCTCTGCTTGMGCTCCCTGCACTGCACTCCCATTACTCTGACTGGGGATAGGACCACT 

GCTGACAGGGCCCCACCTCAACTTCTTTCATTGCCCTCTTCCAGGAAATCCCACCCTGGG 

ATACTTCAAAGACCTCATATGCTACAAAGATCAAGGCCACCTAATGAGTGCTCTAGAGAT 

CAGCACCAAAGATGCTTGCCAGAGTCTTCTCTA 

Sequence 516 

ACACTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCAC 

GCGTCCGGGGAACCCTTGTTCAGGTTCTTTTTAGGCAACCCAAGCCAAGACAACAAAGTA

AGATAGAGCCCCAAATGTGGTCGTATAAGGTTTTTCAAAGAAAGTAACACTTGAGTTAGG 

TCTTAAAGTTTNACCTAAGAAACTGCCCAGGTGGACAAGAAGAAAGGGTGTTCCAAGTAG 

AAATAATANCATGGACAAAGGCAATGTAGCAGGAAAAGTNTTCGTAAATTCAGGGAATTT 

CAAGTGTTTCACGATGGAAGGAGCAATAGAGTCATTTACTTGCGGTGGCAGGGGATGTTG 

GAAATGTAAACAAGAGTGAGATACAGAAGATTTTATGTGGCATGCCAACTGGGACTTTTT 

TTTGTAAACAA 

Sequence 517 

TCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCGTCCGGAGAAATACCATTTGCACA 

GTCAATCACTTCTGACCAAGCTTATCAGAAAAAGGAGAAAAGAATGTCTCCCCACTAAAT 

GTTCTAGGGNGGGNGAGGAAANCTAGGGTGGNTATCTAAATCAACAAATATTCTAGATAT 

TCCAATATCTAAATTATTGTTGGAAATACTCNTCCTGAAGNGNTCATTTGAACNCTAAAG 

CAGGAGNACAGCNTTTGTTGTATCAANATGGGCAGGGGTTTTTAAAGGGTNTCCATTTTT 

TNTTANTTTCCNCATTATTAAATTCCNTTWTAMTNNTTTTTAGGACCAAAAATTTTTCC 

CNTTTCTTNGAGGTNTTTAAAGGGGGATTTAANAAAATGGGNNANNTGGGGGGTTT
Sequence 518

CTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATGGTAGAGAAAGATTCAA 

AATGCTGCTGTTCTACCTGAGATGGGAAAAATGAAAGCAAAATAACATCAACAAAAACAA 

ACAAACAGCCTTGTAGTTCCATGTCACTAGCCAGGGATTTAAGACCAGCCTAGAGAACAT 

GGTGAGACCCCTTCTCTACAAAAATAAAAATAAAAATACAAAATAAGCTGGACATGGTGG 

TGTGTACCTGCCCG 

Sequence 519 

ATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAATGCTTCGACCCACGCGTCCGCTCACT 

TCATCCTCCCAGCAACCTATTATGATCCATTGCCACACCAACTTGCTGATGAGGAAAGTG 

GGGCTTMGGAAATTMAGAGCTGTTGTGGGACTTCCMAGCAGMGACAGTAGGCTTTC 

AGAAATTTGATAAAMTAGCACTTTGCATTTNTTGMTCTTGAGCTAAATGGAAATTAAT 

ACTAAACATTCTNCACTGGTAAAATAGAGAATAAGGATATTAACAGTAAAAGAAAAGAAG 

AAGAAAAGGAAATGTGCTTCCACAGATTTAGAAACATAAGTAACAATCTAAGGTTAAGGC 

TTTTGGCACCTGCCCGGGCGGCCCGCTCTAGAACTAGTGGGAT 

Sequence 520 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCTTCGACC 

CACGCGTCCGAGCTATGGACCTAAGGCAGCGAGTGGATTCATTAGTCCTCTTTCAGCTGA 

ATGCATGCTACAGTATAAGAAAAAAGCTGCTGCCTATATGAAGTCTTTGAGAAAGGTTTG 

TTAGCTGCTGTTAATATTTAAATCAGAGGAAACATCAGGAGTCATTCTAGAGAATGGCAA

GAGTTTTTCTGCAGTTTATATTGTTGACTTmATACGATATTG 

CTAGAACTAGT 

Sequence 521 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTTCGACCCACGCGTCCGC 

TAGGAACTATGTTAAAAAAAATTCAAGAAAGAATTTAAGGGAGATTACAGTGTTACTGTG 

ACACCAGGAAAACTTAGAACTTTGTGTGAAATAAGACTGGCCAGCATTAGAGGTGGGTTG 

GCCATCAGAAGGAAGCCTGGACAGGTCCCTTGTTTCAAAGGTATGACACAAGGTAACCCG 

TAAGCCAAGGCACCCAGACCAGTTTCCATACATAGAACCTGCCCG 

Sequence 522 

CGACTACTATAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCACTTTTTTTTTTTTTTT

TTTTAAGTAAAGAAGGTTTTATAATATAGTGAAAACAATACGGAGATGAAAACCAGGAGA 

CCTGGGTCCCGCCTTTGTTACAAATGCCTTTCCTAAAAGCTCCAGAATGGTGCGAGGTCA 

AAACAGATGGGCAGAAAGGAAGTGGTCATCAGAGCAAGAGAAAGAGCAGGTGCCAGGCAC 

TCACGTGTGCGGTCATATCAGGTAGAGATGATGAGTAGAGATCTGCCCTAGAAGACACTG 

MTTCTGAGATTCAMGGGGAAAAGTTGATrTTATAGCCAGTGATTTTATAGCCCACTTT 

CCTGCCCCACCCCTACTNTAAGAATTGCGGACGCGTGGGTCGAAGCTTGTACCTGCCCG 

Sequence 523 

CTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCCCAGAATA 

ACTAAATAAATAAAAGGCTAAAGAAAACTGGAACAGTACTGCGTCTCCATCTGAGACGCA 

NTCTTCTACTTCCAGCATCGNAGAGAAGGGCTAGGGACA A I M I I II I CAAAAGATTTAT 

ATACAGGCTTGAATCCAGAAATTAAGGNTAAAAGCATAAATATTGATAATTTCAACTAAA 

TTCAGAATGGNTTCAGAAAGATATGATACAACAATTTAGAATAAAACAAAGCAGAAGAGC 

ATNATATTTTGCGGACGCGTGGGTCGAAGACCT 

Sequence 524 

CCGCGGTGGCGGCCGAGGTGTACAAGCTTCGACCCACGCGTCCGCTTAMAGATTTTTTT 
TTTATGTAAACTGTTGAATATTTGAAATAGTCCACTTCACCTTAATGGGTCTTGTCTATC 
TTCATTAGTCTTCAAAGAAAAACCATTTGCTACCAAAGTAAATCAGTATTTTGAATGTGC 
TTCTCTTGTTTTTTGTTTATTAGCTAGTTCCTC 

MTCGTAAGGAAGCTGTTTCTTTTAAAACACAAACCACCACCAAAAATTTAAATGTACCT 

GCCCG 

Sequence 525 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTTTCAAGACCCGCCTGGC 




CAACATGGTGAAACCCCATCTCTACTAAAAATATAAAAATCAGCCGGGCATGGTGGCATG 

TGCCTGTAATCCCAGCTACTCAGGAGTCTGAGGGAGGAGAATCACTTGAACCTGGAGGCA 

GAGGTTGCAGTGAGTCGAGGTTGCGCTACTGCACTCCAGCCTGGACAACAGAGGGAGACT 

CTGTCTCAAAAAAAAAAACCTACAGCTGTTCAAGGACCAGCTGACAGGTCAAGTGTGGCC 

TTTTCTGGTCTTTGAACACATCATAGAAAGTGACAAATGCTGCAAAGCCATGAAGAACAT 

GAACTATAAACGGGTAGACTAACTGCCCAGCTTAGACACTTATCTATGCCACAAAACAGC 

TGAATT 

Sequence 526 

CCGCGGTGGCGGCCGAGGTACCAAAACCCGGCTTTTTTCGAAATACCTGCAAAAAAAAGT

GGATGATTCCAAATCCAACTGAAGTGTCTGCTCTCTCTCCAATTCAGAACAACCAGAAGG 

GCCTGTCTTGAATTAGGTAATGCATTAAAGAAAAGTAGGATTATTATATTCCAATTCTTT 

CCATCAGATGTAAAACATTATTGGTAGATCACATCTGTTTTAATAAATCTGTAAGAAAGA 

CGTGTAATTATAATTATGTTACCATTGTATGTAAATGGCATTTTAACAAGACATATTAAT 

ACATTTTTATAGAGTACCTGCCCG 

Sequence 527 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGT 

CCGGTTAATTTGGGAGAAGGAAGAGAGAGTGACATATTTGGCTACCTTCAGGGAACAAAA 

TCTAACAGCACAGATGGTAGTAGAGGAGATACCAATTTACATATTAAGGAGCTAGAGTTG 

ATGATGGTATGACTCAGCCCTCTGAGATTAAATTCTACTTACTAGGGCTATGAATGGAGA 

TAAGTAGGTATCCCACCTTTTATTAGAAGGTTCCTTAAAATAAATATGGGACTCTGGTCA 

GAGAGTAGGGCCATTAATTTGCTCCTGGTTTTTACCTGGCATCCACCCACCAGTACCTGC 

CCGGGGGGCGGNCGGCCCGCCCCGGGCAGGTCCCGCACTTTTTTTTTTTTTTTTTCCTTT

Sequence 528 

CCGCGGTGGCGGCCGCCCGGGCAGGTATTGCCCTTTGATGTCCCCATGAGGGCCAGGCCC 

AGGCAGAACCCATCCCATTTTATCCTTAAACTCAGAAGGAAATTTGTCTAAATATTAAAG 

GATTAATATGGGGAATAAAAAATGAACCTTAAACCCTGCCACTGATACACAAGCTGTCTC 

TCTTAGAGTTCAATGAACACTTCAGGAGAGTATTTCCAACAATATTTAGATATTGGAATA 

TCTMATATTGTTGATTTAGATAACCACCCTAGATTTCTCACCACCCTAGAACATTTAGT 

GGGGAGACATTCTTTTCTCCTTTTTCTGATAACTTGGTCAGAAGTGATTGACTGTGCAAA 

TGGTATTTCTCAGCTAAAATCTCCCTTATGAACCCTTCTCGAAATCC 

Sequence 529 

ACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCCCATTTCCCTGAAACA 

AGCAGCCAGCAACTATCTCAGAAATGTGTCATTTTTACTGGTTATAATTCTTAAAAAGCT 

TGTTTTCCTAAGATATGAAATGCCTGCCAGTATACAAACTGTTGTAACTACTTCCCTTTT 

TGCTTTTAGCGGGGAAAAAATAGCTTAATGACAGCATAGAATCATGTAGTAAATATAATT

CATTTTTTGAAAGGTTCAGCTATATCCTCTTCCATTTGTTTATTTTAAATGATCTAATTG 

CAAACATGTCATCACTCCCTTGATGTTTACCTCCTTGTTATGCATTTTTAGCAGGCTTTA 

TTGTCACCTGAGATTTTTTTTTTCTTTGACAGGCCGGAGTCTAGATGAAGGAAAATGTGT

TAGAAGCACCTTATCCACAGATGGGG 

Sequence 530 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGTAGCAGAGCAGCTCCCTC

GCTGCGATCTATTGAAAAGCTTGAGTCGACCCACGCGTCCGCTGCATAAAAGTTATGCAA 

AMGCATTTTATGATATACCAGCAAAAAACATGGAAAATGAAATTTTGAAAAGCAATGCC 

ACTTCAAAGATCCCTCAAGTGCCTAGAGGGAGAAAATGAGTTAATATGCTTTGAAGAACT 

GTATCCAGAAAATAAAATTACAAAGGAGGAGAGGGATAGGATTCCAGGACAATCTCAAAA

CTATTGCTTTTTCCTAAATTCATTGCAACCTTAAAATCCTAGCAAGTTCTTTAATGTAAA 

TTAACAAGCTAATTCTAGAATTCATATGCATATTCAAAAGTCGAATMTTGTCAAGGCTA 

TCCTGTAGAATGGGACAGAGAGGATTGAAATTT 

Sequence 531 

CTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTAATCCCGTCTTAC
AGAAGAGAAAACTGAGATTTAGCAACATAAAAGTATTTCCCGTAAGTAAACAGTAGAGCC

AAGATCTTGACCTACGCCATCTGATACCCTGAGCCCATGCTATAAAAGAGGAGCATTAGA 

MTATTTGAAAGATAGAAATGAGAACTAGTCMTATTTATTTTGCTTAGCACTGTATTCA 

GTATTATGGCATCTTAAAGTAGTTAAGACTCAATATTTCATCAAAAAAGTTTAAAATCTA 

ATCAGAGAAT 

Sequence 532 

CGCTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAATAAGGCCACCC 

CACTAGGAACTATGTTAAAAAAAAATTCAAGAAAGAATTTAAGGGAGATTACAGTGTTAC 

TGTGACACCAGGAAAACTTAGAACTTTGTGTGAAATAGACTGGCCAGCATTAGAGGTGGG 

TTGGCCATCAGAAGGAAGCCTGGACAGGTCCCTTGTTTCAAAGGTATGACACAAGGTAAC 

CCGTAAGCCAAGGCACCCAGACCAGTTTCCATACATAGAAAGTTACAGCTGCTTTTATAC 

CCCCTTGCCCCGCCAACGTAGT 

Sequence 533 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCGCACTTTTT

TTTTTTTTTGTAGAGACAGGATCTCCTTATGTTGCCCAGGCTAGACTTGAACTCCTGGGC

TCAAGGGATCCTCCTGCCTTGGCCTCCAAAAGTGCTGGGATTATAGGTGTAAACCAGTGT

GCCTAGCCTACAGTTTTTTAATTTTATMAATGTTATTTCTAATTTTTCTCCAAAAGTAA 

AAGTGGCATTCCAATGGCAATATTAATTCAGGTATCCAGAACTCTTAACCTAAATTTGGG 

TGAGATGAGGAAAAGTGTATTGTTAATTTTATGTGTCAACTT 

Sequence 534 

CTACTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTCTTCGACCC 

ACGCGTCCGTAGTAATAGGAATTAACTGAGCCCTTTTGGATGGGGGAGAGCATCAGGCTG 

GGGTCAGGTAAGTGTAAATGGCCTTCTGAGCATGCTCTTCTAGGCTGACTCCCAGCCCTG 

ACTTGAMCCATTAGCGCTAACTTGCTCTGTTTTGAGAAAAACTTTCCAAACTTTTGCAT 

GAGAAACTAGAAAAAGGAATGTATGCCACGTAACTGGATTACAGAAATGAGTTAATTGTC 

TCT GTGA TAAAAAAAMMAATGAMTATTTTCTTATTGAATTAATATTTTTGTCTTGAA 

GCATTTTCTAGTGATAGAATGTATTTGTCTTTTTTCCTGGTGGTACCT

Sequence 535 

CCGCGGTGGCGGCCGCCCGGGCAGGTGTCCCCATGAGGGCCACGGCCCAGGCAGAACCCA 
TCCCATTTTATCCTTAAACTCAGAAGGAAATTTGTCTAAATATTAAAGGATTAATATGGG 
AATAAAAAATGAACCTTAAA 
Sequence 536 

GAANTGGAGCTCCCCGCGGTGGCGGCCGAGGTCCAGTAGATTTGGAGAGTAATACAAATC 

CTTTCTTTCTGGTTAGAACACACTGCCAAAAGCCACCTCTTTCATCTAAGGAAAAGATTA 

AAAATGCATGTTGATATCTCCTAACTATCACACAACTTCCACTATTACAATGAAAAATCT 

G GTCCC CTTTCATTGCCTTTGAAAACCNTTTTGCCGAGGTGGNTTTCAAAAAAACNCGNG 

ANTTTTNAMAANTTG GNTTTGGTTTTACCNGGGGAAAGGGGACNTTTNNCNNTTTTTTT 

I I 1 1 I I I I I I I I II I I I NAA ANGGNGATTNGGTTNNGGTTNTNCCTGGGGCCAAAATNCC 

NTTT TGNGGAACCTTTTTTGGGGTCCNAAAANNNACAAAANAAAGGGNTTGGGACNATNT 

TTfTGNATNCNCNCNAAAAAA Al I I I I I I I I I I 

Sequence 537 

ACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTTCGACCCACGCGTCCGCTA 

GGAACTATGTTAAAAAAAATTCAAGAAAGAATTTAAGGGAGATTACAGTGTTACTGTGAC 

ACCAGGAAAACTTAGAACTTTGTGTGAAATAGACTGGCCAGCATTAGAGGTGGGTTGGCC 

ATCAGAAGGAAGCCTGGACAGGTCCCTTGTTTCAAAGGTATGACACAAGGTAACCCGTAA 

GCCAAGGCACCCAGACCAGTTTCCATACATAGAACCTGCCCG 

Sequence 538 

CACTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCAAGCTTCG 
ACCCACGCGTCCGTGATAACTTCTCCTAAGTGCCAGGCATTGTATTACATGCTGGGAGCA 
CAAAGATGAATAATAACAATAGGTTCACAGAAAAGATGAATTGATTGAGAGAAAAAGAAC 
CCTCCAGGAGCCCTCAGCGTAGTAGGGGGTTGGTGTTGGAGNGGTGGGAGGGAATGGAAA
AGGCCCTGA
Sequence 539 

AATTGGAGCTCCCCGCGGTGGCGGCCGAGGTGACAAGCTTCGACCCACGCGTCCGCAAGT 

TTTCAAAATGTAAATACTTCCTCTGTTTAACAGTCCTTGGACCATTCTGATCCAGTTCAC 

CAGTAGGTTGGACAGCATATAATTTGCATCATTTTGTCCCTTGTAAATCAAGATGTTCTG 

CAGATTATTCCTTTAA 

Sequence 540 

TACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCCCATTTCCCTGAAAC 

AAGCAGCCAGCAACTATCTCAGAAATGTGTCATTTTTACTGGTTATAATTCTTAAAAAGC 

TTGTTTTCCTAAGATATGAAATGCCTGCCAGTATACAAACTGTTGTAACTACTTCCCTTT 

TTGCTTTTAGCGGGGAAAAAATAGCTTAATGACAGCATAGAATCATGTAGTAAATATAAT

TCATTTTTTGAAAGGTTCAGCTATATCCTCTTCCATTTGTTTATTTTAAATGATCTAATT

GCAAACATGTCATCACTCCCTT 

Sequence 541 

ACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATTGGA 

TTGTTAAAAGAGGAGTCTAGAAAAATTAATCCTGAACCCTAAAGAATAAATCTTAAGTGG 

TGGATACATGGGTTGAATAGTGTGCTCCAAAATTCACATCCACTTGAAACTTCAGAGAGT 

GGCCATATTTGTAAATAAGGTATTTGCGGGTGTAATCAGTTAAGGATCTCAAGATAAATT 

CATCCTGAATTATAAGTTGTCCTTAAATCCAATTACTGGTATCCTTACAAGAAGGTGAGA 

GGAGAGAGAATAGAGCCATCTGAAAAGGGTCAGAAA 

Sequence 542 

CTAACACTACTATAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGAC 
CCACGCGTCCGCAAAAATCAATCAAGGGTTCCTACTCAAGTAAAAAGCAACTTGTAGGAA
AATAATAGGGGATATATTTTGCTCATTMGGATCTTTTTATAGTGGCTCTTGGTGCACTG 
CCTGTGAGTTAGCCCTTATCCTCAAGGAGCAGCTTAAAAAAAAAAAAAAAAAAANGT 
Sequence 543 

CTACTATAGGGCGATTGGANCCTCCCCGCGGTGGCGGCCGAGGTACTTCCTGGAAATCAA 

TTAACTGAGTCTTTTGAAACCCCTAGAGAAGATAGGAGAAAATTGGTTCAGANCGAGCAT 

TTAMTTAAGTCAGCAAAGTCAGAATTTAAAATTGGGCAATTCCTTGTCTACATTTTCTT 

TACACTCAA 

Sequence 544 

CTNACTATAGGGCGANTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTGCCAAAGCC 
T TAAC T TAGA TCGTTACTNATGTTTCTMATCANGTGGAAGCACATTTCCTTTTCTTCTT 
CTTTTCTTTTACTGNNMTATCCTNATTCTNTATTTTACCAGTGGNGAATGTTTAGAATT 
MTTTCCATTTAGCTCMGATTCAAGAAATGCAAAGTGCTATTTTTATCAAATTTCTGAA 

AGCCTACTGTCTTCTGCTTTGGAAGTCCCACAACAGCTCTTTAATTTCCTTAAGCC 
Sequence 545 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCATCACACAGAAGGAGG 

AGGGAGCTAATCCAGTAACAAACATTCAAAGATTAAATTGTAGATATGCACCTCTGTATT 

TGGCACTGTTGATTAATATTATAACACCTTCCTCTCAAAGACAGGCATTCTTAAGCGTTA 

GTCACAATATACCA GMTT TGCTATTCATATTAAAACCACCTTTTAAACTTTATAACAGT 

AACCAATTATTATAGTTTTAAGAAACAAAACGCAATGAGAACTGGGAATGGAATTCAAAT 

CCTCCAAATTCTTGCTATGCTCCAAGCTGCCATCCATAAAACAGGTTTAATTTGGGTAAT 

TTTTCCATTGTGGGGAAGGGTCAACAAGAAACAATTTAAAGACAATATTTTCCAATACAA 

ATAAAGACATACACTTTTTGTT 

Sequence 546 

TACTTAGGGCGATTGGNANNTCNCCGCGGNGGCGGCCGCCCGGGCAGGTACAAGCTTCGA 

CCCACGCGTCCGMATAATAAAGCTAGMGTAATATTTTTCCTTTTGTCTATTTTCCAAA 

TTGACTCGATATTGATGGCTACTTTTGTMGTTTTTATTTAAGNTTAAAGGGAATATTTA 

TTGATCACCTCTATGTGCTCANTACCT 

Sequence 547 
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TACTATAGGGCNATTGGAGCTCCCCGCGGTGGCGGCCGCAC I I I M I I I 11 1 i 11 I I I I I 

TTCCAGTGCTAAMCATCAGATMGAGCCTACCTGACATTTTGGAGMTTTGCTGNGCTG 

GGATTGATATTCCGCATTGCCTAAGAGTAAAAAATAAGACGGACGCGTGGGTCGAAGCTT 

GACCTGCCCGGGCGGCCGGCCGCCCGGGCAGGTACCACAGGAGGCAGAAGGAAATCCTCA 

ACCTTCCGAAGAAGGCGTAAGCCAGGAAGCAGAAGGAAACCCCAGAGGAGGGCCGAATCA 

GCCTGGCCAGGGATTTAAAGAGGACACACCCGTTAGGCATTTGGACCCTGAAGAAATGAT 

AAGAGGAGTAGATGAGCTTGA 

Sequence 548 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGNAGGGTCGCGGTGGGTCGA 

CTNANGCTAGAGAATTGTAATACGACTACTATAGGGATCTAGATCACGAGC 

Sequence 549 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTTTTTTTTTTTTTTTTTT

TACCAGMCATCACATMGTTTATTTCAGATGTAACAGCAATGTTAAMTTGACAAGTTT 

MTTCTTAACTGCACCMGTAAACTTAGCCATTTAAGTATTTTTTTMG 

AAMAACTGAGGGAGCTTTTCTTTTCCACCACCACACCATGGTTTCCCAATAGTTCTCTT 

TTTGGAGGACTTTTCMTTGATGAGTAAACTGCTTTAGATATTTCAGAACTTCATTCCCC 

MATGMAGCTAATCTGGACMACTATATATTGCATAGATTTCTCTACAGATTCTTTGCT 

TTAAAA 

Sequence 550 

CACTAGTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGGATGAGGCGGGGA 
GGTGGGACCCCCAAACATATATCAGCCCMCAGCCCTAAGTCTCCTTCTTTATTATTAGG 
AAAACAACAACAACAACAAACAAAAAAATGGCGTCATGAATATGAACAGCATTGTCAGAT 
GAATTAGTTGAAGTGGN I I I I I I I 1 G I I I I I I I I I I I I 1 I I IGAACCTGCCCG 
Sequence 551 

CTATAGGGCGAATNGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCANGTGCTTCGCCCCAC 

GCGTCCGNAATAATTGGAAANGGCCTATAGATTAAAAAGCTGAGAAAGTATATGGTAGGG 

AGCACACTCCCCACAAGTATGAACTCTGNGATTACGACATCTCATAAATNCATGAGCACT 

CATGTTGGC7TGCTTTGTAGCTATGAACTTACCCTGTATTATTGAAACGTCAGCATAATG 

ACTGGMGGAGAAATTGGTCCATTTTAGAGCATTACTATTATGCTATCTGTCCATTTAAA 

TTAATMTTGCATTAAATTCATTTTAGAAGGNGCTATTACATTNGTAGTAAGAAAGTAAA 

TTCATATATAAATATTTGATTATCAGATGGTTTACTTACAGATACTTATTTTCCTGTAAA 

ATAGGAGAGTTTACCTGAAGAAMATAAAACTTTTNACTTTTCTGGGAAAAAA 

Sequence 552 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGNGGCGGCCGCCCGGGCAGGTACCAAGTGAA 

TTTAMTAATTGGTGTGGATTGGCCAGTAGCTMGMGTGGGCTTTTAAAGAGTNTTGAA 

NATNGMNGGGTTTTTNTTTCTTTTTTAAAAMGAM 

TGA AAAGC TAGGNNTTGCCCTNTTCATGTNTACTCTCCTTCCAAATAGTTATATCCAAAA 
CTGTTTTTCCCTCTCCCCTACCTTGTCCCCCCTATTMAATANAAACNGGGAtTGATTAA 
TGTCCCGCTCCTGAATACATGTAAAATTTGTACCTCGGCCGNTCTAAAACTAG 
Sequence 553 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCGTC

CGCTTGTCAATAAAAAACAAACAATTGCTAAATAAAACAACCTGAGAAAATCTCCAGAGA 

ACTATACTGAGTGAAGGAAGAAAAATCCCCAAAGATTACACACTGTATGTCATTTATATA 

ACATTCTTGAAATGACACAATCACAGAAATAGAGAATACTGGTCACTANTGCATTAAGGA 

AGGTGTGGAAGGATGTAGTGATGGGAGGAAATGTGTATGGCTGTAACAGGGCAACAGAGG 

CNTCATTGTGATGATGGAAGTGTTCTGTNTCTTGGG l Mil I GAATGTCA 

Sequence 554 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGTTAAAGGAATAATCTGCAG 
AACATCTTGATTTACAAGGGACAAAATGATGCAAATTATATGCTGNCCAACCTACTGGTG 
AACTGGATCAGAATGGTCCAAGGACTGTTAAACAGAGGAAGTNTTTACATTTTGAAAAC
Sequence 555
CTACTTAGGGCGAATTGNANCTCCCCGCGGGGGCGGCCGCTAAAGGAATAATCTGCAGAA

CATCTTGATTTACAAGGGACAAAATGATGCAAATTATATGCTGTCCAACCTACTGGTGAA 

CTGGATCAGAATGGTCCMGGACTGTTAAACAGAGGAAGTNTTTACATTTTGAAAACTTG 

CGGACGCGTGGGTCGAAGCTTGTACACCTT 

Sequence 556 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCAC I I 1 I M I 1 I H I I H I i I G 
TTTTGGTTTGTTTTTTGGGCTCTCAGGTTTACCACTCTCTCCC^ 

GAGGCAGGCCTCTNCCTTTCCTACCCTCCCACCCGAAGGTGAGTCACTAGAGTGTTACAA 
GCTTACTGGCAGGGAACACTACAAATTAACAAACAGTAAGTTACCAAAAGGGGATACACC 
CTTATAAAAGGGTACCTN 
Sequence 557 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCACGCGT 

CGGCTTTCAATAGATCGCAGCGAGGGAGCTGCTCTGCTACGTCACAATCTTTCAAAAAAA 

TGAACATGTAAGAAAAAGCAGTTTTCATTGTGCTAATTATTGCAGGCCTTCATGCACGTA 

AACCTCAACAAAATGTGTGCCAACAATATACAAATTTCCATATAAACAAAGTCATTGATC 

ACTAACAAAATATAAACATGGNTTCTTTTATATTAGATTTTTTTTAAAAAAAAGCTATTT

ACCAGCAAGAAAAAACAAGTACCTGCCCG 

Sequence 558 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCCGC 

ATTTATTMGGCTTGTATATGTTCAAGATCCAGTGAAGACTGTCTTGGGCGTGTATAATT 

GATCTTAACCACAAGGCTGAGAAGTTATGTGCAGGGCTTATGATGCTACTTCCAAAGTAT 

TAAATCCTCCAGAGAAGCCTGTAGTGTGGGATGCAAACTATTTTAAGTGTGACCATGAGG

TGTTTTTTTGTGGACCATTTTAAAN^^ 

TAGG 

Sequence 559 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGGCTTCGACCCACGCGT 

CCGCTCCAGGAGACTTCTGCTTACCTCTCAGTGATCAAAAACCGTTTCACCACAGTTACT 

TACCACGTCCTACCGATCCGCATTCTCGCAAGTGTCCTTCACTCCATTTACTCTACTGCA

TTTTTCACTGTATTTCTCATGCCAAAACTTGGGCTTCTCCACCAGTCTGCACACGTTCAT 

GCTCTCAATTCTCACAGCCATCTATTTCATTCTCCACTAAACTGTTAGAGGGATTTCTGN 

AGAAATTAAAGAAATTCCTATCACTCCTAAAAAAAAAAAAAAAAAGTGCGGCCGCTCTAG 

AACTAG 

Sequence 560 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTGTACAAGCTTCGACCCACG 

CGTCCGCAAGTTTTCMMTGTAMTACTTCCTCTGTTTAACAGTCCTTGGACCATTCTG 

ATCCAGTTCACCAGTAGGTTGGACAGCATATAATTTGCATCATTTTGTCCCTTGTAAATC 

AAGATGTTCTGCAGATTATTCCTTTAA 

Sequence 561 

CTATAGGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGCTTTATTTATTTCTT 

TTAGGAATTGCAGGTTCCTAACAAAGTAGGGGTGAGGGGGGTGTTACAAACCAGTCACTA 

GGCAGGAACATTAGACTCCAAAAGCAGAGAAATGCTTAATTTTTCTTCTACCTGTTTCAC 

CACATTCATGTANAACTGTAGTAAAAAAGATGGNGAATCAGGCTGAATCAATCTAAATAA 

CAACTTAAGGCTCCCAAATCACATGAACCTAGGACCACTAAATCCAATGTCAGACGTGTT 

TAMTGGNGCACTGCTCTACATTTTTCTATTATGCAAAGAGCTAGAAAATAATGGTAGTG 

TCATTATGACATTCCATGAAAATGAAGAAAATCTTTCANGAAAAATTTAGAAAATAAAAA 

TGTTTACT 

Sequence 562 

TTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTGCCAAAGCCTTAACT
TAGATTGTTACTTATGTTTCTAAATCTGNGGAAGCACATTTCCTTTTCTTC^ 
TTTACTGNTAATATCCTTATTCTCTATTTTACCAGTGGAGMTGTTTAGTATTAATTTCC 
ATTTAGCTCAAGATTCAAGAAATGCAAAGTGCTATTTTTATCAAATTTCTGAAAGCCTAC
TGTCTTCTGCTTTGGAAGTCCCACAACAGCTCTTTAATTTCCTTAAGCCCCACTTTCCTC
ATCAGCAAGTTGGTGTGGCAATGGATCATAATAGGTTGCTGGGAGGATGAAGTGAGCGGA 
CGCGTGGGTCGAAGCTTGTACCT 
Sequence 563 

CTACTATAGGGCGAATNGGAGCTCCCCGCGGTGGCGGCCGAGGTACCCAGTAATCACATA 
AATTCTGCAATCATCTGTTTATTTAGCTTAACTGTTTTTTTTTTTATTTGTTGAAGTTGT
TGTTGTTATTNCAGTCTTTTTCTTATTGGGTTGACCAGACTTGGTAAAATCTGTAAGAAA 
GTTCCATAAT 
Sequence 564 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTGTTCTTCAGATTCAAAT 

GCCAACACTAATTTGAACTTCTTTGGGTCTATGACAGTTTGCAAGCCATACAAACCCAAA 

GAGCTAATCTGTGATTTCTTAACTTGAGAAAATAATAATNATAACCACCACTGGAACCTA 

CATAGGTTTGTNGNTTATTTAACATGACTTMCCTTTTGTTTGTATTTrTTTGAAAAAAA 

AAAAAAAAAAGTGACCTGCCCGGGGCGGCCGGACGGCCGGGCAGGTCGCNCTCAAATTNT 

TNAATTThnTTTGGAAAGACANGNATTTTTTATTTTTGCCAANGCTAAAACTTNACNCTG 

GGCCTTTAAAGGGGATTCCNTNCTGGCCTTTGGGCCNCCAAAAAGTGCTTGGGATTNTTN 

GGGTNNAAACCCCGGNGGGGCCCTAGCCTACCAGTTTTTTTAAA 
Sequence 565 

CTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCGTCC 

GTGCAAATTTGACTTT GTAAA TGGCCCTTGGGCTCfGGGAGGAAAGCAACTGTTGGGCCA 

TGTGGTTGTATCTTTAGTTTTGTAAAGAATTGCCAAACTGTTTTATAATGTGGGTATATC 

TTTCCACACTTCCAGCACAATGTATGAGTGATCCAGTTTCTTAGCACCATAGTCAGAATT 

TACTGTTGCTACTATTTTTTAGCTATCCTGATAGATGTGTAGTGATATTTTATTCTGGTT

TTGAAGCAGTGTCATTGTCTGGGGTAAATCCTTGAGGTTTGTTGTCTCAGTCAAGGGGAA 

TCAAGGGACATGGACACACAAGTAGTGAATTTAAGAGTGGAAGTTTAATAGGTGA 

Sequence 566 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCAAGCTTCGAC

CCACGCGTCCGTCTTATTTTTTACTCTTAGGCAATGCGGAATATCAATCCCAGCACAGCA 

AATTCTCCAAAATGTCAGGTAGGCTCTTATCTGATGTTTTAGCACTGGAAAAAAAAAAAA 

AAAAAAAGT 

Sequence 567 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGGCAGGTCAAGCTTCGAC

CCACGCGTCCGCATATTTTCATTACTTGACAATGTGGGATGGGTGCAATTTATTCCATCT

TCACTAAAATAGAAGCAATTCCATAGGTACCATAAACCTATTTTAGGTACCACAAGGTGT

CTTTTTACACAGCTCATTTGAATACAGGTGTTCTGAGMGGGGTTTCTATTTTAAAATTA 

CCATATCAAAATAAATGTGCCTTATTTTTTTATAAGNCTTGTTAAATCAGTGTCCATATT 

ACTGTTTGGGGAAGG 

Sequence 568 

CTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCAGGTACAAATAAGCCCACCCCACT 

AGGAACTATGTTAAAAAAAAATTCAAGAAAGAATTTAAGGGAGATTACAGTGTTACTGTG 

ACACCAGGAAAACTTAGAACTTTGTGTGAAATAGACTGGCCAGCATTAGAGGTGGGTTGG 

CCATCAGAAGGAAGCCTGGACAGGTCCCTTGTTTCAAAGGTATGACACAAGGTAACCCGT 

AAGCCAAGGCACCCAGACCAGTTTCCATACATAGAAAGTTACAGCTGCTTTTATACCCCC 

TTGCCCCGCCAACGTAGTTAAGAGAACAGCAGCATAAGCGGCTGGCAGAGGCAAGGAAAG 

ACCAGTNNAGAGAAAAAAAGGCCATCTATACCAATTCTAAG 

Sequence 569 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGAGCGGCCGNC 

CGGGCAGGTACACAAACCAGATGTATGCANTGATGCCAAAAGTCATCTNAAAATTCCAAG 

CTGACCTAGTGCAACACATTTACACTTGGATAAACTATCACCTTGAAGATTTACTTTGTA 

TACTTCAAAAGATCTGGTCATGAAATTTTTAGCTAATACATAAAGNGCCGAATTGAAATC 

CAGAATACAATAGCTNTGAAGGGCCGCTAGAGTGCAGATAACCAACCCATTCTACCTAAC 
CTCATGGGCACACTTAAAATAGTTTGCATCCCACACTACAGGCTTCTCTGGAGGGATTTA
ATACTTTGGG 
Sequence 570 

CTACTTAGGGCG MTTG GAGCTCCCCGCGGTGGCGGCCGAGGTTATTTTCCCTCAGTAAC 

ATGTAATTGCTACATTTTTTATAAGAAGGTATGGTTAGAAAAAAATGTGAAAGATCACTT 

AAACCMAGCCAGTTACAAGGAGTAATCTCTCCTGTTGGTTTACCTTCACCTCANAACTA 

CAAGAATATTACAATACATAGTGAATAGTTGTCTGTAACATTTCTACCAGTTGTTTCANT 

AGCATATTGGTCTTGGCATTTCTTGGCACTGTGGTTCTGCTGTATTATTTGTGATGTCTT

ATTGTTTGTG AGCTTTTG I I I I I I I I I AA AG AA A AAACAA AAACTAAGTG 

Sequence 571 

TAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCCAGTAATCACATA 

AATTCTGCAATCATCTGTTTATTTAGCTTAACTGNTTTTTTTTATTTGNTGAAGT^ 

TGTGTTATTTCAGNCTTTTTCTTATTGGGTTGACCAGACTTGGTAAAATCTGTAAGAAAG 

NTCCATAATTATGGGGMGATTTCCTCTGAATTGGCTAAATTCCTGTAGCTGAAAAAAAA 

AAAAAAAAAAAAGT 

Sequence 572 

TTAGGGCNATTGGAGCTCCCCGCGGTGGCGGCCGAGGTGTACAAGCTTCGACCCACGCGT
CCGCMGTTTTCAAAATGTAAATACTTCCTCTGTTTAACAGTCCTTGGACCATTCTGATC 
CAGTTCACCAGTAGGTTGGACAGCATATMTTTGCATCATTTTGTCCCTTGTAAATCAAG 
ATGTTCTGCAGATTATTCCTTTAA
Sequence 573 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTTTACGGC 
TTCTGACAGGCCTCAGAAGAACATTCCCTACCCCAAAATTATAAAAATAATCTTGTATAT
ATTCT TCTCMAACTTTATACTTTTTTTAAGGCTTGGATTTTTAAfCTATCTGG 
TTTTTAMTACTGAGTGAGTCACTTTTCTCCCGGACGCGTGGGTCGAAGACCT 
Sequence 574 

GAATTGGAGCTCCCCGNGGTGGGGCCGCTCGAGGCCGCTCTGACCTCTTTAAGGNANACT 

TATTATGCTAATAGTTGATGCGCCCTTTTGCTGNANCAGTTACCATAGGTTACATGATAA 

NTATATATTGTTGNGATGATCTGATTNCCTGNNTNCCCCACCCNTGCAAAACAACAACAA 

AMCCTTTACCAGGCTNTATAACANGGGGACCMACTTGNTTTTGCTCATCATTGCCGGA 
CG 

Sequence 575 

AGGTACTTACCCCAGACAACGACGCCGCTTCACCATGATGATGGACAACAGGCAACTTTT

TTTTTGGAGTTTCAGCTTGCTTCCMCAGGGACGGTGAGTGTGAGGTTTATTCCCATTTC 

TAAGACGATAGAAGTTTTCAGCCTAAGCCGTATTCCTAGGTAAGCAGCTGGATTGCAAGT 

TTTGTCTTGGAAATTCTCCTTAATGACTAAAAGTTAAAGAATTAAACAACCTAGCTGGGC 

TTAAATTTCTTNCTTACCCATTAGAAGTACCCTTGCCC 

Sequence 576 

TAGGGCNATTGGAGCTCCCCGCGGTGGCGGCCGNGGTCTAAGAGACAGGGTCTCACTACA 

TTGCCTAGATTGGNCTCACACTCCTGGGCTCAAGCAATCTTCCTCTCTTGGCCTCCCAAA 

GTGTTGGGATTGCAGGTGTGCGCCACTAGGCCCAGCTTGAAAAATTTTTAATGCATGTGG 

TAATCCACAGGAGATCACATTTAGTATATGACCAAGTTAATTAAGAAGNCAAAAAACACG 

TTMATTTAAGCAGMTAAGGCTGGGTTCGGTGGCTCATGTTAGTTTTTATCCTTAAAAT 

TGTCTGAGTTCTTAGAACACAGAAAAAACAAATTTGAATGCATTTCTAACAGCTTAATAA 

TTTATATGTCCCATTATG 

Sequence 577 

GGGGNGGCGGCCGCCCGGGCAGGTCAATGCTTCGACCCACGCGTCCGATTTCAGGTTGAC 
TTTTCTCACCTTTAACCTCTTTATATAGCACAGTGCAATCTGGCCCTACTGCCACTTCAT 
CTGGGTTATCTGTAGCTTGAGTTGTAAAAAAAAAAAAAAAAAAAGTGCGGCCGACCT 
Sequence 578 
AGGTGTCGACTCAAGCTTTCAGATATAGGCATTCCAGAATCTTCTCTTTACGAGTTCACC
TGCTAGTATAATCTCCACAACTTGAATGGCATTGGTTGTTCTGTAATTCCTGCCAAAAGC 
ATCACAAGTTGTACCTGCCCG 
Sequence 579 

CCGGGCAGGTTACAAGTCGACCCACGCGTCCGCTTCAGAATATCCAATTCATGTGAACTA
CAGGAAATTATAGTTTAGATATTTTTAAATGATTTGCCTGTCACCGTATAACACAAGGGT 
GTCATGACCMGCTAGATCTCTTTACCATATCATTAATAAAAGTCAAATTTTAAATTTGT 
GCCCAATTTGGCTGGGTGTGGTGGCTCATTCCTGTGATTCCAGCACTTTGGGAGACCT 
Sequence 580 

GTGAACCTGCCCGGGCGGCCGGCCGCAC I I I I I I I I I I I I I I I I I I I I CCTTTTGAGACA 

CAGTCTCACTCTTGCCCAGGTTGGTCTAAAACTCCTGGGCTCAAGCAATCCTCCCGCTTT 

CAGCCTCCCAAAGTGCTGGGGTTACAGCCGTGTGCCACTGTGTCTGGCCCTTTTCTTTTT 

CATAGGAGAAGGGTTGTTGACTCCCAGGAAACGTCACCTGGAACCAAGAATGTGAACTCA 

AGGACCCCCGCCTGTTGGCAGCTGCATTTACTTGACTCCTGTTCACTGTTTCTTAGCCTT 

GTCCTTTCTCTCCTGCCAGTTCTAGGGGACACTGCTTCTCCTGGTTGACCTC 

Sequence 581 

CCGGGCAGGTACCCTAAAACTTAAAGTATAATAAAAAAAAGATTATTTCCTATACTTCAA 

MTCAACAAGATTTGATTGCATTTAAATTTTTCTGTCCATTTGTTCTTCTATGTAATACT 

TTAAAAATAATTGGCATAAAAATTCAATCAATTCATAAAAGTCCAAAGCAAAAAAAACAA 

ATCTACTGACATCTCTTGAGGAAGAAATGATCAGGATTGACATTAATGAACCCTCTCACA 

GAGACCACTACACACACACACACAAAAAGAAGGATGGGTGAATGGATGCAGAGAGAATTT 

AATAAGACTGAAATGATGCCATACATGCTTTTAAAAAATAAAAAGTATTAAATTTTAA 

Sequence 582 

CCGGGCAGGTACTTATACTAGMGATGCTCCAAGGTTTCAGAAAGGAATTAATTACTTTC 

AATTTGCACAATTTAGAACAAATATCTGGCTTTTCCCTAAGCTTAATGATTTTCCATTTC 

ACACAACTAAAATATAATAGCATTATTTTATAATCAAGTTTAACTGATGGTCTATGATAG

TAGAGCGATTTAGTATTTTGACAAAAATCTTATGAGACATGAAGTCATTCAATTTGCCGG 

ACGCGTGGGTCGACTCAAGCTAGACCT 

Sequence 583 

AGGTCGCCATCACACCCGGCTAA I I I I I I I I GTATTTTTAGTAGAGACAGGGTTTCACCA 

TGCTAGCCAGGATGGTCTCAATCTCCTGATCTCGTGATCCGCCCACCTCAGCCTTCCAAA 

GTGCTGGGATTAAAGGCGTGAGCCACCACGCCTGGCCAGGAGATTCTTAATTATTTCTGA

ACTCTATCAGTTTTGTATTAGGACATCTTATTTAATATTATCAAAAGATAGTTCCTCTTA 

GAGGCATAAATCAGTCAATCAACAAACAATAGGCAATCACGGACGCGTGGGTCGAAGACC 

TGCCCG 

Sequence 584 

AGGTGTACAAGCTTCGACCCACGCGTCCGCATTTTTCTGGTGTTCCCTCTTACGTGCACA 

CCCCTTGCTCCCCTTTGGGTTGACTTATAATCTGACTTTTGTGACAGATGTTAGGAGGTG 

GAGCAAAGGAATTTCAGACCAATCAGTTAAGAGACTGCTGTGGGGTAAGAAAAAAAATTA 

GCCTCTTAAAATTACTCTTATCAAAGGAAAAAAAGTTGGAAGCACATGATAGTATAACCA 

GAAACATGACACAGAAGAATTAAGGGAAGAACCTGCCCG 

Sequence 585 

CCGGGCAGGTGGAAAGGTGGGTGGGGAGAGGGAGGCTTATTTGTTGCTGCAGTGTAACTA 

AGTGAAACCTAATTCATATGACTCAAACTAAGGTATATTTGGTTAGATCTAGGTGAGTTC 

TACTTTAGAGGAMTCCTGGTMCTGTTGTTTGTTTGTAAGTTATAGCTGTAATTAATTT 

TCCCTGTATTCAAAGCCCCCAAACCCTGCATTCAGATACTATGCATTTAGACTTCCTTAG 

GCAAAGTCAAGGCAACAAGCTGATGATTCTAAGCTATTATTCAAGGAGTATCTACCATCA 

TAAAGGTGGTTTAGTCATATAGATAATATCAATCAATAAT 

Sequence 586 

CCGGGCAGGTCTTCGACCCACGCGTCCGTGATTGCCTATTGTTTGTTGATTGACTGATTT 
ATGCCTCTAAGAGGAACTATCTTTTGATAATATTAAATAAGATGTCCTAATACAAAACTG 
ATAGAGTTCAGAAATAATTAAGAATCTCCTGGCCAGGCGTGGTGGCTCACGCCTTTAATC
CCAGCACTTTGGAAGGCTGAGGTGGGCGGATCACGAGATCAGGAGATTGAGACCATCCTG 
GCTAGCATGGTGAAACCCTGTCTCTACTAAAAATACAAAAAAAATTAGCCGGGTGTGATG 
GCGACCT 

Sequence 587 

AGGTACATTGTTAGACAAGTGTTTATCACTAATCTGGAATACATCATCTTCAATAAGGCT

CTTGTTTTCCTCCAAGCTGCACTGCTCACACTGCTCAGTTTTCTGTTAAGCAACCTGCTC 

ATTATAGTAGAGCACCAAGGTGATCTGTTCTTCTGTTCTTCAGAAGTTCACTATTTCTTG

TTGCAACAGGGCTACATGATTTTAAGATTCCTCAAAGTCAATACGAATTAACATTATTTT 

CCATTTCCATTCTGTATATCTTCACATTCCATAAATATAATACTCATGTATACGTTAAAT 

TTCCTTATAAGTTCAACACATTGAAAGCTAAAATAAAGACTTCCTACTAG 

Sequence 588 

CCGGGCAGGTACACAAACCAGATGTATGCAATGATGCCAAAAGTCATCTCAAAATTCCAA 

GCTGACCTAGTGCAACACATTTACACTTGGATAAACTATCACCTTGAAGATTTACTTTGT 

ATACTTCAAAAGATCTGGTCATGAAATTTTTAGCTAATACATAAAGTGCCGAATTGAAAT 

CCAGAATACAATAGCTATGAAGGGCCGCTAGAGTGCAGATAACCAACCCATTCTACCTAA 

CTCAGGTTGAGATTGCTTTAGAACCTATCATTGGCTTTAAAATGGTCCACAAAAAAACAC 

CTCATGGTCACACTTAAAATAGTTTGCATCCCACACTA 

Sequence 589 

CCGGGCAGGTGACTTGGCTGTGAAAAGTGCTAAAACAGATAAAAGACTATACTGACAGGC 

AAATGGAGCCTGTTATGACACTGACATTGAAGGTGAAAGGAGAATCCAGTTCACATTAGC

CAGGGTCTCAGGGACCAGGTTTTGAGGCAGTATTTCTGCCTCTTGAGGACAGGGCAGAGC 

AGGTGGGTAAAAAGCAAAGAGACCAGGGAAGGGGGACTAAAAGTAAGGGAAACAGCATCT 

GAGGAAAGGCTCCTCTGACTGGATTTTCACAAACATTATTTATTAACCTCACTAAACAAG 

GATAATGGGACAAAACAGGGGCAAGCTGGAAAACCAGCAGGGGTATTTGGCAG 

Sequence 590 

CCGGGCAGGTACATCACCCTGCTGAGGGACATCCAGGACAAGGTCACCACACTCTACAAA 

GGCAGTCAACTACATGACACATTCCGCTTCTGCCTGGTCACCAACTTGACGATGGACTCC 

GTGTTGGTCACTGNCAAGGCATTGTTCTCCTCCAATTTGGACCCCAGCCTGTTGGAGCAA 

GTCTTTCTAGATAAGACCCTGAATGCCTCATTCCATTGGCTGGGCTCCACCTACCAGTTG 

GTGGACATCCATGTGACAGAAATGGAGTCATCAGTTTATCAACCAACAAGCAGCTCCAGC 

ACCCAGCACTTCTACCTGAATTTCACCATCACCAACCTACCATATTCCCAGGAC 
Sequence 591 

CCGGGCAGGTGGAAAGGTGGGTGGGGAGAGGGAGGCTTATTTGTTGCTGCAGTGTAACTA 

AGTGAAACCTAATTCATATGACTCAAACTAAGGTATATTTGGTTAGATCTAGGTGAGTTC 

TACTTTAGAGGAMTCCTGGTAACTGTTGTTTGTTTGTAAGTTATAGCTGTAATTAATTT 

TCCCTGTATTCAAAGCCCCCAAACCCTGCATTCAGATACTATGCATTTAGACTTCCTTAG 

GCAAAGTCAAGGCAACAAGCTGATGATTCTAAGCTATTATTCAAGGAGTATCTACCATCA 

TAAAGGTGGTTTAG

Sequence 592 

AGGTCAAGCTTCGACCCACGCGTCCGCAGCCTGGGTGATAGTGAGATCCTGTCTTAAAAT 

GAAGAAAGAAAGAAAAAAAGAATGAGAAGGAAGGATATTAATTGAAGTAAGAGCACATTT 

GATTACAAAATAGAAGAGGAGTAAGTGAGAACTAAACGGGGAATACAGATAGCAGAGATT 

AAATAGGCTATAAGAAAAAAAAGGGATGATAATAAGACCATGGTAGTACCTGCCCG 
Sequence 593 

AGGGCCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCCGTTTA 

CAATATGGTGACAACAGATAACTGTAAAAACTTCTTTTTCAAATAGAACCAGCAGGAGCA 

TGCATGG AACAC ATATACCAAACATCTlrCTGATAACATTAAACATTTTTAAAAGATGTT 

AAATGTTCTTTTCATTGNGGTGCTTCAGATTCCTGATTCTAGAACTTGTGTGTGTGGAAC 

CTGTGTGCTAACTATTCTGTTGGAATTTACCAGCAAAGAATTATCTAAGAATnTCAAAC 

TAAATGATGGGGGAAGGAACTAACATTTTTGCAGNCCCTGGAAATGTAAAATGTTGTACC 
Sequence 594

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAATAAATAGC 

ATTCCCACGGTGACCACAAGTCTTGGAATCAGTTCCAGGTGTCGTCGTGGCCGTTGACAC 

CGCTGCCTTCTGACGGTAAATGTATTGTAGAATTCATGTTGTTATCAGGCTTCAGTTTCC 

TCATCTCTAAAATGAGAGGATTGGATAAGTTAGTAGTTTCTAATTTTTACTTTTAATCAG 

TGGCATCTCCeATTTATTTTTCATTTGAMTAAAACTTTTGAATTTTA 

ATAACATATTTTGTTTTATGTTTCAAGATGAAGCTCACACTGAGTTGGAAAAAAGGAAAA 

AGCAAAGGATCAAAGCTG 

Sequence 595 

CTATAGGGCCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTGCCAAAGCCTT 

AA CTTAG ATTGTTACTTATGTTTCTAAATCTGTGGMGCACATTTCCTTTTCTTCTTCT^ 

TTCTTTTACTGTTMTATCCTTATTCTCTATTTTACCAGTGGAGAATGTTTAGTAT^ 

TTCCATTTAGCTCAAGATTCAAGAAATGCANAGTGCTATTTTTATCAAATTTCTGAAAGC 

CTACTGTCTTCTGCTTTGGAAGTCCCACMCAGCTCTTTAATTTCCTTMGCCCCACTTT 

CCTCATCAGCAAGTTGGTGTGGCANTGGATCATANTAGGTTGCTGGGAGGATGAAGTGA 

Sequence 596 

ATGGCGAATTGGAGCTCCCCGCGGTGGCGGGCGAGGTCAAGCTTCGACCCACGCGTCCGA 
TTATTCTCTCCATTTAGGCTATAAATCTTTCAGTGTAGGGTGTTTCTAATGTCATATTCT 
TCCAAAAAAAAAAAAAAAAAAAGT 
Sequence 597 

CCGGGCAGGTCTTCGACCCACGCGTCCGGCTCTCCAGTTTATACATGAAGAAACTTTCCG 

AAAGTCTTGCAGCTTGTGGAGAGCAGAGCTGGAGAGCAGGCTAGTCTGATTTTAGAAGGG 

AGTTAACCATTACATAACCTGCAGGTGGCTTCTCCCCATACCTGCCGTGGGATAATATGG 

CTCACl I I rrACTTCATTTACAATATTTAATAAGTGCGATTTTAGACTTGAGAAGAGAAT 

ATTTTCTGCTAAAATTATCCCCACTAGAGATAATCACCAGTGAATTAATACACTGCAGCA 

ACGGAACCAGTCAGCTTTTTTGGTAATCATTCCCTTCCT 

Sequence 598 

CCGGGCAGGTCTTCNACCCACGCGTCCGGCTCTCCAGTTTATACATGAAGAAACTTTCCG 

MAGTCTTGCAGCTTGTGGAGAGCAGAGCTGGAGAGCAGGCTAGTCTGATTTTAGAAGGG 

AGTTAACCATTACATAACCTGCAGGTGGCTTCTCCCCATACCTGCCGTAGGGATAATATG 

GCTCACTTTTTACTNCATTTACAATATTCAATAAAGTGCGATTTTAGACTTGAGANGAGA

ATATTTTCTNCTNAAATTTATCCCCNCTAGAGATAATNNACCAGTGAATTNATACACTGC 

ACCNNACGGAAACCAGTCA 

Sequence 599 

AGGTGCTTCG ACCCAC GCGTCCGGTATTTCTCTTAMGTTMTTTTGATAGATATTTATC 
TAGATGCTTTCTTTTTTCCCTTGCCATAATAGCTGGCTTGTAGAGAGAGTTATGTTTGAA 
AAGGCTTGCC I I I I I ICCGTCGCTCTG 
Sequence 600 

AGGTCAAGCTTCGACCCACGCGTCCGTGATGCTGGCTTCCCGGTCAAAGCTGAGGAGTTT 

TGTGGTGCTTTCTCAGGAACCTTCTGTCACGGAAACCATTGCACCCAAAATTGCAAGACC 

TTTCATAGAGACTTTCCTCAGGCCCTCAAGAGTATTTGAGTATCTGGAGGAGGATGCCCA 

GAAGTCCNCACAGGAGGGGGTGCTTGGGACCACACACTGATGCTCTTGNCATTCAGACTC 

TGAGAAACATGCCGCGTGATGAANAAACCATCCCAATTANANGAAGCTAGCTGNNTTNCA 

TTGNAGCAGCTTTACCCCAATTT

Sequence 601 

CCGGGCAGGTCAAGCTTCGACCCACGCGTCCGTGATAACTTCTCCTAAGTGCCAGGCATT 

GTATTACATGCTGGGAGCACAAAGATGAATAATAACAATAGGTTCACAGAAAAGATGAAT 

TGATTGAGAGAAAAAGAACCCTCCAGGAGCCCTCAGCGTAGTAGGGGGTTGGTGTTGGAG 

GGTGGAGGAATGGAAAAGGNCCTGAAATGCANGCAGAGAAATGATGAAACAATTCCNGGG 

GCTGCGGNGAGGTTANATGAATATCTTTACAGCAGCCTNGAAGACTGATCANGTTACTAT 



AGCCTCTCTTCTGTCCACGTGCATTTNA
Sequence 602 

CCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCCGCATTTATTAAGGCTTGTATATGT 

TCAAGATCCAGTGAAGACTGTCTTGGGCGTGTATAATTGATCTTAACCACAAGGCTGAGA 

AGTTATGTGCAGGGCTTATGATGCTACTTCCAAAGTATTAAATCCTCCAGAGAAGCCTGT 

AGTGTGGGATGCAAACTATT1TAAGTGTGACCATGAGGTGTTTTTTTGTGGACCATTTTA 

AAGCCAATGATAGGTTCTAAAGCAATCTCAACCTGAGTTAGGTAGAATGGGTTGGTTATC 

TGCACTCTAGCGGCCCTTCATAGCTATTGTATTCTGGATTTCAATTCGGCACTTTATGTA 

T 

Sequence 603 

ATTGGAGCCTCCCCGCGGTGGCGGCCGAGGTAGCTTGAGTCGACCCACGCGTCCGTTCAG 

ATCCGTTTCAGAAACGTGAGTCTCTAGCTCAGGAGATTTCCACAACTGTCCTTAGTAACC 

TGATCJTATTCTCATGTTTAACCTTGGCAGTGGGAAGTTCTTCCTGGTATCCTGCCTAAT 

TTACTGGAGTTGGCATTAATGCCATTTCCCCCTAAGGCGTGGCTCTTGGACCAGTATCAC 

CTGAGMTTTGATAGACATAGACCCAGAGTTACTGAGGGCAGGTGCTCTGTTTTGGGGAC 

CAGCAATCGGTGCTTTAGCAAGTNCTTTGGGTGATAGGGGTTNTGGAAACTACTGCTCTA 

AAGCATNATCTGTTTTTGAC 

Sequence 604 

GGTACCCAAAAGATATCACTGTGAAGGTTTGGATACACTGACTGAGGAAAAAAGAAGGTC 

CTGAAAGCGTNTAGACAAAAAAAGACTACTTGTAAGTTGCAAGAATCAGAATGGCATTGG 

ACTTCTCAGCTTTCCTCAT TAGAA GTTTAAGATCTGMGCAATCTTTAAACTCGTGAGGA 

MATT MGT CTAATAAATANTTTTCTTCTTAGCCCAAACAATCAAATGTGAAGCTAGAAT 

AAGCATTTTCAGGTAAAAAAAAAAAAAAAAAAGTGCGGC 

Sequence 605 

CCGGGCAGGTACANNTTGTGATGATTTTGGCAGCAATTACAGAACAACCAAGGCCATTCA 

AGTTGTGGAGATTATACTAGCAGGTGAACTCGTAAAGAGAAGATTCTGGAATGCCTATAT 

CTGAAATCAGAATCCTAGTAGTTTGTAGTTTGCCTCTTCCTAGAAGTTCAAGAGACTCAA 

GTCATAGGCTACAGATGTACCTN 

Sequence 606 

AGGTCTTCGACCCACGCGTCCGCAACTGTTGATCTAACTTTTCCACTTGAATGTCTAATT 
GGCAAATCAAACCTAACATGTTCCAAACGAGTTCTGAAGCACCCCCTCTGCCAAATCTAC 
GTCTCCCACAGCCTTCCCTATTTCTCTACCTGGTACCTGCCCGGGCGGCCGCTCGACCTG 
CCCG 

Sequence 607 

AGGTCTTGAGTCGACCCAGGCGTCCGGAGATGTATACGCCACTATAGGAACTATAAGAAA 

AAGTCAAATGGAAATCTTATAAATAAAAACCACAGTCACTATAATGAGGAAATACTTTGA 

TAAGGTGTCAGTGAACTCAAAAATCAATCAATAGAAACTACTCAAACTAAAACTCAAAGA 

GMAAAMAAGATGGGAGATAATTATTTTTTAAGAATTGGTCATCAAAATGTAGCAACAA 

GTTTGCCTTATCCTATATCATTTGAATTTTCAAAAAATAAGCTCATTATACAATCTTTAA

MTATTTTGAATAGAACTGTTTCATGTGTTATTTGT 

Sequence 608 

AGGTCAAGCTTCGACCCACGCGTCCGGGGMCCCTTGTTCAGGTTCTTTTTAGGCAACCC 

AAGCCAAGACMCAMGTMGATAGAGCCCCMATGTGGTCGTATAAGGTTTTTCAAAGA 

AAGTAACACTTGAGTTAGGTCTTAAAGTTTCACCTAAGAAACTGCCCAGGTGGACAAGAA 

GAAAGGGTGTTCCAAGTAGAAATAATAGCATGGACAAAGGCAATGTAGCAGGAAAAGTCT 

TCGTAMTTCAGGGAATTTCMGTGTTTCACGATGGAAGGAGCAATAGAGTCATTTACTT 

GCGGTGGCAGGGGATGTTGGAAATGTAAACAAGAGTGAGATAC 

Sequence 609 

AGGTCAAGCTTCGACCCACGCGTCCGTGATGCTGGCTTCCCGGTCAAAGCTGAGGAGTTT 

GTGGTACTTTCTCAGGAACCTTCTGTCACGGAAACCATTGCACCCAAAATTGCAAGACCT 

TTCATAGAGACTTTCCTCAGGCCCTCAAGAGTATTGAGTATCTGGAGGAGGATGCCCAGA 
AGTCCGCACAGGAGGGGGTGCTGGGACCACACACTGATGCTCTGTCATCAGACTCTGAGA
ACATGCCGTGTGATGAAGAACCATCCCAATTAGAGGAGCTAGCTGACTTCATGGAGCAGC 
TTACACCAATTGAA 
Sequence 610 

ACT I I I I I I I I I I I 1 1 I AGCTTGAGTCGACCCACGCGTCCGGGGATCTAGATCACGAGCG 

GCCGGCCGCCCGGGCAGGTACGGAAGCCATGCACTTGCCTCTCCTTCAGAGCTGGGATTT 

TTTTTCATTTTGCTGGCTGTGAGCACACACACGCCCACAGGTGCCTAAGCCTCTTGTATG

TGTGTTTTGAACTGTGTCCTCTGAGTTCTGTGTCTGGGTGCATGCTCTCCTCTTAGCGTG 

GGTCTGCTTCCCCTGTGTAGCACTTCACAATGTTAGGCATTTGTCTGTGATAGCAGCTGT 

TCAGTAATTTCCTACTT 

Sequence 611

AGGTTTCGACCCACGCGTCCGGAATTTATCTGGCCAGGCATTGGTAGTTTACAGAAGTCT 

ACCAGATGATTCTAATGTGTGGTCAAGACTGAGAACTATGTGTTTAATTGGGTTCATTTC 

AAGAATACTGTAAAAATTTTATCTAAATACTAAATATCCATAAAAGAAACCTCGGTAATC

AGGCCAGGTTTTTGAGTTTTTCCAGATTAGCCCAACTACAGGGGAAAGAGACTTTCGCAC 

TATATCCCAGAGTCTCTGCTCCTGCTTCCAGCCTCAATGCACTGGGCCTTTCTGCTGCCT 

TGGAGCACTTAGAGGGATTACAGGAGGAGTGATCTGTGGAGTT 

Sequence 612 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCAGCGCCCGGGCAGGTACATACAGCCTGAAG 

TTAACCTTTCTATGTTAAATGAAAAAACTTTGTCATTCATCAGGTCACAGAAACCAAAAA 

CTAMCAATGCAAAAAAAAAAAAATCTAAMATAAMGAATTTTATATTTGAAGTTATTC 

TGGATATTCGCACCATTTTAGCTTCTGAAAAAAATGCAACTATGAAATGAAGACCTCATA

TATTTTCATTTATCAATATAATGTTAAAAGTTTCATTCCACCGGGTGTGGNGGCTCACAC 

TTGTAATCCCAGCACTTTGGGAGGCCGAGGCGGGCGGATCATGAGGTCAGGAGATCGAGA 

GTATCCTGGCTAACATGGTGAAACCCCGTCTCTCTAAAAAAATTCNNNAAAANAAAAAAA 

AAGGAA 

Sequence 613 

CGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATTGTAGTTTTCTTAGCCATTTTTC 

MTAGT CTACACAGTGTTTATGTTTCCTTTTATTTGTGTATAGTGGAGTAGAGGGGAGGT 

TTTTTTATTCAAATAGAAGAAGCTAAAACTCAAATGCAATGTCAGATCTCANAATAAACT 

GACCCAATTTCTGAMCCCMTAMCACATTTCATTTGTMTATTCTTTATTATATAGCT 

CTATGMAAGTAATTTGTGACTTTCGATCTTAAAAGAGAGTTTTAAAAATACACAGTAAA 

TTGAAAGAAAAACTACTACATTTAAAACAGTATTTTCCTGAAAACATAGAATGAAAATGC

MGTATTTTGTGCATGGCAGCTGTTTTTAAGGAACCAATGTTATATATGGNGAATTTTGT 

GGMGACTATGTCTCTTMAATATTTCTTATMAATANCATGGCTTTTTAATAGCTGGGA 

ATCTGANGNNGGATTTCCCATGAAGACCTTAAATGGCTNNGCAGGAATTATAAAAAAG 

Sequence 614 

ACTTAGGGCGAATTGGAGCTGCCCGCGGTGGCGGCCGAGGTACATGCCTGTAATCCCAGC 

TGTTGGGGAAGCTGAGATGGGAGGATTGCTTGAGCCTGGGATACTGATATTGAGGCTGCA 

GCGGGCCGAGAATACCACTGCACTCCAGCCTGGATGACAGAGTGAGACACTGGCTCAAAA 

MAMAAGMAAAGGAAGAMAAAGTTTAAATCAATGAATGTTCTCATTTCTAATGAAAT 

AATGAM CATTATT GGGAGAGTTATAGTCATAATCATCTTACTGCACTATCAATTAAAAA 

ATACATCATTTTTTAGAGCACAATATATACCATAAAGAATTATTCAM 

TACGATCAMTTTTTAATAGACTTTGTTACTTMACTAAAACTGTATTAGTCTGTATTAG 

TCAGCTCAAGTTGGGATTCACACCTGTAATCCCAACACCTAGGGGGGC 

Sequence 615

CCGCGGTGGCGGCCGAGGTACACTGTGTMGTGGTCMAGATAGACATGGTTTTATTTAC 

AAGGAAATTTGCTGAAGTGTAAATTATAACACGAAGAGATGGGAGGGAGGGGTAAACACC 

TAAATGTCTAACAACAGAGAATGGTTCTCTGTTGATACAAAATTATGATACATCAAAAAA 

GAACAACAATCAAATTCTCTGAGAATTCCCATTACAGTTAAAAGGAGCTCCCAGCCAGGT 

GCGGTGGCTCACGCCTGTAATCACAGCACTTTGGGAGGCCGAGTCGGGTGGATCACGAAG 
TCAAGAGATTGAGACTACCCTGGCCAACACTGTAAAACCCCGTNTCTACTAAAATACAAA

AATTAGCTGGGCCGTGG 

Sequence 616

TTAGGGCGAATTGGAGCTCACCGCGGTGGCGGCCGAGGTACTGAATAACTGCCAATGCCA 

TCTGCCTGTGGCCTTCTCAAGTTTGTCTGCACCTGTGGTTATCCTGACTTCAAACCCGGG 

GAGACAGAGGCTAGAAGAGGCAGACAGCTCTTGTGTATTCTCCTGTCCAGTGCAAAGAAC 

ATCTGGAACTCTGAGCCCTAACCTTAAATGCAAGACCTNATCTGCAGGTGTTCCTNATCC

TTTTAGCCCCTCAGTGATGTAAGCAACAAACGTCACCCANCTCCTGGGGCACACTTNACT 

CCCAGATGAGCTTGTCCTGGATTTGCAGGGAGCCTGGCTCCC 

Sequence 617 

TAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCGGGGCAGGTACATCACCCTGCTGA 

GGGACATCCAGGACAAGGTCACCACACTCTACAAAGGCAGTCAACTACATGACACATTCC 

GCTTCTGCCTGGTCACCAACTTGACGATGGACTCCGTGTTGGTCACTGTCAAGGCATTGT 

TCTCCTCCAATTTGGACCCCAGCCTGGTGGAGCAAGTCTTTCTAGATAAGACCCTGAATG 

CCTCATTCCATTGGCTGGGCTCCACCTACCAGTTGGTGGACATCCATGTGACAGAAATGG 

AGTCATCAGTTTATCAACCAACAAGCAGCTCCAGCACCCAGCACTTNTACCTGAATTTCA 

CCATCACCAACCTACCATATTCCCAGGACAAAGCCCAGCCAGGCACCACCAATTACCAGA 

GGAACAAAAGGAATATTGAGGATGCGCTCAACCAACTCTTCCGAAACAGC 

Sequence 618 

GCCGGGCAGGTACAGATGGGGTTNCACCGTGTTAGCCAGGATGGTCTCGANTTCCTGACC 

TCATNANGCATCCANCTCGGCCTCCCAAANTGCTGGAAATTACAAGGGCGTTGAGCCCAC 

CCG CACCTGGGC CAGAATCTTACATATTTCTTAMCATCATTAAATATATATTGTATTTT 

TTACI I I I I I | I I GAATAGGGGTCTTGCTATGTTGCCCAGGCTTGGTTTTGAACTCCTGG 

CCTTNANGAGATCCTCCCGCTCTCAAACCTCTCAAAAGCAATGGGTA 

Sequence 619 

AGGTACCCCATTTTATGCCATAAGTCAGGTTTCTCCCTCAATAGCCCTTTGGAACTCTCA 

AGGTCCAGAGTGGCATCAAACCAACTGACACATGAGTTGATACATCATGTGCTGCCAACA 

GAGAAATTAGTCTGTGCCAAACTCAGCACAATCCTGCAGTTCAAACCAGAATTTCAAAAA 

Sequence 620 

ACCAAGATT TGAA TCATGCTTTCAAMGCTAATGTGAAGTTAGACATATTTGGTTTCATA 

ATCACAGAATTTTAAAAACACCAGGTCTGCAATATTCAGAAATCACCATTAACGCTCTCT 

TGACACATACAATCAATTTCACTTTAGATCGCTGATTTTCTTAACAACTGATTTAGTTAT 

TTCTGAATACTGCTAGAAAATTCAAAATCTACAATTAAT 

Sequence 621 

AGGTACATCACCCTGCTGAGGGACATCCAGGACAAGGTCACCACACTCTACAAAGGCAGT 

CAACTACATGACACATTCCGCTTCTGCCTGGTCACCAACTTGACGATGGACTCCGTGTTG 

GTCACTGTCAAGGCATTGTTCTCCTCCMTTTGGACCCCAGCCTGGTGGAGCAAGTNTTT 

CTAGATAAGACCCTGAATGCCTCATTCCATTGGCTGGGCTCCACCTACCAGTTGGTGGAC 

ATCCATGTGACAGAAATGGAGTCATNAGTTTATCAACCAACAAGCAGCTCCAGCACCCAG 

CACTTCTACCTGAATTTCACCATCACCAACCTACCATATTCCCAGGACAAAAGCCCAGCC 

AGGCACCACCAATTACCAGAGGAACAAAAGGAAT 

Sequence 622 

NCCGGGCAGGTACTGGATGACAGCAAGTGCACACATCAAGAGAAAGTTACCATTCAGAGG 

TGCAGTGAGTTCCCTTGTCCACAGNGGAAATCTGGAGACTGGTCAGAGGTAAGATGGGAG 

GGCTGTTATTTCCCCTAGGTCATCTCTTACATTCTAGTTCTGGTGCTCTCTATCTGTTTA 

AGACAAACCCTTGNGCACCTTTCTCCCACCCCTCCCTTTCTCCCTTGTCTCCCTTGAGAA 

AACAACTNCAGTTCTCTGCCTGCACCATGACTGTCGATACGCGGGGGCAGTTCGGCGGTC 

CCGCGGGTCTGTCTCTTGCTTCA 

Sequence 623 

AGGTACAAGCTGTGCACTGCAAGGTAACCACGTGGCCAGAGGCACATCCCTCCCTCACAT 
ATACTGAGTGGTGTAATGCAGTCACCTTGTCATCTGGCAAGAGGTGATCGATGGACACAA

ACTCCTCCCGGAACTGCCCCTCCAGCGAGCTCACTCTGAGGTTATCTGAACTCACATAGC 

TTGGGAAACCCAGCTGGGCACGGGCAACATTTGCGTAGTGACCCTTCCAGTCATCGGAGC 

ACATGGTCTTCCACGAAGCAGCTGTGAACACCTGGAGCACGGCATTCTGACCACTCACCC 

GGACACAGCGGTACCTGCCCGG 

Sequence 624 

CGGGCAGGTACTTTGCAAGACACGCCTGGCTACGAACAACATGGGACAATGGGCAGCCTC 

GCTGCACTGNACAGAGGAAAGGAAAGAGGCCTTGCAGCCACTGCCTGGGAAGGAGCAGCA 

CATTCTGCATTAACCAGGCATGCCTCACTCACTGCAATCCCCAAACAAGCCCAACTCTCC 

GTGTTGATTATTCTTACCATACTCCACCAGAAAGCAGCATGATTTTCTGTCCTCAAATAC 

TTCAGATTCCAAGAGAACTGCACCTTCTAGAGTCTCTACTGATAACCTCAGNCACTTACC 

CACTTGAAGCATNAGCACACACTTAAAAAGGAAA 

Sequence 625 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATATTCTTCCAACCTTTCT

TCTGTGCATMTC ATCTA GGTGTGGTGCTTACATTTTCTTTTGGCAGTGTTATCTTAGTA 

TCTTCCAGCATGGTTTTCTCACCTGATACTGTAACCATACTTCCATATCCTCAAATGTGT

TGTTTTCTAAAATAAC1 1 I I I I I I I CTTTTTTTAGAGACGGAGTCTCACTTGGCCAGGTG 

CGGTGGTTCACGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGCGGGTGGATCACGAGGT 

CAGGAGTTCGATGAAACCCCGTCTCTACTAAAAATACAAAAATTAGCCAGGTGTGGTGGC 

GCACGCCTGTGATCCCAGCTACTCGGGAGGCTGACGCAGGAGAATCTCTTGAACTTGGGA 

GGTGGAGGTCGCAGTGAGCCGTGATCGCGCCGCTGCACTGCAGCCTGGGTGACAGAGTGA 

GACTCTGTCTCAAAAAAAAAAAAAA 

Sequence 626 

AGGTACGCGGGACATACTCCCTAGGTGTCTGTGAGGATGGTGGAGGGGATTTTCTCCATG 

CCGGGAGGCTTCCTGGAGCAGGTGCTGCCTCTCGTGACTCTTGAAAGATGCTTGTGAATA 

AAGCATACTGGGAGCTGAGCTGCTGTTTAGTAATTAAAAATCCTTTCCATTGTTTAGAGC 

TCAGCACCTTTGTGCATTCATATTACGCATTCATTTTCGTATCATTGTTGAATTTCTCAC 

TTCTGCTACTGCAATGTATGTCTACAGCTGACAAGTCTTCCTTGGGAGCCCTACGTAGCT

C l I l l ll l l CTTTTCTTTC I I I I I I I I I I I GAGACGGAATCTTGCTCTGTCACCCAGGCT 

GGAGTGCAGTGGCGCAATCTCGGCTCACTGTAAGCTCCACCTCCCGGGTTCACGCCATTC 

TCCTGCCTCAGCCTCTTGAGTAGCTGGGACTACAGGGTGCCCACCACCACGCCTGGNTAC 

I I I I I I I IGGTAT 

Sequence 627 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACTCTGCCATGAAGGT 

TCTGGGGTGGAGAGGGAAGCAATGTATATCCTACCCATGGTGATTGGTCCGATGGAAGTC 

ACATCCTGATGGGAAAAAAGGACTGAGCCAGAGTGGACTGTCTAAACCAAAATGGGATAA 

ACAAGCATGGCATGGAGCCAAAACAAATGGCTAAGTCAGAGGTCCTAATGCAGAAGGCTG 

GACAACTAGGATGGTGGGGAAAGACATGAGCTTGAAGGACTTCCCAAGATAAAGCAGAAC 

TMCCAGAAGAGCCTGTTATAGATTCATATTGGGGGAGTTTGGGGGGTTTGTGCAGGGTG 

CATCAAAAAGCACTCGCATGGAATAAACATATCTTGCACAGGAACATATGACAGGTAATT 

GAATAGTTTGATTTGAACTATGTAAAGACATGATCCTGATGGTAGAAGGATGGTACCTGC 

CCG 

Sequence 628 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC I M I I I I I I 1 1 I II I I 1 I I 

TTGGGACCGAGTCTCACTCTGTCGCCAGGCTAGAGTGCCATGGCGCAATCTTGGTTCACT 

GCAACCTCCACCTGCTGGGTTCAAGCGATTCTCCTGCCTCAGCCTCCTGAGTAGCTGGGA 

TTACAGGCCCTNACCACCACGCCCAGCTMTTTTTGTATTTAGNAAAGATGGGGTTTCAC 

TGTGTTGGCCAG GCTG GTCTCGATCTTTTGACCTTGAMCTTTNACATAAACTTTCACAT 

TTCCATGACAAAGTTTTAGCAGTAACTTCCAAATTGGTCTTATTCAACTCCAACATTAAA 

CTTTGTATGTACCTGCCCG 

Sequence 629 
CTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATGATCC

AGGATGTGATGGGATCTTAGGGCTTGGCTGGAAGGTTTCTCCAGTCAGCCATCTAGCAGA 

GCTGCAGATCTGGGCTGGGCTGTTGGCTAAAGTGCTCTTCACAGACACCTCATTCGGCTC 

TTCCTTCAGCTTCTTCACTTATTTCTTACTCAGTCACTACTCAGCTCCTTGTCCATGTGT 

CCTTGAAGCCATCCTAGGTCTTATTCTGATTCTGAATTCTTCAGTCACCCATAAGCTTCT 

CCTTACCCCGGGAGTCAGTGGGTGTGTGTTCCCAGGTGGACTTAACCATTCTTCTCCTTT 

ATGATCCTTTCCCTTGGGTGGACAAGTGTGATTTGGTTGTAAGGCCATTTTTCAAGTTGC 

CTATACATTGATAAAAGAAATCCCACTAACGGAAGTAGACTGCATGCCAAATTTCAGTGT 

CTTTCTCCAGGGGCCAAGGTTGGACCCANAAGTGCATGGG 

Sequence 630 

CCCCGCGGTGGCGGCCGCCCGGGCAGGTACATTATTGCTTCCTGGGAGAGCTGACCATGA 

GTCAATTGGCCCACAATAANTTATNAAATGAAAACCGGCCATCATCTGCATCTTATGAGT 

GCACGTCATCAGAGATGTCCACTCCAGTTACAAGAAAGTCCTGAGGGCTTTCTTGGAGCC 

TGANGGGCGCTGGAGGTGAGACCTGGAGGTGAGCAGGAGTTAACTAGGATGAGGGACNGG 

CGCAGCATACAGGAAAAGCTGCCTGGGGGAGAAAGGACCAACAGCAAAGACTGAGAAAAA 

AATGCTGTTGTGACCAGGGTTCAGAGCGGGCATGGAGGACTGAGGGTTCAGAGCGGGCAT 

GGAGGACTGAGGGTTCAGAGCGGGCATGGAGGACTGAGGGTTCAGAGCGGGCATGGAGG 
Sequence 631 

CCGGGCAGGGTACTAAGGACAAAAAGACATTTATTCTCTTTGACCCTTGCTGCCAGNACA 

GAAAATGACTTCACCCAAGGACACAGCACTTGCGGGTGGCCTTCTCCACCTCCAGCTATT 

GCTTGGTTTCAGGTGACCACTCCCTTTCTCTTCTCAGGCCTATGGGTGGTAACAAGCTCC 

CATCCACTGCTAGTCTTAGACATCTTTACTTTCCTTGATTGATNCCCTTGACTCTGCCCA

CATCTTTTAAAATATCCCATATTAAACTTTTTACACCCCTTTGAATGTGTCCTGCTTCCT 

GCTGGGACCATGACTAGTCTCTTCTAGTNGGAATCCATATCACCTTCTGTGATGTAGTCT 

CCAAGTCAGGCAGNCTCATTTCAACTACAGNCTTTCTTTATGCTTCTCTTTTTCCTTTCT

GGACTCTTACCTTTCTTTTATTTCTTACTCAGCAACAGTGTCTGCCCATTAAAATGCACC 

TTTGCGGNGGNGGTTNGGGTATCTTTATCTCTCTTATTTCTTCCTTNCTGGTGCATTTTC 

ACTGGCATTGCATGGGAATTTTGGTT 

Sequence 632 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGACATACTCCC 

TAGGTGTCTGTGAGGATGGTGGAGGGGATTTTCTCCATGCCGGGAGGCTTCCTGGAGCAG 

GTGCTGCCTCTCGTGACTCTTGAAAGATGCTTGTGAATAAAGCATACTGGGAGCTGAGCT 

GCTGTTTAGTAATTAAAAATCCTTTCCATTGTTTAGAGCTCAGCACCTTTGTGCATTCAT

ATTACGCATTCATTTTCGTATCATTGTTGAATTTCTCACTTCTGCTACTGCAATGTATGT 

CTACAGCT GACAAGTCTTCCTTGGGAGCCCTACGTAGCTC I I I I I I 1 1 CTTTTCTTTCTT 

I 1 1 l l l l l IGAGACGGAATCTTGCTCTGTCACCCAGGCTGGAGTGCAGTGGCGCAATCTC 

GGCTCACTGTAAGCTCCACCTCCCGGGTTCACGCCATTCTCCTGCCTCAGCCTCTTGAGT 

AGCTGGGACTACAGGTGCCCACCACCACCCCTGGCTACTTTTTT 

Sequence 633 

GCCGAGGTACTTCCCTGAGCAGTCGAAGTGGATGCCCAGACCAATGGCCAGNGCTAATAT 

NCAANGCAATGATCCCAATGACGATGATTGGAAAAAACTTCAATGGCAGCAGTGACAGGA 

TCTGTGCAGCAACAGCATCTGCATCTGGTGCAACAGGACTTATTTTCAAATCATCAAGGC 

CAAAAAGCGATCGGAATGAGAAGGGGGCTTCAACAGCAGGCGGATCATnTCCCCCATGG 

TGACTATTTCAGGACCTCTGACATCCGGCTCCGCCTCCACCTCTACCTCATAATTCCCGA 

GTCCCAAAAATGTAGATGGCACCACGGAAGAGATAGTAGGCCACAGTGTTACTGGCTTCC 

CATAAACACAGCCCTTTCCTGGCTCACACGGGCATGACCTAATTAAGAACCCCCGCGTAC 

CTGCCCGGGCGGC 

Sequence 634 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAAAGTGA 

AATCCCTAAGTCAAACTGTGGCTTATAAGCAGAAATCCTGGTTAGTATTTCAAAGTTCTC

TTAGCGTTTTCTCCTGCGACTTAAAAGACTTAAAAACGTGAAAAGACATGGACGTAAGAC 
TCCAAACAAAAATACATTTCTTTGAAACTAAATAGCTCTTAAGTAAGAAAAATTTCTATA

GATCTTCAAATCATCCCCTAAGCAAAATATTCTCTAATTAAGTATTTCTGTATTTCCATC 

TATGTTCTTCCCAGGCTTGGGGCTGTTGATCAGACCTATTTTTAGGGGTAAGTTTCTAGG 

GGTCATAGAAGATACAGATTTTGACCTGCTTAATGTCAAGAGGTTGCACGGTTGATTTGT 

CCAGTTGTGAATTCTATGAATGAAGC7TTTTGCTTAAATAAAACGATATTCCCCTCTGGC 

TGCTGTGAGCACCGGGAGACTTGTTTCGGCAGTGCCTGGGTGCTGGGGCAGGGCCCG 

Sequence 635 

TGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGGGAGTAAAATAAAAGGGTCG

GATTTTGTAGGATTCTAAGGAAGAGGCAGTGTGCTCTGTCCACAGGCTGCAAGGTGAGAA 

CCTAAAAGAATGAGATATATTCCCATTTTGGAATGGCAATCAAAAAGAGGATCTCTCTGT 

CAAGTCTTTACATTAATGCTGAGTAACAATCTCAAAAGCCTGCCCATTCCCCTTTAGACA 

CATGTGGCAAAAGCAGAACTGAAGGAATGGCCAAGGGGCTTGAACAAGTAGAGAGACCGA 

CAGTCTTTCAMTTTCAGGGAACCCCAGATACATTTTGGGGGAGCCACTGTTTTCCCATT 

TTCCTGAAAAGTTCTTGCAGGTATAAGAAATAGGAATAGAAATTGAATAGGTTCTGGAGC 

CAGGGCTACAAAGGCCCCAGCTCTGATCTGTTAGACTGAAAACCACACATCAGATGAAAT 

TATATNCACAAAAAGGAGAGTCCTTAAAAACAGCCATTTCGGTCCCT 

Sequence 636 

AAGAGCACGTATAGCATGGGGGAAAGAACCTAAATGTCTCTCTGTCCTGTGAGCTGGTGA 

AAAACCCAGCATGAGAACGCAGTGTCAGGTGTGGGACTCCTTCTGCCCCTGCAGTGGGTG 

TTACGGGCGGTGTGCCCTGGCGAGCMGCTTTGATTCTTGGTTCTTTGAGCTCGTTTCAG 

AGGCTGAGTCCCCACATCAGCTTTAGTTCTTGGACTTCCCTGTATTAAGCAAGAATTAGG 

AGAATGGCTGTCCCTGCAGGCGCCTCCCGTAAATCCTGAGCTCTCTGGCGCAATCTGAAA

CTTCTCTTCTGTTTTCTTTGGCTGTATCAGCCGAACCAGGAGAGGCC 

Sequence 637 

CCGGGCAGGTACCAGGAGAGATCTGAGACANGGTATGAAGTAAAAGATTTAAGA7TGGAA 

GTGGAGAGTGTCATGGACCAGTGCCTTTCGGATGGGTGACTTCTGGAATTCTTGTTAGGC 

ACAGCGGAGGTTGGTCCTGTGGGAAAGGAAGAATATTTCCGGGGTGAGGAGACTTCGGGG

TGTGGGCCGGGTGCCTTTTTAAATTTGGAATGGTGTATACAATAGGGAAAGGATGTTAAC 

TTTGCAGCAGCGGGGATGGTGAATATAACCTGATAGGGACCCTTCCATTTTGTTGGAAAG 

GGGAGGAGGGGTGTGCTACCCAGACCCAGTCTCCTGGNTGTAAGGGTAAGAAAGTGAATT 

GGGAAGAATCCTCAGG 

Sequence 638 

CCGGGCAGGTACCTGGACTCCTAAGCCTCAGGGATTTACTGAAACACCATTCTATTTTAT 

MTAATCCTTMCCAAGAATTTTAAGGATCTTAAATTTTTCTGTGGTTCTATTG7TATCT 

GATATATAGATGATCTGCTGCCATATCCTAAAGAGCAGATGAGGCCGGGTGTAGTGGCTC 

ACGCCTGTAATCCCAGCACTTTGGGAGGCAGACGAAGGTGGATCACCTAAGGTCAGGAGT 

TTGAGACCAGCCTGGCCAACATGGTGAAACCCCATCTCTACTAAAAATACAAAAATTAGC 

TGGGTGTGGTGGTGGGCACCTGTAATCCCAGCTACTAGGAAGGATGAGGCAGGAGAATCA 

CTTGAACCCAGGAGGCGGAGGTTGCAGTGAGCTGA 

Sequence 639 

AGGTACCACTTAACAAGGGTTCTCAGCTGTGNGGNCACTGGACCACTGGGATATGCTGAG 

CTATTGCTTAAACACTGACTTAAATAAAACAAATATTTTAAATAATGAGAATGCTACTGT 

AATTAGAAGGCAATCATTTCAAAGTCTANATGGAGGCCAGGGGCGGTGGCTCATGCCTGT 

AATCCCAGCACTTTGGGAGGCCGAGGTGGGTGGATCACATGAGGTCAGGAGTTTGAGACC 

AGCCTGGCCAGTATGGTGAAACTCCATCTCTACTAAAAATACAAAAATTAGCCAGGCGTG 

GTGGTGTGCACCTGTAATCCACTGAGGCAGGAGAATCACTTGAACCCGGGAAGTGGAGGT 

TACAGTTGAGCTGAGATAGCACCACT

Sequence 640 

AGGTACAAAGGTTCAGTGGTGAGAAGAGGGAGCAAGGCCTTTGGAATAATGAACTCCAGT 
TGTTCCTCATAGGTGCAGCAGAAATAGCGAGAGGTCAGGATTATGGAGATTGGTAAGGCG
AGATCATCCAAGGGCCTTTTGCTTGGTAAGCCATTTTACTTTAATCTTGAGTGCCATAGG 
GATTCATTGACGGATTGATACAGGGAAATGAAATGAI I I I I 1 1 I I I I 1 1 IGGTTGGGGGA

GACAAGAGTCTTGCTCTGTTGCCCAGGCTGGAGTGCAGTGGCACAACGTCGGTTCACTGC 

AGTGTCTGCCTCCCAGGTTCAAGCAATTCTCATGCCTCAGCCTACCTTGTAGCTGGGATT 

ACAGGTGCACACCACCACACCCAGCTATTTTTTA 

Sequence 641 

AGGTACAAAGGTTCAGTGGTGAGAAGAGGGAGCANGGCCTTTGGAATAATGAACTCCAGT 

TGTTCCTCATAGGTGCAGCAGAAATAGCGAGAGGTCAGGATTATGGAGATTGGTAAGGCG 

AGATCATCCAAGGGCCTTTTGCTTGGTAAGCCATTTTACTTTAATCTTGAGTGCCATAGG 

GATTCATTGACGGATTGATACAGGGAAATGAAATG A I I I I I I I I I I 1 1 1 IGGTTGGGGGA 

GACAAGAGTCTTGCTCTGTTGCCCAGGCTGGAGTGCAGTGGCACAACGTCGGTTCACTGC 

AGTGTCTGCCTCCCAGGTTCAAGCAATTCTCATGCCTCAGCCTACCTTGTAGCTGGGATT 

ACAGGTGCACACCACCACACCCAGCTATTTTTTATAT 

Sequence 642 

AGGTACCTCGTTTCTGAGGATCAANACCTNAGNGACCGNGTGTGTGTGTGTGTATTTGTG 

TGTGTGTGAGTCCTATTTGGGCCCCGCCTTTCAGCCCTGTCTTGCAGC 

Sequence 643 

AGGTACTTTCMTTTCTGTGGGATAAACTCCAGCTCCAGTTTCAGAACCCACTCTAATTG 

GTTTAAGCCAGGAAAGGGAGAGGGACATGTTGCTGGGAGGCCCCCATCTGGGGCCTGAGC 

TTGGAATCAAATCAGAGGAAGGCAACACATGTAAAGTGCTGAGAGTGGAAGGATGAAGAG 

AGCTAGGGCTTTGTGCCATCACTCGTGCTCTGGACATAAGTGGAGCTGGGATTCAGCATT 

ACCTGCCCCTGTACCTGCCC 

Sequence 644 

CGGGCAGGTACTAGTCCAGGTGTGAGATGAAGGGGGCCTGGATGAAGCAGAGGGTGAGAG 

ACAAGGAAGATTCTGAGGACCTTGTGGCTAGATGTGGGGGTTAAGTCAGGTTCAACTCCT 

AGGCTGGATGAATTGGCAGATGGCACATGAACTACAAGAGAATGGAAGGCAGAACCTATT 

TTGTGGGCAAAAAATAMTTACATTTTGCAATACTGAATTGAGGGGCTTCTTGGAAGTCC 

AGGTGTAGATGTCTTACAAAAATAGAATATTCTGGGCTGGGTGCAGTGGCTCACCCCTGT 

AATCCCAGCACTTTGGGAGGCCAAGGTAGGGGGATCACCTGAGGTCAGGAGTTCGAGACC 
AGCCTG 

Sequence 645 

GGNCACCACACTCTACAAAGGCAGTCAACTACATGACACATTCCGCTTCTGCCTGGTCAC 

CAACTTGACGATGGACTCCGTGTTGGTCACTGTCAAGGCATTGTTCTCCTCCAATTTGGA 

CCCCAGCCTGGTGGAGCAAGTCTTTCTAGATAAGACCCTGAATGCCTCATTCCATTGGCT 

GGGCTCCACCTACCAGTTGGTGGACATCCATGTGACAGAAATGGAGTCATCAGTTTATCA 

ACCAACAAGCAGCTCCAGCACCCAGCACTTCTACCTGAATTTCACCATCACCAACCTACC 

ATATTCCCAGGACAAAGCCCAGCCAGGCACC 

Sequence 646 

CCGGGCAGGTACAGGGGCTTGGGGGCTTGGCCAGGCTCTTCTCCATCCATGCCACGGGGC 

TGACAGCCACAGATCTGGAAGCTCAGGCCTAGGAGTGCAGGCTCCGTTAAGCCCTGTGTC 

CAACATCCTGACTCCTAGGGGTGCCCAAGATTTGAGTGGCCACTTTCACCTCTGGAGGAA 

GTAATACCTAAGGCGCTGATAGAAATAGAACTTCCGCTGCCAGGCAAGGTGGCTCACACC 

TGTAATCCTAGCACTTTGGGCAGCCTCAACGCAGGTGGATCACTTGAGGTCAGGAGTTCG 

AGACTAGCCTGGCCCAACATGGTGAAACCCTGTCTCTACTAAAAATACAAAAATTA 

Sequence 647 

AGGTACNTGTTTCAGTCACTGGGCTGANNTGGNNCACAGCACAACCTTCATAGCCACTGT 

ATGAAGAAGTANAAGACCCAGACTCTTGCTTTATGTTGGTATCAAAAGTCATTTCAGAGT 

CAGGCTGATCACTCCCAAGTAACCCACTGACTTCTTTACTCCAGCTCTCTGTCTGCTGNT 

GACTCANAANGTNACACTTNATTTTCTCCATTGCTGATATAATCATATCTGCAACATAAA

AGTGGGCATTTTCCTTTTCTACATCAACAGGCAGCACAAATACCTCTGGTGAGAAGGAAT 

TCNAAGAAATGGTTNTTCTACTGACTTGAACAGGACCTTCATCAGCAGCAGATGTCAGAT 

GGGAAGGC 
Sequence 648

CCGGGCAGGTACCACTAGATTGCCTCCCTGTGCCTGGGCAATTTCAGAAAATGGTGGTTT

TCCTTTCGTTTCCATCTTTTTTMGACTTAAAAAGTATCTGCTCTCATTTTCTCCTAGCG 

GCCTCCATGCCTTGACTCAAAAAATGCTGTCTTAGTTGACAGCCTTGAAATGAGTATGAC 

CCTAGCTCTAGTTGGGTGGAAATCACCTCGCATAGAAATAGACCTGGAGGGCCGGGCACG 

GTGGCTCACTCCTATAATCCCAGCACTTTGGGAGGCCCAGGTGGGTGGATCCCGAGGTCA 

GGAGTTCAAGACCAGCCTGGCCAACATGGTGAAACCCCGTCTCTACTAAAAAAAAAAAAA 

AAA 

Sequence 649 

AGGTACTAGTATGAAGGAAATAATATCCACACACTGATACTGGTCCAGCNGAAACCAAGA

CCGCTCCTGGTGCATTAACTTTTAACAGAGCANGGACTCANTTCTCTGAAAATAGTGCCA 

TAAAACATGTGCTCCCAGAAGMTAAATATTTGGCTTGCTAGAATTTCTGCNGCTTTTNT 

GTAAAAGTTGA7TATTCGGTATTAAAGAGGAGTATCAAATATGNGTNNATGNANNAAAAA 

CTTGGAAANAGTANNGGACCNNGGCTTATCTCNTCATTTTCATTCTGCACACTNCAANTC

ANTCNTTTTCCCATCTTNNTTCCCNTCTCTGNAATTTATCACCCTCCCCCTCT 

Sequence 650 

GGCGATGGACTCCACCGCGGTGGCGGCCGAGGTACTTACCACAGAGAAAAGCCAATAATC 

ACAATATATGTTGTCTTACTGACCCATAACCCATTTTCTGAGGGTGGCAGGCATTGTGCC 

CCCACTGTGAGGTGGGACTATATACATATACAAAGGAGGTGTTTAACTGGGTGGCATGTC 

TCAGGGAGATGTAANGACTTACCTGCATATCCTGGCAGTNTTGAAATGATAGTGAACTGT 

TCNTANGGCNTCCNCTTGATGGCATAGNCNAAACACCAGNATTTTCTTGGNAGAATGA7T 

CGGNAAATGCTACATAGAAGAAATGGNTGGTGAGCTNTTACTGTGACTGTGCCCATAGTA 

AGTCATCCTGGACCCTCTGAATCTTATCCCAC 

Sequence 651 

GCGAA TTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTG I I I I I I I I I I I I I I I II I I I 

TTTTTATGATATTCATCTATGMCCAATTTAATTTTAATTTAAAAATGACTTCTGATCTG 

GCAGATGA7TTGGGTTCTAGAAAGAAATTGTGCCAGGCATGGNGGCTCATGCCTGTAATC 

CCAGCACTTTGGGAGGCTGAGGNGGGAGAAACCTGTNAGCTCAGGAATTGGAGACCAACC

CTGGNMCGTAACAAGACTTNTTTTTTACAATATTAATAATAAAAAAAGGCCAGGTGCCG 

GNGGTTCACACCTGTAATCCCAGCACTTTGGGAGGCCACCAGACAGGTGGATCATNAGGT 

CAGGAGTTCGAGACCAGCCTGGCCAACGTGGGGAAACCCTGCCTNTACTAAAAACACAAA 

AATTATCTTTGCTTTGNTGGCGGGAGGCTNTAATCCAGCTTCTAGGGAGGTTGAGGCAGG 

AAAATCNCTTGAATCTTGAAAGCAAAANTTNCAATNAGCCCNGGTCACACCATTTGCCTT 

CCAANCTGGGCAACAAGAGCCAAATTTTN7TNAAAAAAAAAAAAAAAGGGCCGGCC7TGG 

GGGTNNTCCCNGGAACCCCACNCNTTNTGAGGCCCCACCGGTGGTNNTAGGGNCNGAGTT 

CAAAACNNCCTTCCACTTTTNGMCCCGTTTTTTAAATTCCAAAAATTNCTGGCTGGATA 

AATCCGNAGCCCCTTCTTGNGGGTGNGGNNGGANTNTTTCCCNCAANNGGGGGTGGGTNA 

NNNCCGAMNCCCTTATTTCCCCTNTGNAAAGGGNCTTTTCNAAAAAAAAAAAANA 

Sequence 652 

CCCGCGGTGGCGGCCCGCCCGGGCAGGTACAGGTAAGGCAGAAGGAAGGAAGGGCAAAGA 

AACAATTCCAGGGCCCTGGTTTCTGGGATGACAGGCTTCCCAACACTCATGCCAGGACTA 

TTTTCCACCTCGGTTCACTATGGG I I I I I I I I C I I I I I 1 1 I I AATATAATGAATTTTTAA 

AATGTGTGTTTGTTGCCCAGATTATTCCANAAAGAGTTGAAGGGAGGAAAGGNGTGCNTG 

GGGTGCNTGGGANTTTTANCCCTCTNTCCACCCNGATTTCTAAGTTGGGGGGGGCATCCA 

AACAGCTTCACCCANGTGCCCAGGCTNTTTTTTGNTNTCCAAAGCCAACCCTTCCAGGGC 

ANGGANGGGTGAAGNTTAGGAGGGCAAAGGTTAGCCTGGAGGCTGCAATTAACAAGAATC 

AAANTGGGGTTTAAGGATTCTCACACCCAGTTTGCTAATTTCAGCTGGTCCTTGTAGAGG 

TGACACCTAGTAGGACAACATGGNTTTTNGGGCAGGGCTGGGGTGCTGGTCTCTGCTTTC 

TAGGGTAGAAAGGAATCATACATTGAAAATGCTTAAATCGATGGAATGATTTATGTTCCT 

NATCTTTCATCTTTTTCTGNGTGGCTGGTTTTCTGCCANCCTTACTTGGACAAGCACCAT 

TCTANACCTTTTCTNTAGGCATNTCCNAGAANGNGAAGTNGAAAGGAAGAAAAAAAACTT 
ATTTTTTN NG N CCAAAAATTNG GC AAAAAA AAA 
Sequence 653 

TCCACCGCGGTGGCGGCCCGAGGTACGCGGGGGCCCTTCTATCTCAGGATGTTTGCACTT 

GCTATTTCCTTTTCTTAAAGGCTCATCCCTAGATATTTGCATGACTGGCTTCCTAATTTN 

NTGTAAGCTTTTGCTGAGAAGTTACTTTACCAACTGTCATTGAGGTTTTCCCTGAACATC 

TTAGG TMGATAAC MGCTCCCCTCCTTTTCTTTCCTCACTTCTTGGTATTCCTTATCTC 

GTAACI I I 1 1 I I I I NGGGGGGATNGANACTTNCGTNTTGNTTTTGTTGGCCAAGCTTGGA 

GTGCANNGGGTGCANTCTTGGCTTNACTGAAACCTCCACCTCCCGGGTTTAAAGCGATTC 

TTCTGCCTTNAGCCTNCCGAGTAGCTGGGACTACGGGCAAGTGCCACCACACCCAGCTAA 

I I I 1 1 IG TATTTTTAAGTAGAGGTGGGGTTCACTGTGTTAGGATGGTCTCTATCTCCTGA 

CCTTTTGGGCCACCCACCTCGGCCTCCAAAGTGCTGGGATTATAGGTGTGAGCCAGTGCN 

CCCGGCCTCTCATAATTTTCTTAAATT 

Sequence 654 

CNAATTGGAGCTCCCCGCGGTGGCGGTCAGTTNGTCTTAGAGATACCCATGAGGTCACCT 

ACTCAAAATGGGGCTCAGAGTAGCCTTGTCCCATTCTTGTCCAGTGGGCGCAGCTACAGT 

CTNNCTGGNNNGGAGTGACTGGAGGCTGTCCCCACGTCCCACTTCAGTGAGGCATTCATG 

TGCACCCAGCACACTTTCTAGCTTTATTTGCCTGGAGGGGAAGATTCTCCAGAACCTTGT 

TAAGATGCACAGNGNGGGCCCTTGGACTGGCAAGTGTGGCCTTNGGCAGTCCCTNGGAGC 

TTGTTAGGAATGCAAAATNTTAAGCTTCTTCCTACTGNATCTAAAGGTTGANTTTAAACA 

AGATCCAGCTTGTTTCGTTTCACATGAAAGTTGAGGCACACTGCTCTAGAAAGTTCTTTT 

ATCTTTACTGGCCCACCAAAGTAATCAAACTTTGNGAAGTACCCTCGGNCCGCTCTAGAA 
CTAGTG 

Sequence 655 

GCTCCCCGCGGTGGCGGCCGCCGGGCAGGTACGCGGGATATGAAGTGAGGTTAAGTCAGA 

TGGAATGGCAGTGGACTACTGTTTTTGGTTAATAAATCGAGATACCCTTAAGAGTTGTGN 

NCTGAACATACTGTCTTTCTTTCCCCAGTTCCATGTCACAGCACCTGCCTAATAATAGGT 

GCTCGAAAAACATCTGTTGAATGAAATGAATTCTTTTGTTTGCAGTAGGGCAAAGAAGGG 

TAGAGAGGAACNACTTTGCCAAGCTGATNTGTAAATGTTGCAAAAGGGTTTNGGCCAGAA 

AATTCCNANAACCCATTNGAGAGGCAATACATGTTAAGGGACCTNTAAGATGTTTCACAA 

CCTTGGAAATAATTAMMGMCTTTCTACTGhrnTACTTATTTCCCACTCCTGGCTGCC 

CCTCTTGGGTGGACTGCCTNCTGTTGGAGGGAATACTGNGTGAGACACATCTTTTAGTAA 

AACAGAAATGTGAAACCAACTTGCAGAAATCACAAGCACACTGTTACCAAATAGGTCTTG 

ACTGGCTCCCTTNCTGGGGGACAAATGTTTTGATAATGTCTGTCAGTAGATTCAGTTCCC 

TATT TCTTTT TMGAC TTGA TATTTAAGAATACTGGTTCTTTTTTGGCCAGCATCGCAAN 

GAAG l l l l l CTTTAACTTTTGGGCCAAAAAAAAAAAAAAA 

Sequence 656 

CGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTACAGGATGTTTTCCTAAATTTTAAGT 

CAMTTCTTCCTTGACACATACCTATTTTTATTTTGTTTTGGTTCTCATCTCTGTGAACA 

GAGCAAAGCATGCMCCATTGTMCACTTTCATTTGTTTTTATAAACTCAAGTTCTAGAG 

TTGGATTTCATGATTTGCATAACTCGGCATAGfGTAAGTGCTTGTAGTTTTAAACAGAAA 

AAGAGGGAAGAAATGACNATTCCANAAAAAAAGATCAAATCTTATGACTGTAATTTATTA 

AGGNATCCMTGGAATTCTTTCCCTTTTTCTTT C I I I I I 1 1 I I I I I I I AAAGAGACAAGC 

TCAAGTTCCATAAGCTGGGAATGCAGTATCATGATCCATAGTTCACAGCAGCCTTCAACT 

CCCTGGGGTTCAAGGNGATCCTAAGAACTTGNGGGCCTCAAGCAGTCCTCCTGCCTCAGC 

CTGCCAAAGTGCTGGGGATTACAAAGCATGAGCCACTGCTCCTAATTCTTAAGAGATA 
Sequence 657 

GAATTGGAGCTCCCCGCGGTGGCGGCCGGGCGCGGTGGCTCATGCCTGTGGTCCCAGAAC 

TTTGGGAAGCCGAGGCGGGCGGATCACGAGGTCAGGAGATCAGGACCATCCTGGCTAACA 

CGGTGAAGCCCCGTCTCTACTGAAAATGGAAAAAATTGGCCGGACCGTGGTGGCGGGCGC 

CTGTGGTCCCAGGTGGCTGGATACACGGGTGTGCACCACCATACTGGCTGATTCTTGTAT 

TTTTGGTAGAGATGGGGNTTTGGCCNNGTTGGTCCAGCTGATCTTGAACTCCTGCACCTG 
CCTNGGCCTTCCAAAGTGTTGGGATTACCGGTGTGAGACACTGGCCCCTGGCTATATTTT 

ACTATTTGGAAATCACAATGCATCTTAAAATTGATGGCTTCTTGCAACCACTTTCAACCA 

GGTGCCTGTCATGATTTAAGTGCTAGCATCAAGGCAGGTTAGTTATGAAGAAATAGAGTG 

TGTGTTTATATACTCACACAGTTAGAAATCGACCCTTTTAAAMTTATTTCTTTTTGAAA 

A 

Sequence 658 

AGCTCCCCGCGGTGGCGGCCCGCCCGGGCNGGTACCGCGGGATATGAAGTGAGGTTAAGT 

CAGATGGAATGGCAGTGGACTACTGTTTTTGGTTAATAAATCGAGATACCCTTAAGAGTT 

GTGTTCTGAACATACTGTCTTTCTTTCCCCAGTTCCATGTCACAGCACCTGCCTAATAAT 

AGGTGCTCGAAAAACATCTGTTGAATGAMTGAATTCTTTTGTTTGCAGTAGGGCAAAGA 

AGGGTAGAGAGAAANCAACCTNGCACAAGCTGNTTGTTNAATGTTGCAAAAGGTTTAGGC 

CAAGAAAANTTCNAAAACCCATTNGAAAAGCATACATGTTTAGTGGAACCTTGAAAATGT 

TTTCACAACCTTGGGMATMTTTAMAGTAACTTCTACTGGTTTTACTTATTTCCCACT 

CCTGGCTGCCCCTCTTGGGGTGGGACTGCCTCCTGTTGGGAGGGGAATACTGTGTGAGGA 

CACATCTTTTAGTAAAACAGAAATGTGAAACCNACTTTGCAGAAATCACAAGCACACTGT 

TNCCAATTAGCTTGACTGGCTTCCTTNCTGGGGGGACAAATGTTTNGATAATGTCTGTCA 

GTAGATTCAGTTCCCCTATTTCTTTTTAAGACTGATATTTMNAATACTGTTTCTTTTTT 

GCCACCTCGCANTGGAAGTTTTNTTTACTTTTGGCCAAAAAAAAAA 

Sequence 659 

CCGCGGTGGCGGCCGAGGTACTGGTAAAGGGATAGTCACATAGATCAATGAAAAAGAACA 

GAGAATCTGTGAACAGACCATGCAAATATGCCTGCCTGGTTTTTCACAACAGTGCAAAAG 

CAACTCAGCCAACAAAAGACAGCTTTTGGCCAGGCCGCAGTGGCTCACTCCTGTAATCCC 

AGCACTTTGGGAGGCCCGAGGCGGGTGGATCAACGAGGTCAGGAGATCAAAGACCATCCT 

GGCTAATATGATAAAACCCCCGTCTCTACTAAAAAAACACACACCCCAAATTGCCCGGTG 

TGGTGGCAGGTGCCTTTNGTCCCAACTNCTTTGGGANGGTTAAGCAAGGGANAATGGCNT 

TGAACCCGGAANGGAAANCTTTGCCNTGGGGCCCANATTTNNNCCCNTTNNNCTTCANCT 

TTGGGTGNANAAAAANCNGGACTTGGTTTCAAAAAAAAAAAAA 

Sequence 660 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTTTGAAGGTGAGCTTTGAA 
GATGCAACATGAATTTGACAGTANAGATGTAGGGAGGAAGGAAGGCAGGACAGGTCAGAC 
AGAAGTGCAGGAACAGCCCAGGCCTTTGCAGCCTTCCCACACCCCCTACAAGACCTGCC 
Sequence 661 

ACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCANGGTACGCGGGGACT 

TGACTTAAACTCTGGGGCCCGGGAGGCCGCCGGTTTTCTCCCCGCTTGCCGGGGTGGTCC 

TCTTCCCTTTGTCGGACCAAAGAAGTAAACACTGTGTGGAGAGGGACTGACGTGTTTGGA 

GGGAAATGGGAATGTACCT 

Sequence 662 

AGGTACTCCAAGCTCTGAGACCACCTCTTCTGCAAAGCCTTCCTGATTCTGCAAAGAACA 

GGTAGGCATTTCATCCTTGGGACCTCACAGCAATTCAGGACACATTTGTGTCCCAGCCCT 

GCTTGGCTTGGCTGTCTCCATGAATATACACTTTGTAACTTCTGCACCAGGCATCATACC 

MGCACACAGTAGGCACTCCTGTGTTTTTTGAATAAGTGACTATATCATCACCACATTTC 

AAATGCGGAATATATGAGCTACTAGAAAAGACATAAGGGTAGATTTTACATCTTTATTGT 

ATCCTAGATATACAAGTCTATTACTGCCTTTTCCCATGTTCTGTCAACATAGCATAAAGA 

ATGTGGATTTACCTGTTAGAAATTGAATAAGCGGCCGCTCTAGAACTAGTG 

Sequence 663 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAGTTACACGGCAATTT 

TNTTAATAACTCCCTTTACTATGTCAGACAAGCTATGTCAAGCGTCTTCTGTATCCTNTA 

CGGGGAAAAAAAAAAAGTTAACCAGAGCCAAATGCTTGCTTTCAAAGATAACTTGCCATC 

CTGAAMTATAATTTTTACAATTCAATAACACCTTTTTGAAAAATAAAATAATNTGGCAA 

MAT GCCCA TGCATTAACAAACCATTTTTTCAGTTTAATCTCTTTATATGTTCAACTTTG 

ATGTATTTTAAATAAACAAAGCAAAATTCAACTAAAAATACAATCTGGATTCCATAGCCA 
ANGGTTTTATTTACMTTCCTANTAGGAAGGCTTTATTTTTAGCTNTCAAATGGGGNNGG 

ACCTATMGGGAAATTTAAACCGTTTNCNTTGAGTTTTTNTNTTTNAAGGGGAANGGGGG 

AGGAN TTCC CAAAATGGGGAMGGGGAAAAAAAGGGGGNAANNCCNTTTTGGCCTTTTNN 

GGNANTTTTAAAAAAAAANTTTNCCCCCCCGNGNCCCCCAAMAAAMANNMNNTTTTT 

TTNAAAAAAAAAAAA 

Sequence 664 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGGGCAGGTACTGCTGGTTTTCTGGT 

GTTCCTACAAGCTGCCTAGGTCTCTTTTGTTCTCAGCAGTTCCCAGGCATGCAAAAGTTG 

CCAGTTCTGTCAGCATTCCAAGTCAGGTAAGACAGAAAGCCATCTCTTAGGCAGTCCCCA 

GAAMGCTGAAAGGTTGGATATACTTTCTACTCTTCTCTTTTCTTCATGAGAGAAAGGCC 

ATGTGGGCATTTTCTCCCAATAACACTGAGTTCTGTTGTCTTCTGTCGGCTGTGCTGCAG 

GTTCTCAGGTGCTGCAGTTAGCTGCT 

Sequence 665 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGTAAGATTTCCTGAAGGGATCCATAGCC 

AAAACATGTTTGAAGGCCACTGGGCTCGCTAACTTCTAAAAGCACCCCAGTTCTAGCAGA 

CATCCTAAGGAACATTCCCAGGAAAATTCCAGCCTAGAACCTCCTGGGGTCTGACAACCT 

TAGAGMCAGTGCTGGCTTTGAATGGGCTTGGGGGCAGCCTCGAAACCCTCTTTCCAGTC 

TCCATGCAGGCAGGGGAGCTCCTTAAGCAACACATAGGACATTTCTGGGAGAAATGGGAT 

CCCCAACACAATGAACACTATAGATTTTAATGGTCTATATGGTTAAATACACAAGGCCCC 

TCATTTCCAACCCCGCCTGTTCATCTGATTCATCTGTACCTCGGC 

Sequence 666 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTCAGACATTCCCCCAGTGGCT 

GAAGTGGCATATGAATTATGAAGTTGGATCATTTGGAATGAATGTAAGAGAATTGCCAAG 

GGCTCCTCCTACTCCAGAGAGGAAACCTCATCCAGGGCCATGAAGCCACTTCCTCACCAT 

CTGTGTGCTGCTTAAGCTAATGCTGCGGGAACCATGGTTCCTTGGGAGGAATCAAGCTGA 

CTCTTGGCATGAGATTCCTGCCTTCCTAGGGTTGAGAGCGGCACTGCCATGGCTTCTCTG 

GACGACCCAGGGGAAGTGAGGGAGGGCTTCCTCTGCCCTCTGTGCCTGAAGGATCTGCAG 

TCTTTCTATCAGCTTCACTCACATTACGAGGAAGAACACTCAGGGGAAGGACCGTGATGT 

CAAAGGGCAAATTAAAAGTAAGAGGCGAGGACCTTGCCTACCCCCTGCTTGTCGTTGAGA 

GCTTTMCTCACNGGATAGTTCTTATCACTTTTGGTGGTGGCACAGGNATATGATAATTA 

GTAGTAGCCAACAGATGACTAGTNGTTGTCATGTGCCAAGCGTTTTAAAAGTTNCCTGTT 

ATTAATTTCATTA 

Sequence 667 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGAAGAG 

AAAGCGTGAGGGCTGGGCCTGCGGCGGGCTTTAGGGAGTGGTCCCTGGCTGTGGATAGAT 

CTGCTGATGAGTCCAGGCCCCGGTCCATTCTCCTCGCGCTGCAAGGATGCTCCTGGGATT 

TCGGAGAGGCCGCAGGAGTCATTTCAAACACATCATCCATGGCCTTTTACCTGCAGCCAG 

CGTTGCTCCGAAGGCAGCTGTGCCACGCACACCTCCTCCCCGCAGCCCCMCCCATCTCC 

AGAGAGACCAAGATCTGCTCTGGCAGCAGCCATTCTGGCGACAACATTGAC 

Sequence 668 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGGATAGGAGAGGGAGGAGGG 

CTGGCAGCTGAGTAGCCAGAATCAGGCAGACGGTGGTAGCAGAAGTCAGAGGCCGAGGGA 

ATCAGGGAAAGGAGTTCAACCATGGAGGACCTTGCTGGCCAGGTTAGAGACTGTGGACTT 

TTGTCTGGGTGAGACAGGAAGTCACTGGAGGGCTGTGACAGAGCTCTGAGGCTGTGAGGC 

ACTGCTCTGTCAGTGCCATGAGTGGGGAAAACAGGAGCTTGCTGCACTGGTAGAGACCAC 

AGATAATGATGACTTGGACAGAGCAGCTGGGAGAGAACTAGTTCAATAACCCTAACACGC 

CTCTCCATTCTGCATTTTCCCTAAAAATGTACCTGCCCG 

Sequence 669 

CCGCGGTGGCGGCCGAGGTACCTGCCCTATCTTGCTGAATGTTTTATAATCTAATAAAAC 
TCAGATAAAGACCCAGATGTCACACCTGAACAGGAAAAGCTGAAAGGAAAAGATAATTAA 
AATATAAATCAACAGAATCAAGATTTTGAAAAGACCTAGAAAACTTGAAGGATTACTGAA 
GCCAA GCAAGAGGAAATGATAGGATTAAAAGTAAAATCTTGTGTTTAGA I I 1 I I 1 I I I I I 

TTTTTCAAACGGAGTCTCGCTCTGTCACCAGGCTGGAGTGCAATGGCGCAATCTTGGCTC 

ACTGCAATCTCCATCTCCGGGCTCAAGCAATCCTCCCACCTCAGCCTCCCTAGTAGCTGG 

GACCACAGGCATGCGCCACACGCCTGGATMTTTAAMTATATACATATTTTTGTAGAGA 

CAGGGTGCTGCTTTATTGCCCAGGCTAGTCTCAAACTCCTGGCTTCAAGGCATCCTCCTG 

CCCCAGCTTTCCAAAGTGCTGGGATTACTGGTGTGAGCCACTGTGCCGGGACATAAATAG 

TTATGCTGTATTGGTTAAGGAATAATGACA 

Sequence 670 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGAGGTCATGCCCGTGTGAGCCAGGAAA 

GGGCTGTGTTTATGGGAAGCCAGTAACACTGTGGCCTACTATCTCTTCCGTGGTGCCATC 

TACATTTTTGGGACTCGGGAATTATGAGGTAGAGGTGGAGGCGGAGCCGGATGTCAGAGG 

TCCTGAAATAGTCACCATGGGGGAAAATGATCCGCCTGCTGTTGAAGCCCCCTTCTCATT 

CCGATCGCTTTTTGGCCTTGATGATTTGAAAATAAGTCCTGTTGCACCAGATGCAGATGC 

TGTTGCTGCACAGATCCTGTCACTGCTGCCATTGAAGTTTTTTCCAATCATCGTCAT 

Sequence 671 

CCGCGGTGGCGGCCGAGGTACAAGGMGGCCTTAAAGACTGCCCCACTCTCCTTGTTTCC 

CATCCCCTGTCCCTTCTACTTCTCACATTCACCACTATGTGCCCTAGGACAAAATCAAAT 

GTGGAAACATTTGGTCATGTCTACTTTGTCCAAGGGTGGGAGTTCTTGAGGAATTCAAGT 

GGGAACTAGAACMCTTTCTACCCTTTCCTTCCCTTCCTTCCTCCCCACCTCTACCTAGA 

AGCCCATCAATCACTTTGAACTTCTTGGAGAAAAAAGGAAACAAAAGAAAAAGAAAAGGA 

GAGGCTGGGTGCGGTGGCTCATGCCTATAATCCCAGCACATTGGGAGGCCAAGGTGGGTG 

GATCACTTGAGGTCAGGAGTCGAGACCAGCCTGGCCAAAATGGTGAAAACATGTCTCTAC 

TAAAAATACAAAAATTCGCTGGGTGTGGTGGTGGGTG 

Sequence 672 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGACACTGGTGGGG 

GAGAGTCCGACGCGCCTGGCTAGGAGCGCCGACCGCAGGGCCTCTACGGACCTTACTAGA 

AAAATGAAACCTGATGAAACTCCTATGTTTGACCCAAGTCTACTCAAAGAAGTGGACTGG 

AGTCAGAATACAGCTACATTTTCTCCAGCCATTTCCCCAACACATCCTGGAGAAGGCTTG 

GTTTTGAGGCTTCATGCCAGAAAGGGGAATGGGGAATGGCTGCTTAACGGCATGTNTTTT 

TT 

Sequence 673 

CGCGGTGGCGGCCGGCCGGGCAGGTACACGATGAAACGGGGGTAAGGAAGGAGAAGAAAA 

ACATTGAAAGGCATTTGACAGGGTAAGGTTGTATTCCCCAGACAACCCTGTCAAGCAGCT 

CTGAAGGGATGATGAGCCTGGACTCTCTGGACTCCTAGATTATGAACTCCTGCAGTGGAC 

CATGTCCTATTTTTTGGAGGCGTTGGGGGGAATTGTCTTACGCAGCACCCAAGCACACTG 

CTATGCAATGGACCACAGATAGGAAGCAAGCACTGCATTTGGCTCCCCXGCGTACCT 

Sequence 674 

CCGCGGTGGCGGCCGAGGTACTAAATCATTAATTCATCCTGAGCTAGTGGCTTTATTAAT 

GAGTATCTCACAAATACCACAAAAATTCAACCTGGCCATGTGGAGCAATATAAAATTATG 

GCATTTCTTGGTATGTTTTTCTCTTTGGCGAGGAGACAACTTGATCTTGTGTTTCCAGAA 

GCATGTTAATTTGCCCTGCTTGCAGAATCTCTCTGGCTTGAAAGGAGATTATATTCATGG 

CAGTCTGTGAATTTTCATTTTATTTCATTTATTTATTTGAAGACAAGAGTCTCACTCCAG 

CCTGGGTGACAAGAGCAAGACTCCCGTCTCAAAT 

Sequence 675 

ATANGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCGCGGGGAGGTACCATGAGGGGAGA 
ACCGGCAAGGGGTGCCATTCTAGCATCTGGGTGGGAGAGAGGAGGCTGAATGCCAGGGGA 
MCTTCTTGGAAAAAGTGATGCTGAGTTAGGACAATTTAGTCAATGAGAAGGGATCTGGC 
TGTTCCTGGCAGTGGAGACAACATNTTTAAAGGCATGGGAGAATATCTAAAATTTACCCT 



Sequence 676 

AGATAATAACATCTGATATCCACATGGGGTCTGGAGGNGCAAGCCACCTTCCTTTCATCC 
CACGGTCTCACAGCAGCCCTGGAAAGAGGCTGCTCTCTGTTGGAGGCTAAGGGCCAGTGT 
TGGAAGGAGCTCGGGTGGAAAGTGTGGTCTGCATGAGGGGCTCCCGTGAATAGAGGAGAG 
GGGTGGCNGGTACCTGCCCG 
Sequence 677 

TACTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCANGGTACCTGCCTC 

TGCCAGATACCCCTGAGGGAAGAGGATGTTCTATAACCAGGCCGACAGGTTAGCATTTGT 

GMCACAGTTCTGACGTTGTTGGGAGGGTTTGTTTGCCAGAAACATCCCCATGCGCTACT 

CTTTCMCCAGAGGTCAAGAAGTCCTTTACTTTTGTGTCI I I I I I GTTTGTTTGTTTGAG 

ACGGAGTTTCACTCTTGTTGCCCAGGCTGGAGTGCAATGGCGCAATCTCGGCTTACCACA 

ACCTCTGCCTCCCAGGTTCAAGCAATTCTCCTGCCTCAGCCTCCCGAGTAGCTGGGATCA 

CAGGTGCCCACCACCACGCCTGCTAATTTTCATACCCGCGTACCTCGGCC 

Sequence 678 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTCTGTGGTATTTCA 
CATAATATTAACCAGC TGTTA GCAATGACTGATATATACTTCCCATTGAAAATGATGTAA 
G GTCTGM AG GATT CATTTTGACAATTTTATATCACATATTTATATTTACCTTAGGTGGT 
TCTTTTTAATGTTTTAATTTGGGACCACACTMTTTC 

CMAATTMTACCAAGCCAAGAAAAATGGTTTCATGMTAGAATCTACTAGTCTTTTATA 

TCTT ATAAT GGTAGATCACTGATGAGGTAGAACTCCATAAGAGCTTCNCTCTCACAGTNA 

MGGTTTTGGTTGT GCATG GATTACACCTGGTGAAAGTTGGTTAGTATTTGTCTAAGTGG 

CTTMGACAAATTTATTTTGATTTGTATTGTGAATGACTTTGCGMNCACCCAGAATTT 

TNCCGCTTCGTGTGTNGTGTGTGTGTGTGTT 

Sequence 679 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGCAGGTACCCACATAATAGCTC 

AGTGCATGCAATTTACAAAGTAATAAGTGAAATGCTCCCCATAGTTGACTATAACATTTC 

CTCATTTTTCTCTGAATTTGCTTTTTAAAAAACTCTTCCCCTTGCCATTCCCTTCCCCAT 

TCCAGATTGTAACTGCTTCTTTCCAGCTGCATCAGAAGAAGGGGACTTTCCATGTAGGTG 

TTATTCTCAGAAAAGGCCAGAAAAGACCAGGTCATGGTGGGGATGATTTGCTCCAAGCAT 

AAAAGAGAATTGTGATGGTTCAGGAAGACTGGAAAATAACGAGACTGGAAAGAAATGAGA 

AGGGCTTCAGAGGAATGGCACATTGAAATAAAAGGGAAGTGGTAAGAACAGGAACCCAAG 

NGGAATGAANGGGCNCACAGTGGCAGGGATGATTGGATAGACTGTGGAATAAAAATAATT 

TG 

Sequence 680 

AGGTACAMCTGGCTTCTTCTCTTTGTCACCAGCACCTGCTTCATAGTCTCTCTGGAGTG 

CCAGGMCGGGTCATTTAGATTAAATCTCCCATACCGTTCCTGGATAAATACCTCCTTCC 

TGCGAGCCCGCAGGGCCTCGATGACAAGGTCTCTGGCCTCCAGCTCCCCTTCCATCACGC 

TGAGGAGCATCCGCAGCTCGGATTTACTGAGAGTATCCACATCAAACTCTTTTTTCAGTT 

TTACAAGTGGAAATTAAGCAGTCCTCCTCCCCGTTTCTCCTTCCATTGCCAGGCTCAGCT 

CCTCTCACCCCAAGTACCTGCCCG 

Sequence 681 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGGTGGAAAGTGAGGTGGTTT 

TATTTCTAGTTCACCATTGTCCTTAGGCTGTATAGACCTCTGGAATCCCAGCTTATGTGG 

AGAAGGTATCCTGTTAGACTTCCCTCCTTTGGTCAGCACTGGGCCTTAACTTCTGGCCCC 

TCAAAGCTGCTAAAACTGAAGGCCAGGCTTGCCTGGCTTGGCAAAGGACGTCGGGCAGAA 

GCAGCTTCTCCTCTCCTCTTGTTTCTCTG7TTCCCCTCACCATAGGCTTTGGCCTGGGAG 

TTTTCCTACA 

Sequence 682 

CCGCCCGGGCAGGTACCTTCTTGGTTGCTGTGACTGTCTGCTAGCACTAAGACTGTCTTA 

AGCAGATAGAGGGCAATGGTCTTTGAAGGCAAATGACAAAGCGTGGCCCTGAGCTCCCTG 

ACTGAGTTCATTTGGACTCTCAAGGGATGCCCTGGAGCTAGACTCGATCTGAGTGGTTGG 

ACTAACTCCTCTTTGTTTTTGTATTGMGAGCCAGCTTACCCCGCCATTTNTAAACCTCA 

GGCCAGGAAAACCAAAAAACAAAAAAACCCAACCCAAACAAAAAACAAACCCCACCTTCT 
TNAGAANTNAGTAANCTTAAGGCTTNAAGAATCAACAGNGCCCCTTTGGGNATTAAGGGC 
CATT 

Sequence 683 

CCG CGGTGG CGGC CCGA GGTACTATMMTACTATCCTMCTTTTTATGTGTTTTTTTAA 

CTTGTTTTTTAGAAGTTTTGTAGCGGTTTTTAAMATGATGTATTTA 

TGCTAATATCTGTATCTTTTACTCTATMCCTAATTTTTACATTTTCAGAMMMTTTT 

TACAACAATGTAAAAAATACATGGCCCGGGTGCGGTGGCTCACGCCTGTAATCCCAGCAC 

TTTGGGAGGCCGAGGCGGGTGGATCACCTGAGGTAAGGAGTTAGAGACCAGCCTGGCCAA 

CATGGTGAAACCCCGTCTCTACTAAAAGTATAAAAATTAGCTGGGCATGGTGGCAGGCGC 

CTGTAATCCCAGCTACTTGGGAGGCTGAGGCAGGAGAATCGCTTGAACCCAGGAGGCAAA 

GGTTGCAGTGAGCCCAAGATCCGCGCCATTGCACTTCTAGCCAGGGAGAGAAGAGCCGAG 

ACTTC ATCTT A AA AA AA AA AAA AAA A AAA AAAG GTC 

Sequence 684 

CGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACACTCTGGCCCTGGCTTTATTT 

TTAGATTTTCCTTTCCCGGTTGATATCGGAAGGCACAGAGGCAGGAGGTGGGGTGGATAG 

TAATGTGTGCCCCCTTGGGGGTNANAGTGAGGTGGAGGGGGATGTTAATNACCATGAGAG 

GCAGAGGGTCAGNCNANTTTCCANNGCTTCNNGCTTCCTTTAAATGANGGAAAACACGTG 

CANGTNTTAGGAGACAAAGGAAGGGAANTGACTGTTTCCTGGCCTGGTNTGTGGGCCCAG 

TNGNCTGNTNCNTTCAGTGNCTNCGTGCANTTNGACTNTACACNTANGNNGGCAGGCATA 

GGTGTNCGGTTNTGAAAGACNGNNNTCTTTNCACATTCTCTNCTGCTCTAGGGACTGAC 
Sequence 685 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTTAGTAGAACTCTA 

GGAACTGACCAAACCCTTTTAACAACCAGGGGAGTGCTTGATGAAGGAAGAGGCTACCGA 

TCTTTCATAAGTATGAATAAGCAGCATGCATAAACCAATTACCTTCCCCTATTCCTCACA 

ACCACCACCACCACACCCACCACCTCTTGTGGCAGTGGGGATAGCAGCCCATGTTCCTGG 

AGTTGCTAACCGGTGCCAGGAGGGACAGTAGGGATCATGTCCTTCAAAATTTAGGGTTGT 
ACCT 

Sequence 686 

CCGCGGTGGCATGCATCAAGGTGACAGGTGACGGCATGGTTATGGATTAACTCACCAAGG 

AAATGAGTGTGGAAAGAAGAATGCANAAATCTGAGGACTAGAGCCTGGAGATGGGGAGCT 

TCGAGCTCAGAGGAAGAAGAGGATCTTCATCACGGGGAGACATCAGCCTTCTGAGTATCT 

GGGACTGCAGGTTATGTGCCACCACACTCGGCTAATTAAAAMMTTTTCTTAGAGACAG 

GGTCTCTCTACGTTGCCCAGGCTGGTCTCAAACTCCTGGGCTCAAGTGATCCTCCTGCCT 

CAGCCTTCCAATGCCTTGGGCTT 

Sequence 687 

ATGCATCAAGGTGACAGGTGACGGCATGGTTATGGATTAACTCACCAAGGAAATGAGTGT 

GGAAAGAAGAATGCAAAANTCTGAGGACTAGAGCCTGGAGATGGGGAGCTTCGAGCTCAG 

AGGAAGAAGAGGATCTTCATCACGGGGAGACATCAGCCCTTCTGAGTATCTGGGACTGCA 

GGTTATGTGCCACCACACTCGGCTAATNMAAAAAATTTTCTTAGAGACCAGGGTCTCTC 

TACCGTTGCCCAGGCTGGTCTCAAACTCCCTGGGGCTTCAAAGTGAATCCCTCCCTNGCC 

CTCAGCCCTTCCAAATGCCCTTGGGGGCTTACAGGCCATTGGAGCCCCACCATGTGCAAA 

NGAAAAGAANAGCAAl I I I I GGACATCCTGCCCAAAACAAACAAAGGTTTGGGCAATGGG 

TCCTGGTCAAGCAAAAACAAGTGGGGTTTGGNGGAATAAACCAACCCTTGGGTTAAAAAT 
AA 

Sequence 688 

NGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGGGGGCCATTGAGACTGCCATGGAAGACT 

TGAAAGGTCACGTAGCTGANACTTCTGGAGAGACCATTCAAGGCTTCTGGCTCTTGACAA 

AGATAGACCACTGGAACAATGAGAAGGAGAGAATTCTACTGGTCACAGACAAGACTCTCT 

TGATCTGCAAATACGACTTCATCATGCTGAGTTGTGTGCAGCTGCAGCGGATTCCTCTGA 

GCGCTGTCTATCGCATCTGCCTGGGCAAGTTCACCTTCCCTGG 

Sequence 689 
CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGGATACTCATTAGAGTTGCTCGGTGG 
AGATGGAATGATGGTGGGGTGCAGTTAAACATGGCTGAGTGCTTTCTGCTTAAGGACCTG 
ATGTATTAATGCTCTCCAGGTCATTCATATTTGGGGGAAGGAACAAAGAGGGTACCT 
Sequence 690 

CCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGCTATGAATCTCACA 

GATGTAATAATGAGTTAAAGAAGCTAGGCACAAAAGAATATTACTGTATGATTCCAATCA 

TATAAAGTTCAAACCAGATCAAACTAATCAATGAACGAGGAGTCAGGATTCTGGTTATAT 

TCAGGGATAGTGATGGAAGAGGGCTATAAGGAGGGTGTCTGGGTGCAGGTCATGTTCTAG 

ATCTTGATCTGAGTGGGGGTTACATAGGTGTATTCACTTCATGAGAATTCAGAGGGCTGC 

ACACTAATGATCTGTATMTGCTCCTCTATAGTATGTCACACTTCAAAAAAGTTTACAGA 

AACAGTTCCTTCCTAATTTTCACAGGGCCTAAGAGCTAAAAACGCAGCCCCAG 

Sequence 691 

NCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGGGGGCCATTGAGACTGCCATGG 

AAGACTTGAAAGGTCGCGTAGCTGAGACTTCAGGAGAGACCATTCAAGGCTTCTGGCTCT 

TGACAAAGATAGACCACTGGAACAATGAGAAGGAGAGAATTCTACTGGTCACAGACAAGA 

CTCTCTTGATCTGCAAATACGACTTCATCATGCTGAGTTGTGTGCAGCTGCAGCGGATTC 

CTCTGAGCGCTGTCTATCGCATCTGCCTGGGCAAGTTCACCTTCCCTGGGATGTCCCTGG 

ACAAGAGACAAGGAGAAGGCCTTAGGATCTACTGGGGGAGTCCGGAGGAGCAGTCTCTTC 

TGTCCCGCTGGAACCCATGGTCCACTGAAAGTTCCTTATGCTACTTTCACTGAGCATCCT 

ATGAAATACACCAAGTGAGAAATTCCTTGAAATTTGCAAGGT 

Sequence 692 

GGGCGA ATTGGAGCTCCCCGCGGTGGCGGCCGGGCAGGCAC ) I I I I I I I I I M < ll I M I 

TTTTTTTNCCATANAGATGGGGCTTTGCCATGTTGCCCAGGCTGGTCTCAAACTTNTGAG 

CTCGAGCAATCTGCCCACCTCGGCCTCCCAAGGNGCTGGGATTACAAGCATGACCTGCCG 

NGCTGGCTAAAGTTTCTTATTTATACTTACTCATTCCTCTAATATCTGGATTTCCTTAGT 

CATCTGTCACTTCTCCCTGCATATTTCCTGTGATGTCTTTAGGTCCCTCCCACTNTTGTA 

GTAGCACTCCCTGGGGACCMTTTCGMGGATGCTGAGTCATATGGTTTTTGGTTTTGAG 

AGGGTTGAAAATGGAGACTCAACTCAATTTAGGAGCTATCCCATCATAACTAGTAGCAAA 

ACACGTCACTACTTGAGTCTCAACAAAAGACAAAAAAGGTTTNAAAGTTGGGGAACAAAT 

AGCTGCCAAGGGTTNTTNTTNTNTGACAAAAACATTGNGTTGGGGATTTAAAATCNATGT 

GAATCCTTAATCCCTAACTCATCCATGTTGGGG I I I I I I 

Sequence 693 

CCNCGAATTGGAGCTCCCCGCGGTGGCCGCCCGGGCANGGTACCTGCTTNCCAGAAGTGT 

TATCATGATTAAATGACAGACCAGTGGCAGTAGCATCTCCTGAGGGAGGGTTAGAAATGC 

ATATTCTCAGGCACCACTGCAGTCTTGCTGAATCTGAAGCTTTGGGGATGGGACCCGGTA 

GTCTTTTTGGATMCTCTGCCMGNGGTTCCAATGTGCTCMGTTTGAGAGTTGCTGAAT 

TAAAGCGCTGGGTCTTGCCAGGCATACCTGTAATTCCAGCTCTTTGGGAGGCTGAGGTGG 

AAGGATTGCTTGAGCCCAGGAGTTCGAGACCAGCCTGGGTAACATAGCAAGATCCTATCT 

CTACCAAAAAAAAAAAAAAAAGTACCT 

Sequence 694 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGGTCTCTGTCTTTAC 

AGCTGAAGCATCAGAGGATGGAGTGACCAGGCTGGTTCCAATGACAGTTATACGGCCATG 

GGGAGTANACATGGAGTCTAATTCAGTGCTTGAGGCTAAGAATGAAGTTGTATGCATTGT 

GGAMTTGTTCCAGGAGATCTTGCAACTTTCAAGTTTGAAGTCATGTCTGTGACAGTCCA 

GGAATNTGATGCAGCTGTGGAAGACCAGGTGGAAGGGTGTTCTGTAGAAGTTGTGCGCCT 

CTCTGTGGCCGGGGTGCTGTCCATGGTACCT 

Sequence 695 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACACATGTATCANG 
GAAAAGAAAACGTTATTTGTCCCACAGATGCTGCTAGGAGCAGCTACCCCAAGACAGGCC 
TTGCACCTTGGGTCATGACAATGCGTGGCTACTGAGAGCTGTTGACAGAGTGGACAGGGC 
CCAGACCAGGACAGTCTCTCTAGAGGTCTTCACCTCCTCAACCGTAACTTAATCAGCCCC 
ATGCCGGGCTAGCCCCATGCCACAAAGGCTCAGAAATGCCCTGCAACATGTGGGACACCT 

GGTAGTATCTACATAGGGGCCAGCATCCATCCCAGCTGCTGGGGGTGGCTCAAGAGCTGT 

GAGGGACACCCTTTCCTGCCTGATACCGTGGACCAGTTTGCAAAGAGCTGACTGTCCTGC 

TAGGCCCA 

Sequence 696 

TAGGG CGAATTGGAGCTCCCCGCGGTGGCGGCCGCCGGGCAGGTAC I I I I I I I I I I I I 1 I 

Mill GTATATTTAGTAGAGATGGGGTTTTACCATGTTGGTCAGTCTGGTCTCGAACGGN 

TGACCTCAAGTGATCCGCCCACCTTGGCCTCCCAAAGNGCTAGGATTACAGGCATCAGCC 

ACTGTGCCCAGCCAGCCCTATGCTTTTAAGAGTTCGATGGTTGAAAGAGACTGAGCGGGG 

AAGGTAGAGCGGGGCAGGGGAGGGACTACTTGGAGTCAAGTCAAAGTTTTAGGGAAAGAC 

CTGAATCTGAAAMGATTATTTAACCTTTATGTGTCTGAAATACTATATTGTGCGAATTG 

TACCT 

Sequence 697 

CCGCGGTGGCGGCCGCCCGGGCAGGTACACAAACACGACAGAAGCCCACGGAGCAAGCCC 

TGTGCTGGCCCCTTCACATGACTTTAGGCCCTCTAGCMGGTGATGTTTATTTACAGGGT 

TGCATMACAAGGCCTCACCATTCAMMACCTTGTATTCTATTACATGTTTCACATTAA 

CAAAGACTGGAAATTCTNTAGGAAAGGGATCTTTTTTATCTACATGAAAAGCACAGGCTA 

GTAAAGACTTGTTGAAAAAGTTGAAAGAACATAAATGTATATGGTATATGCCACATAGCA 

TAATGGAGGAAGATAGCAAATAGGAAACATATTGGTGAGGAAGACTGGAGTTTGATGATC 

TAGTCAGGAAAACATCAAGTTAAATCCTTTACTTTACACCTAAACCATAAACTGGTGAAT 

AAAACAAGTATGTGAAAGCACAANANGAGAGAGGACAGGCCGGGCGCAGTGGCTCACGCC 

TGTAATCCTACACTTTGGGAG 

Sequence 698 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCATCCTATGCAGNCATNC 

TGTNAGNACCCATTCCAT TNTNC TATCCCTGGNTNGCTGGTGTCAATACTNTNAAGCGAN 

TTACTGCNGGNGCTCTNNTTTTTCCCTCANAGATACCNNGTTGATrrCTTTGATTCTCTC 

CATCTCTACAGGCATAATAACTCCTAATATTTAAAAACNCTGTAGAGGGATGNANNGAAG 

CTGNGGNGAGAGCCCNTGGGCTTTTCNCNTGGGTNAAGATGCACATTCCTGAAAATTNTG 

GGCCTTGGCTTAAGCTGNACTAGNGCCGGCCACTCAGCTGATCTCACTAGCGTCACCTGT 

CGCAATGGTGCTGAAGCGCACTNCCNAGAGGCCATAAGGCAAAGCGAGAGTNCNTGGCTA 

TNGACTGGANCCCATTTAAGCAAAAAAACATGCCTCNCGNANGACAAATTCNATCAACAA 

AGGGNGGGCAATACAGGATCTGTACCTGCCCGGGCGGNNCGGGCANGAACC I I I I I I 1 I I 

TTTT 

Sequence 699 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCCAGCCCGCCACCCGGCTT 

GTGTGTCATCCTGGGCCCAGGCAGGTGATGATGCCAAACACAAGGCCCAGCTATGTAACC 

AAGTAAAAACTTTCATCAGAATGCCCATCTTTGTGACCCACAGCCCATTGTCAAGAGCCT 

TCCCTGTGCCAGGAGTTCAGCAGGTTCACCTCCGCCTCCACTAGTCACTAAGACACGGAT 

ATTTTAAGAATTAAAAGCCTCCACAAGCCAGGCACAATGGCTTACACCTATAATCCCACA 

ACTTTGGGAGGCCAAGGTGGGAGGATCACTTGAGCCAACGAGTTCGAGACCAGCCTGGGC 

AACATAGCGAGACCTTGTCTCTACAAAAAAATTTAAAAGTTAGCCAAGCATGGTGGGGCA 

TGTCTATAGTCCTAACTACTTGGG 

Sequence 700 

GCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTCAGTAGAGGCCAGGTTTCACC 

ACGCTGGCCAGGCTGGTCCTGATCCCCCGGCCCCAGGTGATCCGTCCACCTCAGCCTCCC 

AAAGT GCTGG GATTACAGGCGTGAGCCACCGCACCCGGCTCI 1 I I I I I I I I I I I I IAAAA 

TCATGATTTTAACAGAAGCCTCCATTCAAGGCGAGACATGCCTTTTATTTCTCTAATTGC 

GAGACACTTTTCTGAATCCTCTTGTCAGTTGCACCTTTTAATACAATTGAGGTGACACTG 

TTCTTCATG GTGA CACTGGTCTTTCCCAAGAGGTTTCAGCTMTTCAGTCTATCAGATTT 

TACATCAGATTTTMAATTTGCTTCAAACTTGGGTGCTTGTATTCAAATTCATGCTTCAT 

AGAAAAATGCATATCAAGTTCAACAGTTGACTAACTGCAGCCACGTTCACAGTACCTGCC 
CGGCGGCCGCTCTAGAACTA 
Sequence 701 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGAAAAGGCTGATACACAC 

TGACAGATTTTGTAACMGGGACATTTAAAACTGAGCTGGTMTAGACTTGATTTCTGGT 

G TTGCC ACTCAATAGGCATGACTAMTAGTGTACCTCACTGTTCTACTTTTTATAATTAA 

MTTTTAGAGGAAGCT GAGTTCTTGTATTTAACTACAAGTTAGAGACTCAGCCCACAAGC 

I I I I I I I I I I I l I I I I I AATATGGTTTC I I I I I I I I I I I I I GAGACGGAGCCTTGCTNTG 

TCACCCAGGCTGGAGTGTAGTGGCGCGTCTNTGCTCACTGCAATCTCTGCCTTCCCGGTC 

CAAGTGATTCTCCTGCCTCAGCCTCCTGAGTAGCTGGGATTACCGGCGTGCACCACCACG 

CCCAACTMTTTTAGTATTTTTAGTAGAGACGGGGTTTCCCCATGTTGGTCAGGCTGGTC 

TTGAACTCCTGACCTTGTGAACTGCCCACCTTGGCCTCCCAAAAACGCTGGGGTTACAGG 

CGTGAGCAACCATGCCCAGCC I ll I I I I I I I I I I I IATTT 

Sequence 702 

AGGTACGCGGGATATATNTAAATTTAAGAAANCATCCCCGGTAATATGGCTCTTCATAAT 

TCTAAGACTAAGGCTGGNGTAGAAACCTAACCACCTACCTTACAAGNGAAGGGGGCTATA 

CCATGGGGTAAGCCMGTTTGAMTTTATGGGGAATCNTACCCAACTTGGNTTAAGGGGG 

CCCTNGGATTTGGCCTNGGGGGCCAAGNNTTTTCTGTATTTTTTATAAMGGTGATCTTN 

CATNGGTATTCCCTTGGTTTACCTTGGATAAGGGGGGATTACCAATGCCTTCTTAAGGAA 

AAAAATTCACCTTATTTGGGCCTTGGGGGGAAGGTAGGGTNGGGCTTCAATAGCCCTTGG 

TAAATTCTCCCAAGCCACTTTNGGGGAAGGAAGGGCCTGGANNGTTTTGCCGCCCCACTT 

ACCACTTCCCAAGCCCTTGGGGGGTGGAACCAAGAAGTGGGAAGGAACCTCTTGGCCCCT 

CAATATNNAAAAAAAAATNAGAAAGGNAANATTNCACCTATTCTTACCANAACCCCTAAG 

NACCTAA I I I I I AAAAAAATACC AAAA AG A ATTGG CCCTNGTTTNTCAAA AACCACCTAA 

TTTGGGAAATAAMNAANGGGGTGGAAGAATTATTTTCTTTAACCCNNATNGGAATAAAA 

ATNNATNNNATTNGGGGNTCCCTTGGCCCCNGGGCCGGGCCCGCTTCTAAAAACNTAAGN 

GGGGATCC 

Sequence 703 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGNCGGGGCAGGTACTACTGACTTACCTGCC 

ATGGGCTTTCTCAAGACAAGTCCTGCAGGAGAGGCACACTCACTTCTAGCGTCAACTATT 

GAACCAGCCACTGCCTTCACTCCTCATCTCTCAGCAGCAGTGGTCACTGGATCCAGTGCT 

ACATCAGAAGCCAGTCTTCTCACTACGAGTGAAAGCAAAGCCATTCATTCTTNACCACAG 

ACCCCAACTACACCCACCTCTGGAGCAAACTGGGAAACTTCAGCTACTCCTGAGAGCCTT 

TTGGTAGTCACTGAGACTTCAGACACAACACTTACCTNAAAGATTTTGGTCACAGATACC 

ATCTTGTTTTCAACTGTGTCCACNCCACCTTCTAAATTTCCAAGTACCT 

Sequence 704 

CCGCGGTGGCGGCCGAGGTACTGTGAAAGAACTAGCACTTTGAGCAGAGAACAAATGCCT 

TACTTGAGTTTCCCCTGGACTCTATCCCTATTCAAAGATGCTTGGTTATACCTCAAGAGG 

GAAGCAATCCAGACCAACTCCTATGACATGACAGGCACTCAAGCCTGATGAGGCAGAAAC 

CTGGCAGCTGTAGATGTTGGAAAGGATAATTTATGTGTTCAAGTGACTAGGATTCTAAGG 

GCTAGAT GCTAG CTTCAAGCACGGCTGGATCTAGGMGCCCTTTTGCTCTCCCTTTTCCT 

TGGTCTACTTTTCTCTGTAGGCAAGTTCATTCATTCTTAGGCAAGTTCTCTGCATGTGGC 

AGCAATGATGGACACTGGAATCTCTGGGTATTCAGAGTTCTTTCAGTAGCAG 

Sequence 705 

CGGGCGGGTACCTTACCACCCCATCCCCAGAGCATTGCATGGGGTGTTTGGCACACAGTA 

GGTGCTCAATGTAAACGTGTGCACTGTGGCATGTTAGAGCCAGACAGGATCTCATCCAGC 

CCGTTCTCTGCACCCCTCCCTCCCCTCTCCAAGTAGCCCTGCTGTGGGTTCAAGTAAAGA 

GGGGCTGGGGCGCTGGTCTGATTGTGTGGGTGATTTGGGGAGATCTCTTCCTCTTCCGGA 

ACCCCAAANGGTTGGGACAAACACAGCAACAAGCCCAGCTCCCTGAATTTCAGTGATTCA 

TTTGTGGGATAAAGGAGTGAATGATAAAGTGAAGGACGACTGTCCCCGCGTACCT 

Sequence 706 

NGGTTAANTGCCGCCNCTTGGCCGTMTCATTGGGNCATTAAGCCTGGTTTTCCCTNGTG 
TGMMAATTTGTTATCCCGCTCCACAATTTCNCACACCAAACATTACCGAAACCCGGGA 

AGNCAATAAAAAANTGGTAAAAAGCCCTTGGGGGGGTGGCCCTTAAATGAAGGTGGAAGC 

CTMACCTCAACAATTTMATNTTGGCGGTTGCGGCTCAACTTGGCCCCCGCCTTTTTNC 

CAAGANGCGGGGGAAAAA 

Sequence 707 

GGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGAACATCCATGGCAGACGCTATT 

CTTTCCCTCTTAGAGATGCAGACACTGAGGCTCAGAGAAGTTGTCCCGCACCCAGTATGT 

GATGGAGAGGTAGAGGGTAAAAACATCAACTGAAGGATTTAGCATTTGGGGAAGAAGGAA 

GAAGCCCAAAATGGAGTAGATCAAAGGCTCCCCCGTGAACAAATTTAAAATTAAGGAGAA 

AGAAGCAGAATTCAGTCTTCTCCACACCCATAACCAAACAGCTCCTATGAAGGCACCAAG 

CCTGACGCTCATCCCAATAAAAAGGAACGATCTGGAGAGAGGGGCAGCCGCTGGTGACAA 

GAGAACCCCCCAGGCAGCCTCGTCATCTGGCCAG 

Sequence 708 

GGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTTGGGGGACAGATATCATAT 

AGGGGATTCATGTCAGTGACCAAACGAAGTGACCATTACAGCCCTTTNGAAACCTGAGGT 

GTAAI I I I I AAAAATGAACTCATGACTTTAATAGTCATAGACTCAAACCTGAGTTGATTA 

TTATGAATTAGTTTATGGGAGTCTCAATATGTGAATATGATGGAGACAAGTTTTGGAATA 

CAGATAAATCAAGTCACTGTATTCACTCTCTCTCTCTCTCTTTGAATAGCCTTATCTTTG 

CCTATACACACAAACAGTGCAGCCATCAMATTTTCAATTTACAAAATGTTCACAGTCAT 

GCTTCTTCCTTGACTAAACACTGGGGTTGCTGCCAGTGGTAATTGGCTTGAAACCAGCTA 

ATTTTTATATATCTATTTAGTCTGGATATTCTAGATGAGTGGGCACTATAGT 

Sequence 709 

ATTGGAGCTCCCCGCGGTGGCGGCCGCCGGGCAGGTACCCACGTTTTGCTCCCACTCCTT 

GACCGCAGGGGCTCGGACACAAACCCCTGTCACCAGGAGAGTCAGTCAGCACTACTTGGG 

AG GGCTA AAGGGAMTTTGGAAATAAAATTCCAAAGTTTGGAGTAAAAAAATTCAAGTGT 

TGATTTTATATTCTTTCCCTTTCTGACACAGCCTAMGCGTAGGGGGMCATGTGTTTAT 

CTGTGGGAGATAAACAAGATGGAGTCCCAAAGACTTTAACAAAATATTTTTTTAAAAATC 

CACTAGAATAGAAMTACATTATTTAGATATACTTTATGCTGAGAGTGAGTATATATGCT 

TGTCCTATTTAAACTTGTGAGAAAAAGTGGTATCCCTTGATACATTTAGAAATATGGGGG 

CTATCTTGGTTCATTGNGGGGGGTGGGGGCAGAAGGAGAATAAATGCAGGATGCCCTTGT 

TGAAAGGAATCTTAGCATGGCCCACAGGGGACGTTTCCAGTCGATTACCAAGGAATGCCA 

GCCT 

Sequence 710 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCGGGCAGGTACCCACGTTTTGCTCC 

ACACTCCTTGACCGCAGGGGCTCGGACACAAACCCCTGTCACCAGGAGAGTCAGTCAGCA 

CTACTTGGGAG GGCT AAAGGGAAATTTGGAAATMAATTCCAAAGTTTGGAGTAAAAAAA 

TTCAAGTGTTGATTTTATATTCTTTCCCTTTCTGACACAGCCTAAAGCGTAGGGGGAACA 

TGTGTTTATCTGTGGGAGATMACAAGATGGAGTCCCMAGACTTTMCAAAATATTTTT 

TTAAMATCCACTAGMTAGAMATACATTATTTAGATATACTTTATGCTGAGAGTGAGT 

ATATATGCTTGTCCTATTTAMCTTGTGAGAAAAAGTGGTATCCCTTGATACATTTAGAA 

ATATGGGGGCTATCTTGTTTCATTGTGGGGGTGGGGCAGAAGG 

Sequence 711 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGACGGTGAGCCGGAG 

GAGTCATGTCAGAGGGGCGAGCAGGAGCGATTCCGTCGCCAAACAGGTTATGAGTGCCAG 

TGAGCCGCCTTAGATAGAAGCATCGTCAGCACTTTATTAATGATGGATAGNGAGAATAAA 

CCCGAAAATGACGAGGATGAAAAGATAAACAAAGAAGCACAAGACTTGACAAAGCTTTCA 

TCCCATAATGAAGACGGTGGGCCTGTATCTGATGTGATAGCAAGTTTCCCTGAGAATTCT 

ATGGGCMAAGAGGTTTTTCAGAATCATCGAACTCTGATAGTGTTGTTATAGGAGAAGA 

Sequence 712 

NCCCGCNGTGGCGGCCGAGGTACTCTTATGAGAGGAACATTAAAATTTGCAATTATAATG 
CAAAGAAACAGGAGACGATCGTGAGAATAAGCAATGTCACACACATTTCTCTCCAAACTA 
TATGTATCTTGTCCTTTAAATTCTTTGACGTGTGTGTGGGGCCTGGGTGGGGGGTGTGGG 
TGTGTGGTGTTCTMGGGCCTTTTCTACTAATTCCAATAACTGGGTTATCTGTGAGGCTG 
CTTATTCTCACTGTATTTTACTGCTTCTTTGCCTTTCTTG I I I 1 1 I I 1 1 1 1 AACCATACT 
CAGGTATGGTTAAAATGTAAATGAAAGAA 
Sequence 713 

CCGCGGTGGCGGCCGCCCGAGCAGGTACTGGGGCTGCACAGGCTGTGGCAACACTGGCTA 

GTCAAAGCCTGGAAAGACCTGTGAGCTTGAACTCTTCTGGGTTGCCATTGTTCTGTTGGT 

TTCAGCAGTGAGACTGAGAGAGCCTGTTCTGTTTAGAAMGCCACAGTGGTTTTCTAGGT 

AAAGTCTGCAGGAGATGTCACTTGGTGCCTTTTCMTACGAGTTTTCCACCTGCAI I I I I 

GGAACCATTATGGGCCTTTTAAAAATTTATTAATAAGTCTCTTAAATATTTTATAATCTA 

GCTTCTGAGACAAGATGATTTTAAAACAGTTATATGCTCTAAATTAAAAATTTA 

Sequence 714 

AGGTACCGTGTGAGCAGGTGGCGTTCACCAGGGGTGAGACTTTATTGACAGTAAGTTGCC 

TCTGCCAAAAAACGCCCTCATATGTCTGCTGATGTTTGAATTNCNNCCNNATGGCAGGAG 

GTTT TTTG GTCCTCCCCAGNTTTAAAMAAMTTGGTTAAAAATMCCTGGGTTTGGTTN 

TTTCTTTTGNTATGGGGAAGGCCCTTCCAAGNAAGNGGAAAATAAANAAAAAAGTAATTA 

NANC CTTAA CCTT CCTTMG GGGGAATTMTTGGTMTTTCAAAATTTTTTGGAATGGCC 

TTANCTTTTNTTAA I I I I II I I AMTTTTTTAAATTTTGGGANGGAAACCAAGGGGGGGG 

TTTCCTTNTGGGCCTTTCTTGGTTTGGCCCCCAGGGGGCNTGGGGNANGGTTGGCCAAGG 

NCCMTNTTTNNGCCAAMTTTCCAMCCMGGGCCCTTTCNAACCTTTGGGCCAAGNTC 

CCCCTTTCAAAAACCCCCTTTCNNCCTTGNtTTGGGCCCCCCAAAA 

Sequence 715 

CCGCGGTGGCGGCCGAGGTACCGTTTTATGATGATAACATAACTTTAATGCTCCAACCTG 

AGAAAGATAAAATAGACTAAGATGACCATTGAATGCAAACAGAAAGTTCTAAATGAACAA 

TCAAGNCAGGACCTGGAAATTTCAGGTCCCTGGTGGTTGGAAAANTAAATTAMTTAAAA 

ACCMNTTTCTTGGTTTTTCCMGGAAAAANTGGNTAAAAAAAATTMGGTNTTTAAAAT 

MNCCCCAGGGAAAMTTTCCAAATTCAAAATTATTAAAMGGCCTTAAATTAAATTAAT 

ATTTTTTGGCATTTCNMGGCCCCAMCCTTAAAATTGGCCTTANCCAMATNGGTTT 

GGGTAATTAAACCAGGGCCAAATTTNATTAAAAAAATTCC 

Sequence 716 

AGGTACACGATTATTTCACCATCCAGGTATTAAGCCTAGCACCCAAGAG I II I I I I I II G 
CTTCTCTCCTTCCTCCCTCCACCCTCAAGTAAATCCCAGTGTCTGTTGCCNCCCNTNCTT 
CGGTANNAACCAAGGTGG I I I I I I I I I AAATTTTTCCACCAAAAAAAAATCTTCNCATNC 
MCCTTCCTTCTTTTCMTTTrTTGGCCTTNNCCTTAAA^ 

TNCTGGGTTNTGGCTTCCCCCAGTAAAAGNACCAMGGAAANTTTTAMNTTTCAATTTT 

TTTMTTCCCCGGGTTTTTTAAAMAGGGTTNAMAGGGAMGGNTTTCCTTAACCCCCA 

A GTGCCC MGGGGAAAGNTTGGGGTTGGGGGGCCTTCCAACCNAATTNCCTTAANTTAAA 

MTTTTTGCCCCNMGGGCCAAACCCTTTNNTTTGGGAAGNGAAAGGGGGGCCCCCCGGA 

MGGGGGCCCCGNAAGGGTTTGGGGGAMNTTCCAAACCNMTTTGNTMAGGGGGTTTT 

TGGGGGGAAAAAG 

Sequence 717 

CCGCGGTGGCGGCCGAGGTACTACAATAAGGACAAATATTCAAAACATTCTGTTAAGTAA 
AATAAGACAGTCAAAAAGGAAAGCTGTATAATTACACTCATGTAAAAATATTTAGTCCAA 
CNCTCACAGGANAACCAAAGGTGGTCAATAGGTTCCTCAAGCCAGGTGGCCACCCCAAAG 
GAATGGTTAAACCAAGGTTCTTCCTTCNGTTAAGGTTCCTGGAAGGAATTAAAACCAATT 
CCCCAAGGAGGTTTNCTTTTTGGTTTTCCTMCCCT^ 

GGGAGGTGGTTAAMANCMCCMAAMGGGTTTGNAAAGGGNTTTTGGGGGAAGGNTTT 

GGGAAAAAAANGNTTTTTAAAGGNA 

Sequence 718 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGACTGTCTCAAATTCTT 
GGATAGCTGTCCCTCATGTCACCTAGCTGCTGAGAGCTTTGTGATCCTAACAGGTGATGA 
CTCAGACCGACACTGCATTGGTAGGAATTCCACAAATAGGTGCCTCAATGTGCCTAGATT

GAAATATCAGCCTTTCCCAGACTGACCTGATGGGTTGACTTCAGGTGTGGTGTAAACACC 

TACATTTTAATGTAMCATTTCAGTGTMTCAATGAGAACTATCATTCTGCTTTAATCAC 

CATGAGTrCTGAMTMCAAAGGATTTGTCTGACATTCATTCTAAGAAATTCATTCTTAC 

CTGACTAAGAMCTTTTTTAACCCGGCACAATAATAAAGAAATGACCTGTNAGTACCTGC 

CCGGGCGGCCGCTCTAGAACTAG 

Sequence 719 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGACTGTCTCAAATTCTTGG 

ATAGCTGTCCCTCATGTCACCTAGCTGCTGAGAGCTTTGTGATCCTAACAGGNGATGACT 

CAGACCGACNCTGCATTGGTAGGAATTCCACAAATAGGTGCCTCAATGTGCCTAGATTGA 

AATATCAGCCTTTCCCANACTGACCTGATGGGNTGACTTCAGGTGTGGNGTAAACACCTA

CATTTTAATGTAAACATTTCAGNGNMTCMTGAGMCTATCATTCTGCTTTAATCACCA 

TGAGTTCTGAAATMCAAANGATTTGTCTGACATTCATTCTAAGAAATTCATTCTTACCT 

GACTAAGAAACTTTTTTAACCCGGNACAATAATAANAAATGACCTGTAAGTACCTGCCCG 

Sequence 720 

CCGGGCAGGTACCGCTGTGTCCGGGTGGGTGGTCAGAATGCTGTGCTCCAGGTGTTCACA 

GCTGCTTCGTGGAAGACCATGTGCTCCGATGACTGGAAGGGTCACTGCGCAAATGTTGCC 

TGTGCCCAACTGGGTTTCCCAAGCTATGTGAGTTCAGATAACCTCAGAGTGAGCTCGCTG 

GAGGGGCAGTTCCGGGAGGAGTTTGTGTCCATCGATCACCTGTTGCCAGATGACAAGGTG 

ACTGCATTACACCACTCAGTATATGTGAGGGAGGGGATGTGCCTCTGGCCACGTGGTTAC 

CTTGCAGTGCACAGCCTGTGGTCATAGA 

Sequence 721 

CCGGGCAGGTACCGCTGTGTCCGGGTGGGTGGTCAGAATGCTGTGCTCCAGGTGTTCACA 

GNTGCTTNGTGGAAGACCATGTGCTCCGATGACTGGAAGGGTCACTGCGCAAATGTTGCC 

TGTGCCCAACTGGGTTTCCCAAGCTATGTGAGTTCAGATAACCTCAAAGTGAGCTCGCTG 

GAGGGGCAGTTCCGGGAGGAGTTTGTGTCCATCGATCACCTCTTGCCAGATGACAAGGTG 

ACTGCATTACACCACTCAGTATATGTGAGGGAGGGGATGTGCCTCTGGCCACGTGGTTAC 

CTTGCAGTGCACAGCCTGTGGTCATAGAAGGGGCTACAGCTCACGCATCGTGGGTGGAAA 

CATGTCCTTGCTCTCCAATGGCCCTGGCAGGCCAACCTTTAGTTTCAGGGCTACCACCTG 

TGCGGGGGCTTNTGTCATTACCCCCTGTGGATATTAATGCTGCACACTGNGGTTATGACT 

TGTACCTTCGGCCGTT 

Sequence 722 

GGAGAGGAAATGTGTAGGGGTGAGGGATGATACAAGAAAGCCAAATCCTCATCTTCTATA 

GTAGAGAGTCAGCGGATAAAACCTAAAAACAATACATCAAGAAATACTTACACTTATGGA 

AGGAAATACCAGAAGTTAAAAGGGGTTACTTCTGGGACATCAGACACCAGACTGCAGGGA 

AGGGC TGCC TCTTGTATTAACAAGCTTCCAGTATAATTTGC I I I I I AAAAATAGGTCCAT 

GCATTATTTTAATAAAAATTANGCTGGGCGTGGTGGCTCAGGCCTGTAATCCCANCACTT 

TGGGAG 

Sequence 723 

ATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTAAGCAAGCTGTGTGACCTA

GAGCACAGTGCTTTGAGTTTTTGAGTCTTAGCTTCTGTGTCTATAAAATGGGGTTCACAC 

MCTCACCTTACAGGGCTGTAAGATTAGATTACACAGAAAATATATTTTTTGGCTGTGGG 

GGCTGGAAGTGTTGCTGATTAGCATTTGAAATCCCATCCTGTGGGTGAGAAAACCCCACC 

TTATGACTTGGTGGGAAACAAAGCCAACCTCCCACTGATGAAGCTGAAAGTAGCAGAACC 

TTGCTTCTACTGCCTCCCTTGCAGCTAGAGGCAGGCACAGGACTAGCCTGTCAATTGGAT 

GCAAATGCTCCAGGCCTTGAATCACAACCTGGTGACTTGCACCCAAGTCTATTA 

Sequence 724 

GGGGCCATTGAGACTGCCATGGAAGACTTGAAAGGTCACGTAGCTGAGACTTCTGGAGAG 

ACCATTCAAGGCTTCTGGCTCTTGACAAAGATAGACCACTGGAACAATGAGAAGGAGAGA 

ATTCTACTGGTCACAGACAAGACTCTCTTGATCTGCAAATACCGACTTNATCATGCTGAG 
TTGTGTGCAGCTGCAGCGGATTCCTCTGAGACGCTTGTCTATCGCATCTGCCNGGGCAAG
TNTCACCTTCCCTGGGATGTNCCCTTGGACAAGANACAAGGGAGGAANGGCCTTAANGAT 
CCTANCTGGGGGGGGA 
Sequence 725 

TAGGGNGAATTGGAGCTCCCCGCGGTGGCGGCGCCCGGGCCGGTACCCATAAAAATTAAA

AACTATTTTAAAAAATAAATTCCATTTGAGCCACTCCTTCAAACCACCCAGAGTGGGTAG 

ACGTCTTTCGTGCCTCTAAGAAGCCCCATCTCTATTCTGCGTCTCACCTTGCAGGGCTGC 

TCATCTGAATCCTGAAGATGGTGGACACCCATCTGCTAGGACTGAAATGAATAGGACAGA 

GGGAGGTGCAGAGTGAATGGACCATACTACCTGTCATCTTGGCAACGTGTGATTGAATAA 

MCAACTTCTTTAGMGTTTGATAGAGTGATTTGATMTGTMTTTACAAGTGATCATTT 

CHINA 

Sequence 726 

GGAGCTCCCCGCGGTGGCTTTTTGAGTCTGGACAGGNCTCTGTTTTTGCTTTAAAGTTAA 

GAGAGCTAAATAAATGATGGTAAAAAGATAATAAAATAGAACATGAAGGGCTGTCAGTCA 

GTGTAGGTATTTCCATCCCCTCACTTTTCAAGTGAGGTCACGGAGGCTCAGAGCGATAAG 

GAGACTTGTCCAAGGCCACACACCGGCTGGTGCTCAAGCCGGGACTTGAACCCACGCAGT 

CTGACTCTAGAGCCCAAGCTCCTAACTATGACATCCTATTTGATACACTGTTTTACTGGA 

GMACAGATCATTTGACAGACATTCTTTCTGTTAGCMTTTGACAACTCTTTCCCCAGTT 

GTCTGTACCTGCCCG 

Sequence 727 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGCTA 

TTCTCTTACAGCTAGGACCCAGCGTTCTGGGCAGAATAGCAGGCAGCGCAGTAGCTACTG 

AGATTATAAGTGGTAGGTTTATGGAGCTGTGACCACAACTCTCCACAAGCCAGTGCTGTC 

TCATGCAAGCACTCTCAGTGTCCAAGTGCTGAGTGTGTGAGTGGTCTGGGCTTTGCAGGG 

TCGGCCAAGCTCTTGGAAGAGCAGGCTTTGTTAGCTGGGGAGTCATCGCTCCATGCAGGC 

CCTGAGAATGGAGCATCCTGAGTGGACTGGTAGAGATGGGGCATGGGTCACTCTGAGGGT 

TTGAGCTACTTCTGCTATTTTTGAAATTCTGGTTTGMCTGCAGGATCGTGCTGAGTTTG 

GCACAGACTMTTTCTCTGTTGGCAGCACATGATGTATCAACTCATGTGTCAGTTGGTTT 

G 

Sequence 728 

CCGGGCAGGTACTACCTTCTCTGCTACAAGTCGAGCGAGGAGCCCCGCATGAGCCCTGAC 

ACCTGTGCCACCATTTTGGAAAAAGCTGGTCTCGATAACTGGGCTCTTGGAAAAACAAAA 

GTGTTCCTTAAGTATTATCACGTGGAGCAGTTAAATCTAATGCGAAAGGAAGCTATTGAC

MGCTTATTTTGATTCAAGCTTGTGTCAANAGCATTCTTGTGTTCAAGGAAGGATACCAA 

AAAATACAGGGAGGAAAAGGGAAAGGAAAGCCGCTTATAATAATACCAGTCAGCTTGCAA 

GGAGGGACCACCCTTGTCAGGGAAAACMNAAGAAAAGGAMATTTGGTTTGGACCATTG 

MMAACCCCAGCANTTMCCAACCCAATTTCAAAMCTTTCTTGATTCAGGGAAATTTC 

GACTTACCAAGAAAAAACCTTTGGNAAAAATACCCANGGGGGGNTCCTGGGNAAAGGGNA 

GGANGGGAGCCCAAAAAAANAATTTGGANAACCCCCCNANGACCGACCCCCCNGGAAACC 

CCCA 

Sequence 729 

CCGCCCGGGCAGGTACTTTCI I II I I I I I II 1 1 I I I 1 1 I I I I I ACGGAGTCTTGCTCTGTC

ACCCAGGCTGGAGTGGAATGGTGTGATCTCGGCTCACTGTAACCTTCGCCTCCCAGGTTC 

ACGTGATTCTCCTGCCTCAGCCTCCGGAGTAGCTGGGATTACAGGTGCACACCACCATGC

CTGGCTAATTTTTTGTATTTTTAGTAGAGACGGGGTTTCACGATGTTGGCCAGGCTGGTC 

TTGAACTCCTGACCTCAAGTGATCTACCCACCTTGGCCTCCCAAAGTGCTGGGATTATAG 

GCATGAGCCACCACGCCAGGCCCACTCTNTAAATTTTGACCACCCTGCCTTGAGTGGTCT 

TCTAGCACCCTAACCTCTGTCTAACCTTCGAGAGCTTTGCACTAGCNATTCCTGGGGACC 

AGCTATGGTTGGTATCTTCTCAACTTTCTAA I I I I I I I AAAAATATTATTATTATTATTA 

TTATTTTAAAA 

Sequence 730 
TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGGCCCGGGCAGGTACCTGTGTGGAA

AAGMTGCTTGCAMGCTTGTCACCCTCACGAGAATTCCTCTGACAGACATTTGCCTTTG 

ACAGTGAAAACAGATATTAAAGTGAAAGGAGAAGAAACCGAAGAGCATCAGAGGGGACGA 

CTGGGTTACTTAACTGTTGGGGAGCAATCTGAGGAGTTGGTTACCAGAGAAACTGGCGAT 

GGCGATCCCGTGAGCMCATCTCTCAGACCCATTTTAAATGCCGGGGGATACTTAATCAT 

GCTGAAAAACAGCAGAGCCCTGAGGTTTTGGACTACATGTTGCAGAAAGAA 

Sequence 731 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACTACACCTGGGACAAA 

TACTTTTTTGTGMGNCAGGTAAAGCCTTTGCGTGCAATATAGCATCTCTATGCAATGCA 

NCAACTCCTCGTCTATCGCTACAGTAAGAAAACAGCCACGGGTCAGGTGTTGNGGCTCAC 

ACCTGTMTCCCAGCACTTTGGGAGGTCNAGGTGGGTGGATCACTTGAGGCTAGGATTTT 

GAGACCAGCCTGACCAACATGGAGAAACCCCATCTNTACTGAAAATACAAAATTCCCGGG 

TGTGGTGGCNGCATGCCTGTAATCTCAGCTACTCGGGAGGCTGAGGCAAAAGAATTGCTT 

GAATCTGGNAGGCGGNCGTTTGNNGGTGAGCCAAAAATCGTGCCATTGCACTCCAGCCTG 

GGCAACAAGAGCGAACTTTCGTTTCAAAAAA 

Sequence 732 

CTATAGGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCACGGTACTAGTTATTT 

TAAATTCCACTCATAACTTATCGGCCAAAAGTAGTCACATGGCTCCACCTAATCACAAGT 

GGAGCGGGAAGTGCAATCCTACCTTGCCTGGGGAAGGTATAGAGATAGACCAGCACTAAT 

GACTACCACACTTCGCTAAGGTCACATAATAAATAAGCATCAGACATCAGGTGTGGTGGC 

TCATGTCTATAATCCCAGCACTTTGGGAGGCTGAGGCGGGCAGATCACTTGACTACAGGA 

GTTGGAGATCAGCCCGGACAACATAGTGAAACACGTCTCTACTAAAAACACACGCAAAAA 

AATACGAGGCATGGTGGTGCATGCCTGTAATCCCAGTTACCTGAGAGGCTGAGGCACGAG 

AATCACCCTTGAACCCAGGAGGCAGAGGTTTGCAGTGACCCGATATTATGTCACTGCAAG 

TNCAGCCTGGGTGACAGAGCGAGACCTTGTCTNAAAAAAAAAAAAAAAAAAGAAAA 

Sequence 733 

CGACCACTATAGGGCGAATNGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCACGGTACTGC 

TGGT TCM GCCATTAMTTACATCACMAGGTTTGGTTTCTCTGTATATATTTCTCTGGG 

GCACTTTTGCTANGTTGGCTCTATCCTGAGGCAGNCTCTCTCCTCGTGGNAACCAGGTGG 

CTCTAGCAGCCTCAGCTTTATATCTCTCAAGAGTAAGTCCACCGTCACAGAGC 

Sequence 734 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACACACCACCACA 

CCTGGCTAATTAATTTAAAAAAAA I I I I I I I I AGAGATGGGATATCTNTATGTTGCCCAG 

ACTGGTCTCAAACTCCTGGCCTCCAGCAATCCTCCCACCTCACCTCCCAAAGCCCTGGGA 

CTGCAAGCATGAGCCACCATGCCCAGCTATATTTTCTGTAAATTGCTAATGANAATGAAA 

CATGTATGCTGTGGACAGMGCCTTGTTGGACCTAGAGCCCATGCTGGGTCCTTTGCCTT 

MTMMCATMCTCTGGCATTACATATATMTTMCAGCCTCMAGANCATGTTTCTTT 

ATTAMCTCTGACTGTTTCAGCATTATTTT 

Sequence 735 

GCGAATTGGAGCTCCCCGCGGGGGCGGCCGCCCGGGCAGGTACTACTGTGTCCTTTAGAT 

CACTCTGCCTTGATCACTCTGTCCCGTCACTCTGCTATTTCACCTGNCAGNGAAATACCT 

GGTATCGTCCTGCCAACGTGAAGCATTGAATGCTTNATACGTCTCCATCCTGATTGTTTA 

GGCTTTGAATGCTGAGAAGTATCTGCACTTTGTTGGTCA 

Sequence 736 

CCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAAGTACCCTTAAGCCCTGCTCCT 

TTGTAMGTCTTTTTGGATTGTCATCATCAAGAGTCAGTNGATCTCCANCTTCTCAGAAC 

TCACAGGGCACTCTGTCTAGGCATTGCTGACCGTCTGCAGTGTCAGATGGTGACTTCTGT 

ATGTGTTGTGTTTCCCGTTAGACTCTAAGGTTTTTAAAGGCGAGACTCACTCCTGCAGAA 

GCACATAACACMTGCCAMCTCTTATTTACGGAGGTCCTGGCGCATTGTCAGCTTTTGG 

TAAATGCTTTTCTTTTGTTGAATACTTATCTTCTGTGTGCCMGATTTGTGTTAAGTGCT 

AGAAAAATGTGGGAGGTCACCGCAGACCCTGTTCTCATGGAACTGATGGTGTGTAGTGGG 
GTGNGGATTAACATAAATAAAATGATGCGCAAATGAACACAAAATTCAAATTGATGATGT

GTACCTGCCC 

Sequence 737 

TNTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTCCCTGAGCAGTCGAAG 

TGGATGCCCAGACCAATGGCCAGTGCTAATATCAATGCAANGATCCCAATGACGATGATT 

GGAAAAAACTTCAATGGCAGCAGTGACAGGATCTGTGCAGCAACAGCATCTGCATCTGGT

GCAACAGGACTTATTTTCAAATCATCAAGGCCAAAAAGCGATCGGAATGAGAAGGGGGCT 

TCMCAGCAGGCGGATCATTTTCCCCCATGGTGACTATTTCAGGACCTCTGACATCCGGC 

TCCGCCTCCACCTCTACCTCATAATTCCCGAGTCCCAAAAATGTAGATGGCACCACGGAA 

GAGATAGTAGGCCACAGTGTTACTGGCTTCCCATAAACACAGCCCTTTCCTGGCTCACAC 

GGGGCATGACCTCCCGCGTACCT 

Sequence 738 

AGCTCCCCGCGGTGGCGGCCGAGGTACATGTAGTTGGATGTCGAGGTTNGATTAGATTCT 
GGGGTTGGTTTGCTTGTTTTGGTGGATNGTTT^GAGTCGACTTTACAGAGGGTTGTTTA 
TCCACCAG AAGGCACATGTGCTTGCCTGTGTC I I I I I I GTTATTGTTTTGAGGCAGAGCC 
TCNCTCTGTCTTCCAGGCTGGAATGTAGTGGCACAATCTTGGCTCACTGCAACCTCCACC 
TCCCAGGTTCAAGTGATTCTCCTGCCTCAGCCTNCCAAGTAGCTGGGATTACAGGTGTGT 
GCCACCATGCCCAGCTMTTTTTGTATTTTCAGTANANATNGGG MIN I GCC 
Sequence 739 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCGCGGGCAGGTACACTTCCACGA 

GAAGAATTAATATTGTAGTGTTAGGAMACTAGCMTTTAACTAAACAGCATCAAGTTAC 

MACCAGGAMGTGATTTAAAACTAAATGCTGGCTTATCTTTCTGAAACAAAGCATCTAA 

ATTTGACAGTCCAAAATGGCACTTATTGAGTGTCCGTGACAATACATGCTGACAAGCAGC 

ACACCTCTTTTTTGTTTTTTTAAGACGGGATCTTGTGCTGTCACCCANGCTG 

Sequence 740 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCTATATAAAAATTGATTTAGCTTCTACACTCA 

AGTMTTATAAACAGGTTTNTCTTTTGGGACATTTGACAGTTATGTGAAAGGTGAGTCTT 

CGTTGTGTAGTATTGTCTGTTACACTGCAGGTGTCTAGAATTGCTGATAGTGTTCTCCCT 

CTAAAGTAATGTCACCCAACCACTTGTAAATTGACGATAATAAGACAGGAAATCAAGAAC 

CAATATAAATAAGCAMCATTTGAAAATAAGAGCTAAAAATCAAAAATAATCTCTCTTTT 

TGCTG ATAAT ACTTTATACCTAAATAACCTAAG A I I I I I I I I I I I I I I I I I GAG AC AG AG 

TCTTGGCTCTGTCACCCAGGCTGGAGTGCAGTGGTGCAATCCCGGCTCACTGGAACCTCC 

GCCTCCTGGGTTCAAGCGATTCTCCTGCCTCAGCCTCCTGAGAAGCTGGGATTACAGGCA 

TGCCCACCGTGCCTGGCTATT 

Sequence 741 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTACAGTATAAATCATGCT 

CCTGCTGTCTAGAGCTTACCACCCAACGAGGGCTTCAGATAAGATCAGCAACTGCCCTAG 

AGTGTGGAACTCCTATGACAAGGTGAGCCTGGGGTGTGATGGAAACACGGCGTGCATGGT 

TACCAAGCCACGCTTCCAGGGAAAGGGGTCCGTGCGGGAAAAACTTCAGAGAGGAAATGA 

CATGTCAGTCAATAACCTGAAAGAACTGGNTGAGAGTTAAGCANCAGGGAACAAGGGCAC 

AGTNNTCCACACAGCTTTTTGGAAAGATCATGTTGNTTATAGTGCAAAAAAATACTGAAT 

ATGGGMACMTTTGTTATTATT7TTTAGGAGTNTTGCTTTGTCCCCCAGGCTGGAGTGC 

A 

Sequence 742 

ACTACTATTGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTGGGGGGAAAA 

AGAGTTACAATTTGCCCATAAAGAATGGAGAGAACAGAAATGTANCTTTTATGCTGAAAA 

ACAAAATGCAAGGGCAATCCAGTTTCTAATTCCTGTGCCAAAGCTGCTGTTCTTGATGAC 

CTCGGTCAAATCATTTAMTTCTCTCAATTTGTTCATATAAAAGTGCTATTAACCTGCAG 

TTCCTTCAAATACTATCCAATCAATGTTGGCTACTTGATTTTCA 

Sequence 743 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTATTCACAGGGTATGCATAAA 
CCAMTATACAAGMTTTMTGACAGCATTATTTGCAAACAGTGAAAGATTCTGAGCAAT

CAAAAGGTTCATTACTACAGGCATAGGTCTATAAATTACTACTGTATGGAATACTATGCA 

GCCATTAAAATAATGAGGAAGAGGGAGAGTGCCCTGTATGCACTGACATGGAAAGGTTTC 

AGTATATGGTAAMAGCAGCTCATCTTAGMCAGATTTTATAGTATGACCCCACTTGTGT 

GGAMTATTTTGTGATGTGCCACTCAGTGTATACGTATTCATGAGTGCATATACAAGTGT 

GTGAGAAGCAATGTAAGTAACTGTTTCACAAGGACCCCCCCTTTAAGAAGGCAGAGGGGA 

TCGGGGGATATAGAGTGMGGGATGATTT1TGCTTTTTCCTCTA 

Sequence 744 

CCGCGGTGGCGGCCGAGGTACGCGGGAGGAAAGGGCTGTGTTTATGGGAAGCCAGTAACA 

CTGTGGCCTACTATCTCTTCCGTGGTGCCATCTACATTTTTGGGACTCGGGAATTATGAG 

GTAGAGGTGGAGGCGGAGCCGGATGTNAGAGGTCCTGAAATAGTCACCATGGGGGAAAAT 

GATCCGCCTGCTGTTGMGCCCCCTTCTCATTCCCGATCGCTTTTTGGCCTTGATGATTT 

GAAA 

Sequence 745 

CCGGGCAGGTACACAGTAAGTGAAGGGCCAAGACTGACGGCTGATAGGACAGGGGTGACC 

AGNGGTGGGGAGGGTAGTGGGAGCAGTCCATCCTGGAATCTGGCATTCAAGGGGCGCATT 

GTCTGTGGGAGGATTTAAMATMTAAAACCAACTAAAGGCAGTCTGCTTTTTATGGTCA 

CCAGGCCGCCAGCAATTCTAAATTTCAGTGATAAAATATTCCTCCTCACTGGACACGAGA 

AGCTGGCTTTCTCCTTATTCCCCAGTACCTTNGGCCCGCTTCTAGAAACTAGGTGGGATC 

CCCCCCGGGGCTGCAGGGAATTTCCGATATTCAAAGCCTTATCCGAATACCCGTCGACCC 

TTTNGANGGGGGGG 

Sequence 746 

CGGTAATACTNGTTATCCACAGCAATCAGGGGGGATTAATCGCAGCNAAAGAACATTGTT 

NAGCAAAAAGGGCCAGTCAACAAGGGCCAGGAACTCGTAAATAAAGGCCCGCGGTTGCTN 

GGNCGTTTTNTCCCATAAGGGCTTCCGCCCCCCCTTGGACGAGGCNTCACCAAAAAATTC 

GACCGCTCAAAGTCANGAAGGTGGCGGAAACNCCGACAGGGACCTATNAAAGGATACCAA 

GGCCGTTTTCCCCCTTGGGAAGGCTCCCCTNNTTGCCGCTCTCCTGTTCCNGACCCTTGC 

TCGCTTACCGGATACCTGTCCCGCCTTTTCTCCCTTTCGG 

Sequence 747 

CCGCGGNGGCGGCCGAGGTACATCTTTGGTGACTTTTCATTCACATTTCATGGATMTTT 

GGGGAGGTGGCCTGCCANCCCTGAAGCCCTACATCCCCATACACACTCTGTGCACATCCA 

GTGCCCTGCTCCACCATGGCAGTGCCCGCAAGGGGGTCCCAGATGAGAAGAAGCTGGCTA 

AAGGGCCCTTGTCCCCTCTCAGACTCCTTCAGCGGGCTGGAGTCCTCCCTCGCTCGATTT 

CGCCCGAGAGCGTTAGGGGTTTCTAAATGCAGGCGCCTTTGTGTTGTAACGMACTTTTA 

GTTTAAGGGAAAATCTCTTTTAAGCCACTGATTGTTCTGACTTGCTGAGTTTACTCAGCA 

GCCTTATGCTGGCTCTGCCACTGCACAATAAAACCAAAGCANGACAGTTGCAGNTNAAGC 

AAGGGGGAACATGTTTTGCATTT 

Sequence 748 

GCCCGGGCATGGTACCTGTGTGGAAAAGAATGCTTGCAAAGCTTGTCACCCTCACGAGAA 

TTCCTGTGACAGACATTTGCCTTTGACAGTGAAAACAGATATTAAAGTGAAAGGAGAAGA 

AACCGAAGAGCATCAGAGGGGACGACTGGGTTACTTAACTGTTGGGGAGCAATCTGAGGA 

GTTGGTTACCAGAGAAACTGGCGATGGCGATCCCGTGAGCAACATCTCTCAGACCCATTT 

TAAAATGCCGGGGGATACTTAATCATGCTGAAAAACAGCAGAGCCCTTGAGGI I 1 1 IGGA 

CTACATGTTGCAGAAAGAAAGAAGNAATTTNTACCTTNNCCNAAAAATAAAAATATNNNA 

NNNNGGTACCTCGGGCCCGGTTTTNAACTAGTGGGATTNCCCCCGGGCTTGAAGGAATTC 

GNTNTTCAAAGCCTTNTTCGATCCCCGTCCNANCCTCNANGGGGGGGGGC 

Sequence 749 

AGGTAC I M I M I I I 1 I I I I M J I { I 1 GGNCTAACTGNNNGGAGTATTTCTtTTACCCAA 
GATAAGTAAAAGCTACAACTCTTAGTATAAATATGNGTCCAAGTGCCTNATAACTGCTAA 
CCACAGGGATCCTGAGCTCTNATAGCTTAAACACACAGNGTNNATTTTACTGGTCTACTT 
CTCCTGNAGACCTAAAAGGGCCTATAGCCTCAGTAGTTGACAAAACAACATATTAAAATT 
CCTCACTGATCACTAACATAACCTAAAATCCCTGCTTTTGACATTAGCATGGNANACATC

CTTAGCAGGCCTAAATAGAATGGCCTTATAAGTGGATCCAAAGGGC 

Sequence 750 

AGGTACATCACCCTGCTGAGGGACATCCAGGACAAGGTCACCACACTCTACAAAGGCAGT 

CAACTACATGACACATTCCGCTTCTGCCTGGTCACCAACTTGACGATGGACTCCGTGTTG 

GTCACTGTCMGGCATTGTTCTCCTCCAATTTGGACCCCAGCCTGGTGGAGCAAGTCTTT 

CTAGATAAGACCCTGAATGCCTCATTCCATTGGCTGGGCTCCACCTACCAGTTGGTGGAC 

ATCCATGTGACAGAAATGGAGTCATCAGTTTATCAACCAACAAGCAGCTCCAGCACCCAG 

CACTTCTACCTGAATTTCACCATCACCAACCTACCATATTCCCAGGACAAAGCCCAGCCA 

GGCACCACCAATTA 

Sequence 751 

GTGTCCGGMTCCTACCCGGTGTGNNGACAGTGCCTGATAGTTTCTTCTGCCTTTCTATC 
CCAAAACGATTGGTCAGTTTACCCMGTTTGCAATGCAGTTTANAATCTCCCAGGAACAT 
CTCTTCTAGTAGTTGCCTTAGCCATCTTGTAGTTGATTTGACl I I I I 1 1 1 I I I I IGCTNN 
CAGAAAGCTCTATGCTTCATATGGACTTGCATACCAA I till I GTTCATCTGTTGGTCAT 
GATGGTTAGCAGAGCCTGACCTCCTGTTACAATAGAATGATCGGTTCCTGGGCTACAGAC 
TTGAGTCTG I II I I I I IGI I M I I AAACCTTCCCATGNGGCAATTTGCCATATGCAAAAC 
T 

Sequence 752 

CCGGGCAGGTACGCGGGTGAAAATGGAATAGTTTTCTAATTACAGAAAGAAAAGAAGTTG 

AAGTGGGTTTCGCCATGTTGAGCAGGCTGGTCTCGAACTCCTGACCTCAGGTGATCAGCT 

CGCCTCAGCCTCCCAAAGTGCTGGGATTACAGGCATGAGCCACCACGCCTGGCCAAAAAT 

CTTATAAATAATCCCCTTCTAATTTCGGCCAGCTTAATCACACACCAAATTCCTTTCATG 

AGATTMTCTTCCACMCTTCTACACTTCCTTAAATCTTTGATTTTGTCCTATACTTCTT 

TTTTTATATTAGCMTCTACTTTAGGACAGAMTTTACTTTCCT^ 

CCAAAGTCCTCTCTTAT 

Sequence 753 

TAGTTTTCTMTTACAGAAAGAAAAGMGTTGAAGTGGGTTTCGCCATGTTGAGCAGGCT 

GGTCTCGAACTCCTGACCTCAGGTGATCAGCTCGCCTCAGCCTCCCAAAGTGCTGGGATT 

ACAGGCATGAGCCACCACGCCTGGCCAAAAATCTTATAAATAATCCCCTTCTAATTTCGG 

CCAGCTTAATCACACACCAMTTCCTTTCATGAGATTMTCTTCCACAACTTCTACACTT 

CCTTAMTCTTTG ATTTTGTCCTATACTTC I I I I I I I ATATTAGCAATCTACTTTAGGAC 

AGAAATTTACTTTCCTTTCCTCTTGATTTTGACCAAAG 

Sequence 754 

CCGGGCAGGTACCTATATGATGTTGGCCATGCTCACTCACTCCTCCAACCCTCAGTTTAC 

ACATCTGCMAATGAGATACTTCTTTTCCAGTGTTGCTGTGGACATTAGCAGGCACACAC 

ATTTGGTGCTTGCACAAATGAGGTCCTAAGAGGTGGGTCCCTCTCATCTTACGTGAGGAA 

ACTGAAGCAGATTAGAAATGACCCAAGGAAACCACTCCGAGTTCAGTCTGGAGCCCACTC 

CCCTAGGTTTTAATCATCCCCCAACTCAGTCCCTATCTGCTGAGGTTCCTGGATCCAGAC 

GGTTCTTACCAAGGAACTGTCTGTCCTCACCACATGGATGGTTTTCCTGGCAGAGGTGTG 

CCCTGTGAGGGGGTCA 

Sequence 755 

GCCGAGGTACANACAAGGGGGCNACTGNCATGGGGGNGGNNTCTGGTCTTGTAGTCNGTT 

TGGAATTTTCTAAGTCAGGGTGGGGTGGGGGGACTGTGCACGGGTCATGTGCAGACTGGA 

ACCCATCTCCCCCTCGGTCTGCAAGTTAAAACAATTGGGTTGTCCTTCTCAGCATCTGCC 

AATGTCTCTTANTCAATCTTGGATCAAAAGGGCGTTGGAGGAGGAGGCTGGGAGGGAAAT 

CCAGACAGTTCTCCGCCTCTGACATCAGGTCCAGCTGTTAGCATCGTGCTGTGGGTCCCT 

GAACAAGAAGCAAAGTCAGGACT 

Sequence 756 

AGGTACCGCTGTGTCCGGGTGGGTGGNGNGAATGCCGTGCTCCAGGTGTTCACAGCTGCT 
TCGTGGAAGACCATGTGCTCCGATGACTGGAAGGGTCACTACGCAAATGTTGCCTGTGCC 
CAACTGGGTTTCCCAAGCTATGTAAGTTCAGATAACCTCAGAGTGAGCTCGCTGGAGGGG
CAGTTCCGGGAGGAGTTTGTGTCCATCGATCACCTCTTGCCAGATGACAAGGTGACTGCA 
TTACACCACTCAGTATATGTGAGGGAGGGATGTGCCTCTGGCCACGTGGTTACCTTGCAG 
TGCACAGCCTGTGGTCATAGAAGGGGCTACAGCTCACGCATCGTGGGTGGAAACATGT 
Sequence 757 

AGGTACCTCTGGATATGTTACACTGAGANAGATACTTCATCACTTACATGATATTGCTCC 

CCCACAMTTTCATAACCTGAATCTAGTTATAAGGAAATACTATGCAACCCAAATTGAGG 

G ACATTCTGCAAAACAACTACCTGTAATC I I I I I I 1 I I I I I I II 1 I GAGACGGAGTCTCA 

CTCTGTCGTCAGGCTGGAGTGCAGTGGCGCGATCTCAGCTCACTGCAACTTCTGCCCCCG 

GGGTGCGAGCGATTCTTCTGCCTCAGCCTCCTGAGTAGCTGGGACTACAGGCACACGCCA

CCACGCCCAGCTMTTTTTGTTGGGGTTTCACCATGTTGGCCAGGATGGTCTCCATCTCT 

TGACC 

Sequence 758 

CCGGGCAGGTACTATGGTCCCCGGCAACCTCCCCTTCCTCCTGGGAATGCTCAAATGGGA 
AGGCAGCATGAAACGGTGAACAGGCAATCACTGGACAAAGTCACAAGAACTGGGCTTTAG
AAATGGTTTTACCATTAGCAGTTGTGACACCTCAGAAGTGGCAACTCTGGATCTNAATAC 
CCTACCCTTNACCCTAAGNANAGGTACCTCCNCNATTTTNNCGGGGGGAAACNTTCTNGG 
GAANTTNCCCTTTCCNAAAAAGGGGGGGGGGGGCTC 1 11 11 1 1 1 111 11 I GGGGGGGGGG 
CCCCN NCNN CCCCCCCCN II I I I I I I II NTNAAAGGGGGNTTNAANANANATTTCNTTNC 
TCNTTNTTTTNTNANGNNAAAAANTCNGGTNGNGGNG I I II I II I I I I AN AAAAAAAAA A 

Sequence 759 

GGCGGCCGAGGTACMAGMAGGACTTGATAGCTATTACCTTGCTGCTATGTTTGTTNCT 

TNGNCTCACCAATCATTNTTNTGTATACCTAGCACTGCACCAGGCGCTGAGGTTAGAGAA 

ATAACTAAAACTGCGCCCTTCACCCCTGATGGCAGGATAGGCAAGGTTGGCACCATCGTC 

ACAGCAGGACCCTCATCGATGCCTTGGTGTGTGCCTGGCATGGNGTTTGCAGCAGTTTAT 

CACATNNAATCCTTACAGC 

Sequence 760 

AGGTACTCAGGCCTTACTGGGATTTCCTTTAAGACCTCTGGGAGGAAGTGTCAGTAGCTG 

GGCAGGCCTTCTTGGCAAGCATTCCTCCCTGGGTTGTGGCGGGGGCTCCCGGCCTGCTGT 

GTGGCAGCTGCAGGCTCCTGGGGACCTGAAGGAAAAGCTTAACCGTTCTCCCTTCCCTTG 

CTTGGCACTTAGAGCACTAGTTCCATTCCAGACATACCGATTATCTTGCCTACGTGGCAT 

AGAGGCCTAGGAGCCTCCCTGGGAGGAAGAGGCAGGCCAAGGTCTTGCCTGGCTGCTTTT 

AGGGGGAAAGATGTAGGGAGGAAGCTGCCTTATGCTTGGATCTGCAACGTTTGCCTGGAC

CTGCGGAGCCTATTTTGGCCAGGGGGAGGGAGACAGAAATTANACCCNANGTATTNAGGT

AATCCTTTTNTTGCCTTTGAACATTGCNCGGGNGTACTTTGNAAAANAAAAAAAAAAA 

Sequence 761 

CTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGTGAAAAT 

GGMTAGTTTTCTAATTACAGAAAGAAAAGAAGTTGGAGNGGGTTTCGCCATGTTGAGCA 

GGCTGGTCTCGAACTCCTGACCTCAGGTGATCAGCTCGCCTCAGCCTCCCAAAGTGCTGG 

GATTACAGGCATGAGCCACCACGCCTGGCCAAAAATCTTATAAATAATCCCCTTCTAATT 

TCGGCCAGCTTMTCACACACCAAATTCCTTTCATGAGATTAATCTTCCACAACTTCTAC 

ACTTCCTTAAATCTTTGATTTTGTCCTATACTTC I Mil l I ATATTAGCAATCTACTTTA 

GGACAGAAATT TACTTTC CTTTCCTCTTGATTrTGACCAMGTCCTCTCTTATGCAAAAT 

GAAMATTACTCTTTTTTCMCTTTCTTTACCMAMTACATCCTC 

ATCTCTCCTACTTACTGG 

Sequence 762 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGTGAAAAT 
GGMTAGTTTTCTAATTACAGAAAGAAAAGAAGGGGGNGTGGGNTTNTGCCATGTTGAGC 
ANGCTGGNCTCGAACTCCTGACCTCAGGNGATCAGCTCGCCTNAGCCTCCNAAAGTGCTG 
GGATTACAGGCATGAGCCACCACGCCTGGCCAAAAATNTTATNAATAATCCCCTTCTAAT 
TNCGGNCANCTTAATCACACACCAAATTCCTTTCATGAGATTAAACTNNCACAANTNCTA
CACTTNCTTAAATNTTTGATNNTGNCCTATACTTNI Milt I ATATTNGCAATCTACTTT 
AGGACAGAMTTTACTTTCCTTTCCTCTTGNTTTTGACCMNGTNCTNTCTTNTGCAAAA 
TGNANAATNNCTNTTTTTTCAACTTTCTTTACCAAAA 
Sequence 763 

TTAGGGCGATTGNAGCTCCCCGCGGTGGCGGCCGAGGTACATGTAATGCTCCTGAACTGT 

ATGCTNGACACGGCTGTCTACNTAGGTTTTGTTCTGTGTATTTTATGACTAI I I I I I I AA 

AAAGTAAACAAAAAAGAATTAGCTGGAAATACCAGCACAGGCAAACCCCTGGAGACAGAA 

AGCAGGTGAGTGGTTGCTGGGGCTTGAGCAGGAGGAAGGGCGAGGGACTGCAGAATGGCC 

ATGGGCTTTGCCTTCTAGCATGATGAGAATGTTCTGGAATTAGACAGTGGTAACGCTTGT 

TCAACACTGCCAGTGTAGTTAATGTCACTGAATTATACACTTTAAATGGCTAACATGACC 

MTTTTATGTTATATATATTTTACTACCACAAAAAAACTAGCTGGCACCTAAAAACATTC 

CATT 

Sequence 764 

CCGCGGTGGCGGCCGAGGTACTTGGATGGGTTTTGTGTGTATGTTTGTGTGTGCACTNGC 
GTCCACCCTGTTGGGCTTAGTGMCTTTTTGATTCAGTGATTTAMGTTTCTCATCAGAT 
TTGGAAAATTCTCAATTAC I I I I 1 I CTTTAAATATTTCTCTTGCCCTTCCCCTCTCTCTT 
CTTCCAGGATTCCAATTTCATCGATGTT 
Sequence 765 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGGTTCAAGCG 

ATTCTCATGCCTCAGCCTCCCAAGTAGCTGGGACTAAAGGTATCCACCACCACGCCTGGC 

TMTTTTTGTATTTTTAGTAGAGATGGGGTTTCACCATGTTGATCAGGCTGGTCTCGAAC 

TCCGGCCTCAAGTGATCTGCTCGCCTTGGCCTCCCAAAAGTGCTGGGATTACAGGCATGA 

GCAGCTGTGCCCAGCTGGATAATTATTTAATAAATTGGGGAGCATAGGAAGCATAGTATT 

TGTGAAGTGGGTAGGCAGGTGTGATGGGGGTAGGTGATGTTACATTTGGGGCATTTTGAA 

GTTGGTGGTTCTTCTGAGTTGAGCAGTCAGTCACTCTTCATTTGCTGCACCTTTATCTCA 

TTTTAGCCAACAGACATTGAATACCTACCAAGTCTTAGGTATTTGCA 

Sequence 766 

ACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAGAAA 

GAAAACAAATACCAAGTATTTACAGATCCAGAGAAAGTTCACAAGAATGGGAGGATGCCA 

GTTCCAATGCTTTGTAAAGTCAAAAATAGCCACATTGCAAAACAAACAAAAAAAAACGAG 

AACGTTCCCGAGTGTGCCTCCAAAACATAAAGGAGAAAATCATACAGAAAAACCTCATGT 

AAGGGTTGGAACTTGAGCAACCAGCTATCCAAATACAGAGGGGAATCCTCGCTTAGCTAG 

GGCATGGCCTGAG 

Sequence 767 

AGGTACACACAGTGATTTGGGGTCC I I I I I CCTAAAACAGCTTCTTTATCAGGACTTTGG 

MTTCTGGGTGAGATAGAAACACTGAAAACAGGGCGGAAGTTTTTTCTTCTGGCTTCTTA 

GTCCATGGAGGGCTCAGCGTGGAGAGGATATGCCGTGGCATTCTCCCTGGGAGACCACAC 

ATGTTCCCGACAGCTCAGACCCCAGACCCGCACATGCTTCTTGACAGTTNAAACCCCAAA 

CCGNAGGNGCTCCCGACAGNTNAAACCCCANACCCCGCGTACCTGCCCG 

Sequence 768 

CCGCGGTGGCGGCCGAGGTACTTAATAATTCATAATTTAGCCATGATAGTATCTAAGCTC 

ACTTTCAGMTTATTGCATACATGCCTTAGGGAAGAACCTATCCACTMTGCTTTTAATA 

ACTTACATAGATTGTGTTGCGGCAAGTCAAGTTTTAATATAGAGGAAAGGGTTTATCTTA 

TCATAGTAAMTAGTAGTGATGTGTTCATMTTTACTATTTGCATGGTATATTATCAAGG 

CTGTMAAGCTTGAATTTGCCTTTTCCACATCTTCATTTCAAATTMTTTTTGTGAGGA 

CCAGAGAAGTGGGTAGAACCCAAATGCCCATGNGGGT 

Sequence 769 

GGCANGGTACAAATTCAGGGGAGGATGGAGCAGCTGCAAGCCTGGCTGGCATCCCATGCC 

AACAAGGTGACCCAGTAGATAAGTGACAAGGTGACTGAGCTGCCTGGTGCTCTTGCATAG 

AATTTTCAAGTGTTAGAAAAATGTCTCCATGCCTTGCAATTTGTCTCTTGTGGCCAAGCC 
TAACTCAGATGGAAATGCA6AAATCACCCGTCTTCTGCGTCGCTCACGCTGGGAGCTGTA
GACCGGAGCTTGTTCTMTTNGGCNATTTGGGTTCNTCCCCCCCGGGNNCNTN 
Sequence 770 

TACTTAGGGCG AATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTA C I I 1 1 I I I 1 I 

II I I I I I I I I I IGAGGNGGAGTCTCACTCTGTTGACCAGGCTGGAGTGCAGTGGCACAAT 

CTTGGCTCACTGCAACCTCTGCCTCCCGGGTTCAAACGGGTCTCCTGCCTCAGCCTCCCA 

AGTAGCTGGGACTACAGGCGCATGGTGCCACTCCCGGCTAGTATTTTGTATTTTAGTAGA

GACGGGGGTTTTACTGTGTTGTCCAGGCTGGACTCGAACTCCTGAGCTNAGGCAATCCAC 

CAGCCTCAGCCTCCCAAAGTGCTGGCATTACAGGCATGAGCCACCGTGCCTGGCCTCTTT

CATAI 1 1 I I I I I ACACTTTTCATTTCTTCTTATTTTAAGTGNGCTGGATAGGGGCTCCAG 

AACAGMTTCAATAGAAAGTTGTGACAGTAGGMCCCTTATCTTGGTCCCTGATTTTAAA 

GGAGGGTTNAAAAAAAACCCCCCCCAA 

Sequence 771 

ACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTAC I N M 1 1 I M 

I I I I I 1 1 I I IGGAGAGAGGGGGTTTCACTATGTTGCCTAGGCTGGTCTTGAACTCCTGGC 

CTCAAGAGATCCTCTTGCCTCATCCTTCCAAAGTGCTGGGATTACAAGCGGGAGCCACTG 

TGCCTGGCCTAGMGATCTGTTTTCTTTTCTCTGAATAATTCTTGTGACACTGTCTCTCC 

CTCCATCTCTTTCTGTTTCTTTGTCA I I I I I CCCAGCTATCC I MM I CCTTGNCTTGTC 

CTCTTCTCCCCTCCATCCTAAAACCTTTGATCACAAGCTAGTTTCCTTTCCACATCATCT 

GCTCCCCTCTACTAAACGCTATTTCGCCCCCACCTGCTTTCAAGCTGNGCTTGCCTCTGA 

GCCCCTCTTTCACCACGGCCCAAA 

Sequence 772 

NCTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTCATGCTAGACAA 

CCTTATGACTTGAAAACAAAATAATATTTGAAATGGAAATGGCCTCAGTTCCACCCCTGG 

TGCCCACATAGCATAGTGAACCTGCCCCTGCAGCATTGCCCATGAGTGCTAAGATCCTGT 

GCCCATTTGCATGTCTTCCTTAAACAAAAGACCGCCTTAGTAAGAAATTAGTAAACCAGG 

GAG ATAA TCAACTTATCCCCCAMAGATTTMGCCTCTCATTTTGTTTAACCTTCATTGG 

GGATTTTAAATAGAAAACTAGGGCCCGGCAGGGTGGCACATGCCTGTAATTCCAGTACCT 

GCCCGGCCGCTCTAGAACTAG 

Sequence 773 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATCACCCTGCTGAGGG 

ACATCCAGGACAAGGTCACCACACTCTACAAAGGCAGTCAACTACATGACACATTTCGCT 

TCTGCCTGGTCACCAACTTGACGATGGACTCCGTGTTGGTCACTGTCAAGGCATTGTTCT 

CCTCCAATTTGGACCCCAGCCTGGTGGAGCAAGTCTTTCTAGATAAGACCCTGAATGCCT 

CATTCCATTGGCTGGGCTCCACCTACCAGTTGGTGGACATCCATGTGACAGAAATGGAGT 

CATCAGTTTATCAACCAACAAGCAG 

Sequence 774 

AGGTACGCGGGGAGTGAACGCTCTCGGAGAACCCTTTCCCACGAACGTCCACTTCAAAAG 

AACGCGACGGAGCATTAACTCTGCCACTGACCCCTGGCCTGCCTTCGCCTCCTCCTCTTC 

CTCCTCTACCTCCTCCCAGGCGCATTACCGCCTCTCTGCCTTCGGCCAGCAGTTTCTATT 

TAATCTCACCGCCAATGCCGGATTTATCGCTCCACTGTTCACTGTCACCCTCCTCGGGAC 

GCCCGGGGTGMTCANACCAAGTTTTATTNCGAAGAGGAAGCGGAACTCAAGCACTGTTT 

CTACAAAGGCTATGTCAATACCAACTCCGAGCACACGGCCGCTCTAGAACTAGTG 

Sequence 775 

CCGCGGTGGCGGCCGCCCGGGCAGGTACACTACTGGCATAAGAGTAAATTGGTGAGAACT 

TTCTGGAGGGGTAGTTTGGCAATGTGTTTCCAAAAAATCTAAAAATTATAT7TGCCTCTA 

ATCCAGCAATTATACCTCTAGAAATTAATACTAAGGAAAATCTTAAGAATATACCGTAAA 

ACTTTAGTTGTMGAAATTTTTTTGTGGCCAGGCATGGTGGCTCACACCTGTAATCCCAG 

ACTTTGGGAGACCAAGGTGGGCGGATCTCCTGACCTCATGATCCACCCGCCTCGACCTCC 

CAAAGTGCTGGGATTACAGGCGTGAGCAAATTTTAAATAAGAAGAAACAGTCAACAGCAT 

CAGACATAGTAGGTATGTCCAACACCATAATGGCTGAAAAGTGCCCCCTAGTCTGGCAAT 
TAGTAGGTCATTGGTTTATTAATAACCGGCATGTTAAAGTTG
Sequence 776 

CCGCGGTGGCGGCCGGCAGGTACAMTCATACCTCCCMGGTATTGCTCCATTGTGTTTT 

TGTGCATTTGGTTTGGATTTTTATGGGGAATTGAAGACAAGTGGATCATAAAGTGCAAAA 

TAAAATGCTCTAGAAATGACAGATGGGGCACMTTTCCAAGAAAATTCATCTAGACAGTG 

GCAACACTGAGAAAAAAAAGAAACATTCAAGAAG 

Sequence 777 

GAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTAC i 1 1 I 1 1 I H 1 1 I I 1 I II I 1 

TGCTAGAGATGGGGTTCACCATGTTGGCCAGGCTGGTCTCGAACTCCTGGCCACAAGTGA 

TCCACCTGTCTCAGCCCCCCCAAAGTGCTGGGATTACAGGTGTGAGCCACCACTCCTGGC 

CCATGTTTAGGATTTATACCMTATTATTMCTTAGAAATAAGTTTCTMTAMTTATTC 

C ACCCG AACTTAGGGTMCTGAATTTTAATGCTGATGTATTAAGCAGGTTCTTCCTGGGG 

TCTTTTGATTCTCMGGGATCCTTCACTGNGGGTGGACTTCAAATTAATAGGAAGCAGGA 

AGGAGCCACCTGCACTGTTTTCTTGACTGGGGATGACACCNAAACCTT 

Sequence 778 

CCGCGGTGGCGGCCGAGGTACTATGAGAATTTCAAACAAAGAATGAAGCCATAAAACAAA 

AAGACTGMTATTTGGCTCTGCCTGGCTCCCAGGCTTTCTACTATTCTTGTGACTTGGCC 

TCMCAAMTCTAMGTGACTTGTTATTTGTGGGTCAGCTTTGTCCCATCCTTACCAGTC 

ATGGCTTTAGACAAAAGACTCAGCACCACTCACCCTNTGGGACAGTNTGACTGNGGTCTG 

AGNCCCCTGCTTANATATTAGGCTTAAGCTCAGTT 

Sequence 779 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTATGAGAATTTCA 

AACAAAGAATGAAGCCATAAAACAAAAAGACTGAATATTTGGCTCTGCCTGGCTCCCAGG 

CTTTCTACTATTCTTGTGACTTGGCCTCMCAAMTCTAMGTGACTTGTTATTTGTGGG 

TCAGCTTTGTCCCATCCTTACCAGTCATGGCTTTAGACAAAAGACTCAGCACCACTCACC 

CTCTGGGACAGTCTGACTGTGGTCTGAGGCCCCTTGCTTAGATATTAGGCTTCAGCTCAG 

TTCC 

Sequence 780 

TNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACACACAGTC 

AGCATGCGCTGTAGCAATGTGCTTTGCAiSCTGGAACTCCTATCAAGCATCCTAGGCAAGG 

CATGCACCCCAGCGCCAGAGAGAATCAGGAAGGGGAAGGTGCCCTGAACCTCAGACAAGA 

ACCCCTTCCAGAAACCACCACCAAAGCCATCACTGTGTTTCCACCCTCAGACCTGTGTCT 

CTTTAGCTTCTTGGTAGAAGGAAAGAAGAGGAGCTTGGGTGGGGCAG 

Sequence 781 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCACGGTACCATGGACAT 

GCACCCCGGCCACAGAGAGGCGCACAACTTNTACAGAACACCCTTCCACCTGGTCTTCCA 

CAGCTGCATCAGATTCCTGGACTGTCACAGACATGACTTCAAACTTGAAAGTTGCAAGAT 

CTCCTGGAACMTTTCCACAATGCATACAACTTCATTCTTAGCCTCAAGCACTGAATTAG 

ACTCCATGTCTACTCCCCATGGCCGTATAACTGTCATTGGAACCAGCCTGGNCACTCCAT 

CCTNTGGANNNTTAMCNTTNAAMANNNAMNCNNNCCCCTNGGCNTTTTTAAAAANNN 

GGNNNCCCCCGGGNNNGGGAAA I I I I 1 1 M I A AAN A I MINI I CCCCCCCCCCCCNGGG 

GGGGGGGGCCCCCCCCCCCCCCI I I I I I I 

Sequence 782 

NAATTGAGCTCCCCGCGGTGGCGGCCGCCCGGCAGGTACTTCCCTGAGCAGTCGAAGTGG 

ATGCCCAGACCAATGGCCAGTGCTAATATCAATGCAATGATCCCAATGACGATGATTGGA 

AAAAACTTCAATGGCAGCAGTGACAGGATCTGTGCAGCAACAGCATCTGCATCTGGTGCA 

ACAGGACTTATTTTCAAATCATCAAGGCCAAAAAGCGATCGGAATGAGAAGGGGGCTTCA 

ACAGCAGGCGGATCATTTTCCCCCATGGTGACTATTTCAGGAC 

Sequence 783 

CNATTGAGCTCCCCGCGGTGGCGGCCGAGGNCTGATGTCCTACAGTCCTCTACCTGATCT 
ACGTTCACTGGAAAGTGTNGAGTCTCAGCAGGAAGCACCTTGCTCTCGTGTCCGGCTAAT 
TCGAGTGCTTTACGTMGTAGAGGMTTGCTGACTTTTGGGACATTTCTGGTCTTGCCAA
AGTTCACCTTGTAGTAAAGCCCCCAAAGATACTTCCCAAATAGATGCTCTCTTGAAAATA 
ACTCAG 
Sequence 784 

CTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATGTCGGCCTATTTTA 
TATTCTGTTTAGTGCTTTCMCCCCGAAATCCACCTTCTAACATTAATATTGATATCCAT 
CCTC TCTCTCCTCCMCCTCTCTCTCTCTTGCATTTACTTTTAGATTTTCATTACTTTCT 
TTTTATTCTGATTCTTGTAMTAGTATAAACTAGATT 

MTTTATGATTGACACATAATMTTGTATATATTTATGGGGCACAACGTGATGTTTCGGT 

GCATGTATACATAGTATAATGATCAAATTAGGGTAGTTACCATATCCATTACTTTAAACA 

TTTATCATTTCTTTGTGGTAACAACATTAAAAATCTCATCTAGAATGGCGTGAACCCTGG 

AGGCAGAGCTTGCAGTGAGACGAGATGGGCGCCACTGCCTCCAGACTGGGCGGACGAGCG 

AGACCTCCCNTCTCAAAAAAAAAAAAAAGG 

Sequence 785 

GCTCCCCGCGGTGGCGGCCGGCAGGTACGAAATGAGAGAAATGGTTTAGTAAACGTATAA 

GACATCAACATAGNAAAGTATTCTATAGGNNTATGTGTTGGAATTACAAAGATGAAGAAA 

AGATACAGGCAAGTATTTGATATACTNAATTAAAAATAGCAAGATGTAGAGTAGNCATGT 

ATACAGTGATAGCAAGAACATGGATCCTTAAGGACAAAACTGAAACATAATGCAAAAAAA 

GAAAAATATGCAAATTATTTTTCGTATGATGTAAGTTGTAAATAT 

Sequence 786 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGNGTACNCGGGCCTATTTCTGAA 

TAACTCAGNGGCTTAAAATTATATCCCAAAGTAGNGGTATCACAGGGTTTCCTGATGAGG 

ATAAATGGGCCTGAAGTGCTTATGGGCACCCACTATGTATCATGGNAAAACTTGCACGTG 

TGTGTGTGTGTTGAGAGAGAGAGAAAAAANAATAGANAAAGTTGGTGAGAAAAGGGNAGG 

CTGTTTTTTGNNCCGAGGGNTGTNTGGTTGGGCTTT 

Sequence 787 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGNACACANTAAGTGAAGGGCC 

AAGACTGACGGCTGATAGGACAGGGGTGACCAGTGGTGGGGAGGGTAGTGGGAGCAGTCC 

ATCCTGGMTCTGGCATTCMGGGGCGCATTGTCTGTGGAGGATTTAAAAATAATAAAAC 

CAACTAAAGGCAGTCTGCTTTTTATGGTCACCAGGCGCCAGCAATTCTAAATTTCAGTGA 

TAAAATATTCCTCCTCACTGGACACGAGAAGCTGGCTTNTCCTATTCCCCAGTACCTGCC

CG 

Sequence 788 

GGNGGCGGCCGCCCGGTTTTGGACGCGGGTNTNTGCCCTNACTTTTTTAGCGGAGCAGAG 

GAAACATTCATAAGGAAATATGCGAGTAGAGCTCAGGAGAAAAGCAGGACTAGAGGCCCA 

AGAATCACAGGCCAGAAGAAGAAGCTGTAGCCTCGGGAATGGAAGAGCTCTCTGAAGGGG 

AAAGGGGAGAACAGGAATGTNCCAGGAGCCAAGGCTCATCTATAAGGGACTTNCACATTT 

AGGATGTAGAAGAAGGAAGCAGAAGCAGGGGATGACCAGAAATGGCCCCAGAGATGAGAT 

GAAAGTTAGGAGAGCGGNGAGCMGCCTTTAGGTTTCACAAGGGAAGGAGGGAAAGTAGG 

TGTTAGGTGCTGCCAAGATCAGGGAAAATMGCAGAAGACCAGGCCATTTTNANTTGCNG 

TGG 

Sequence 789 

CCGCGGTGGCGGCCGAGGTACCACAATCAACTCAATCACAACATATTACAACAAAACCCT

TCATCTTTTTCCTTMCCCACTGTAACACMAGCAGAGAATACAGATTAGCTTTTTTATT 

TGTCTGT1TGACTTCATCTCTTACATACCTCTATTTCAGTATCTATGATATTTTCCTCTT 

CTTATCTGTTCMTGACAGTCTTCCCTTTTMTATTCTAAACTTGTCAAGCACAGCANTT 

AAAAAAGTATTCTCATGTATATATTTTATCTTTAGAGCATCGCATAAAGNCTGATACATA 

GGMGTTTTAGATGCATATTTACATTGGGTAGATGAATCCAGGGGAAAAG 

Sequence 790 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTGCCCAAGAGAGACGTCTCTTACTGCCTCATT 
AAGCATTTGGAGCTGTTAAACACAAATCAAGGCAACCAGAAAGGGCATCTTGGCTTCAGG 
CTGGGCATAACCATCCCATTTGCCACATAAAAGTCTAGTGGCTACTCTGCACCCTTTCTG
GGTAGAAGCAGAGTTAGTTTGGTCATGGGGGCCCCTGTGGGACAGTGTTGCCCAGACAGG 
TACCTCGGCCGCTCTAGAACTAG 
Sequence 791 

AACCCACTATAGGGNTNATNGGAGCTCCCCGCGGTGGCGGCCGCCCTGGCAGGTACTGNC 

TGTCTCAAATTTTTGGATACGCTGTCCCTCATGTCACCTAGCTGCTGAGAGCTTTGTGAT 

CCTAACAGGTGATGACTCAGACCGACACTGCATTGGTAGGAATTCCACAAATAGGTGCCT 

CMTGTGCCTAGATTGAAATATCAGCCTTTCCCAGACTGACCTGATGGGTTGACTTCAGG 

TGTGGTGTAAACACCTACATTTTAATGTAAACATTTCAGTGTAATCAATGAGAACTATCA 

TTCTGCTTTAATCACCATGAGTTCTGAMTMCAMGGATTTGTCTGACATTCATTCTAA 

GAAATTCATTCTTACCTGACTMGAMCTTTTTAACCTGGCACAATAATAAGAAATGACC 

TGGNAAGTACCTCGGCCGCTCTAAAACTAAGGTGGGATCCCCC 

Sequence 792 

NGGCGAATTGGAGCTNCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGCAAAATGTAA 

GTTGGTAAAAGACATTGGACTCCAGCTATGTTCTTTAGAAAGAAGGTATTGGACTCTGGC 

CATGTTCTTCAGAAAGAACATGCCTGGCTTTTTCACGATTTGATCAGTCTTCTTAGACCC 

TGAACCCCACCATGAAATGGCTTCCCCAGACACAACCCGAGAGAGTTATGCTTTGTTTCT 

CAGCTAAAATATTTTGCAGATCTTAATTTCCTGGGTCATTGCATC Ai I I I I I I I I I I I I I 

T 

Sequence 793 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAGTATACCTTTT 
CATTTMATCATTGMCAGTTCACAATGGCTGTTGTAAAGTTTTTGCCTTGTTACTGAAA 
CC ATCAT CTCTTGGTCTATTTCTATGGAGTTA I I I I I I CCTAGTTATTGGCAGTTTGCTT 
ATCTTTTATGTGTCTMTAACTTTTAATTGAATGTAGAACATTATGGATCTTAT Ai MM 
GAGMTCTGAATGTTTAAMTTCCTTTGMGAGTTTGTGTTTTCTTCTGGAGGCAGTT^ 
CTTACTGGCTGTGAGCTCAGTCCTGTTGAGGCTGTTTNGCGACNNNCTGTGGCTTTGTCA 
GGGTGGGGTGGTGGATCAGACATCTGGTCATCAAC 
Sequence 794 

ACNCCCGGCCAAAAGGGAGNNCACAGGGGGGGGCCAATATATAAGGGGGGCAAGGAGGGG 

GGGNNGGGNAAAGANGNCAAACCCCNNCCCNNGGGGNNGGGGGGGGAGGGGNAANAAAGA 

AGGNGGGGGNGNCCANGCAACCNAAAGGGACNAACCCAACANNNAAGCGGGGGAAGGGGG 

CACGANAGAANANANANNANAGGNGANNCAAGAACCAANCAAGGGNGGGGGGGGAAGGGC 

NNCAAANGGGNNNAAAAANNGGCNCAGGNNACCNAANGGGGGGNNGGGGCNNAGAANGGG 

GGNGGGGGNNANGGAAANNAAANNCNGGANNNAACCAAGGGGAAGGGAGGGGGNGGGAAA 

NCAAGGCCCCCGGGGNCAAAACAAAAAGNGGGGGGAAAANACAANCGGGGNNNGGNGNCA 

GGAGGGGAGGNNANNAGCCGGCGCGNAAAAAAAAAAAACAANCCAGGGGCAANGGGGNNG 

GGGGNGGCNAANGCCNCNNGGGGNAAANCCCNANGGGNGNACCCNNGGAGGGNGGGGNNN 

GGGGGGGCNCNGAAGGGAAAAANAACCCCCAAGGGAAAANCCCANGGAANGGGNAANAAG 

GGGGGNGGCAAGNGGGNANCCCGNAAAAANAAAGGGGAAACNGGGGGANNGNCCCAAAAC 

CCCNGG 

Sequence 795 

CCGCGGTGGCGGCCGAGGTACTATCTCTTAGGAGAAGGCTGACTTGAAGGCTGTGAAAAA 

CTAAGAAAACACCAACCATTCAACAGTATACTAGAATTCCTTTCAATGCATAATAGAAAC 

MGMGGGATTAGAAAAGCATGTCATAATTTCCAGATAGCATAATTATTTACATTAAAGA 

TCCMGAGACTCAGACCTAGTAAMGATTTTGGCCACATTGTGACATTTGAGATCACATT 

AAAAAAAAAAAGGAAAAATCAAGNGATACTAACATCACCAATTAACATCATNAATTAGAA 

AATTTATCCATCAGCNGTATTNNNNNGGGCTCAATGCNTTGTAATGCCGCATTTCGGGAG 

GC 

Sequence 796 

CCCCCCCCGGAGTTTCAGCGAAAANCCCGGCCGAGGNACCNCNCCNTTACNCCCAGGNTT 
ANAAACNCCCNGGTNTNGNCCCAGGAGAGGCGGGGGANCACCGCGCCNGGCCAGCANGGN 
CCCCTNCTAAATNNNGNNGTNNAAGGAANCNGGACCCCAAANNCNCGCCCCCCACGAAAG

GAGCCTGGGAACTACCAGGGCGTGAGCTCACCGNGCCCGGCCGCTCTAGAACGAGAAGGA 

ACCCCCGGGCTGCAGGAAAAACNATAACAAGCNAANCNAAACCCGACCACCCCAAGGGGG 

GGGCCCGGGACCCAGCTGTTTGGGCCCTCAANACNAGGGAAAAATTGCGCGCTAAGNGGN 

CAANCANGGGCCAAAANGCTGGNTNCCCCGGAGGGGAAAAAAGGACACTCCCGCGNCACA 

AATTACCACNCAAAACATAACNGAAGCCGGNNAANCAATAAAANNGGGGAAAAANACCCC 

NGGGGGTGGCCCTAAAAGGAGGGGGAGCCCCAACCACCAACAAAATAAATTGGGGG 

Sequence 797 

AGGTACTATCTCTTAGGAGAAGGCTGACTTGAAGGCTGTGAAAAACTAAGAAAACACCAA 

CCATTCMCAGTATACTAGAATTCCTTTCAATGCATAATAGAAACAAGAAGGGATTAGAA 

MGCATGTCATAATTTCCAGATAGCATAATTATTTACATTAAAGATCCAAGAGACTCAGA

CCTAGTAAAAGATTTTGGCCACATTGTGACATTTGAGATCACATTAAAAAAAAAAAGGAA 

AAATCAAGTGATACTAACATCACCAATTAACATCATCAATTAGAAAATTTATCCATCAGC 

CGGGTGTGGG 

Sequence 798 

GCGGCCGAGGTACAATTCAACAATTNNTGGTCCAGGATCATGMTGGGCCATTNNTAGTT 

CTGTGTGTGCTTAAACACATTTTTTTGTGGGGTGCTGTGGATGTGTGGATGTAGCCAAAA 

AAAACCCTATTGTGGGNTNGGTCCTGGGGCAGAAAGTCTGGTGCCAGAGAGTGGGGTTCT 

GGGGGTCTGTCTTCATAGTTTGGGGTAGCACTAAAATCCTGTGAGCCTTTCTGGGCCTTG 

GTAACCTCCCCTGTMGTTAGCTGTTAGATMTTCAGCTGGGTAGCATTTTATACCTGGA 

TGATGTTCTAAAGTCCAGCCACANAAGGCCNNNGTCTGGCAGAGTGAGAATTNCCTTTGA 

AGAACCTTNAAACTGNTNCCCNAGAGTGACACAGGGGNNCCTNGGGGAAAANCNAAAAAG 

NNNTTGGGAATTCTNTNCAAAAGNAAGNCCCA I I I I I I I I I GCCNNNATTTNGGCCCNCG 

NTAATNCCCCNCCCCAAGNAAAAANNAAAAAAANTN I M I N I MUM 

Sequence 799 

CGGTGGCGGCCGCCCGGGCTTGGTACCCTCTGTCACGGCTTCCCTTTGCTGGAAAAGGGA 

ATTTCCCAACCCCGGGTGAGGCAATGGCCCGCCCTGCTCCGTGGGCTGCACCTGCTGTCT 

GTCAAGCCCCAATGAGATGAACCCTGTACGCGGGGGCCTGGGATCTCAAAATGGCGGCCC 

CGTGCGGAAACAGCGTNTGGGAGCAGANATTGTTGCCTCCTGAA 

Sequence 800 

GGGCGNTTTGGAGCTCCCCNCGGTGGCGGCCGGGCAGGTACTATCTGGAACNTGTAGCTT 

CCTTTNGCACTGCAGCATGGGMGCCAGAGTTGATGATTCATACACCAGCATTTACATTT 

TTCAGCATGAAAGTGGTATGTTCTTCAACTCACAACCCATTGGCCAGAACCAGTAACATG 

ACTTCACCTAACTGCAAACTAGCTGGAGAATTGTGGGAGAGCTCATGG 

Sequence 801 

CCGCGGTGGCGGCCGAGGTACCATTTAGCACACAATTTCCATGTCCCAAAAGCAACCCCC

ATAMCAGTGAC TATTTTTTATGC TGTTTTTCTTTGCCCCMCAC 

TGTTATATCTTGC I M M M M I I CTTTTTTAATGGAGTCTCACTCTGTCACCCAGGCTG 

CAGTGCAGTGGCGCGATCTTGGCTCACTGNAACCTCTGCCTCCTGGGTTCAAGCAATTCT 

CCTGGGGGGTGGGGAGGTTTGCAAGTGATCCAAGATTGCGGCTCTTCACTCCAGACTGGG 

NGAAAGAACGAAACTCCATCTNAAAAAAAAAAAAAAAAANAAAGTACCTNCCCGGGCCGG 

CCGCTCTANAACTAGGTG 

Sequence 802 

CCGCGGTGGCGGCCGCCCGGGCAGGACGCGGGATGGTGTCAACTTATGTCAGGACCCATG 

GGCCCTCCCCATGCACACAGCACTCTTGGAATCTCATCCTTTTCCATGGCTCTGGCTCAC 

ACTTCCACAGCATTTACTCCTAAATATGCCCCCTGGGTTCAAGGGTGATTCTCGTGCCTC 

AGCCTCTCAGGTAACTGGGATTACAGGCATGCACCACCATGCCTCGT AI M M M IGTGT 

GTGTGTTTAGTAGAGACGTGTTTCACTATGTTGTCCGGGCTGATCTCCAACTCCTGTAGT 

CAAGTGATCTGCCCGCCTCAGCCTCCCAAAGTGCTGGGATTATAGACATGAGCCACCACA 

CCTGATGTCTGATGCTTATTTATTATGTGACCTTAGCGAAGTGTGGGTAGTCATTAGTGC 

TGGTCTATCTCTATACCTTCCCCAGGCAAGGTAGGATTGCACTTCCCGTCCACT 
Sequence 803

AACGACCGCCCGGGCAGGTACTAAGCCAGGACCTCAGTTAGCACTAAGCACTCTTACTAT 
TGCCCCCACCTGGCACAAAGCAAACTGAGATCTTAGTTGGGCCCATCATGTGTCATCTGA 
TTGTCTTAG AAGTTC 1 1 I 1 1 1 ICTAAGACAGAGTTTTGCTCTTTTGGCTCAGGCTGGAGT 
CCAATGGCACAATCTCGGCTTACTGCAACCTCCGCCTCCCAGGTTAAAAGCGATCCTCCC 
GCCTCAGCCCTNCGAGTAGCCGGGACCACAGGCACCCGCCACCACGCCCGGNTAACCTT 
Sequence 804 

TACTATAGGGCGAATTGGANCTCCCCGCGGTGGCGGCCGCCCGGGCAAGGTACTCTCTAT 

GACTATCAAGCTCAGGCCTCTCCCTTTTTTTAAACCAAAGTCTGGCAACCAAGAGCAGCA 

GCTCCATGGCCTCCTTGCCCCAGATCAGCCTGGGTCAGGGGACATAGTGTCATTGTTTGG 

AAACTGCAGACCACAAGGTGCGGGTCTATCCCACTTCCTAGTGCTCCCCACATTCCCCAT 

CAGGGCTTCCTCACGTGGACAGGTGTGCTAGTCCAGGCAGTTCACTTGCAGTTTCCTTGT 

CCTCATGCTTCGGGGATGGGAGCCACGCCTGAACTAGAGTTCAGGCTGGATACATGTGCT 

CACCTGCTGCTCTTGTCTTCCTAAGAGACAGAGAGTGGGGCAGATGGAGGAGAAGAAAGT 

GAGGAATGAAGTAGCATAGCATTCTGCCAAAAGGGGCCCCAGNTTCTTAATTTAAGCAAA 

CTAAGAAG 

Sequence 805 

CCGGGCAGGTACAATGGACTTTGACAGTTCTTCCCAAACAGATCCTAATTTTAAACATTA 

GGTTTGCTTTGATTCTTTTCCTTGGGGCTAAGAGCTCACAAAGACTTAGGTTCTGGTCAT 

GGCTCCAGAGGCCACACATTCCAGGACAAAGTCTCTCTACAGTCAACGCCTTAGTCCCAC 

ATCTGTAAAATCGGAATAATCATCCCTGATCCAGCTATCACATTGCAGTAGAGTGAGACT 

CAAATGAGATAATGGAAGACAGTGGGAATGATCATTTCCAACTTGGCCTGGCTGACCCAT 

TCCTTGTTCTAAAGTCAGCTCAGGTTTCACCTCTTCCAGNGAAGTTGACCTGGCACTTTC 

TTTTAGGATGGCTACTGCTCCTCTGGGTGCCCCGGGGCTCANTGTCTCCCCATCACCGCC 

CATGGCACACTTGGAGTGACTGGTCCTTTACTTTGNTT 

Sequence 806 

TNCGGGCAAGGTACATTGGCCCCAAAGAGNAGGAATTCCTTGTAGAGGAGCTTGTAGATG 

CTTNCCCTCCAGCGGAGAAGCAGGCCAGAGAAACCTCCGAAGCGGGCCTCCGCCACTTTG 

AGAGTGTATGAAACCGTCATGGTGCTGGGAGCCTGGGGCAGGAGGTCACAAGAGTTGCCC 

CCAGGGCTGTCGTTTAGTTCTCCAGACAACCTCCCTTCCACTCTGGTCTCACACACCCCA 

GCCTTCACCCTGCGTCAGTGGACAAGGGGGTAGGAGCCTGCAGAGCAGAAAAGTACCT 

Sequence 807 

AGCNCCACCGCGGTGGCGGCCGANGTACGCGGGATATGTAGAACTTCACNNGTTTGAAGT 

TGGCTGATTAMTATACTAAGTATTACTGMTCACTGCCCTGCCTTTTCTGCTTTCTTTA 

CAGACCTGTTTAGTATACACTGTATGTA I I 1 1 I 1 1 I I I I I I 1 1 1 GAGACTCCGTCTCAAA 

AAAGAGAAAATTATGGGCCGGGCACAGTGGTTCATGCCTGTAATTCCAGCATTTTGGGAG 

GCCGAGGCAGGTGGATCACCTGAGGTTGGGAGTTCGAGACTAGCTTGGCCAACATGACGA 

AACCCTGTNTGTACCTGCCCG 

Sequence 808 

CCGCGGTGGCGGCCGAGGTACGAGACTTGTCACCATGTGACATGGCAGCTTCAGAAACTT 

AGCCACTGCCAAAAAAAGAGCAGGCAGGGATAATGTTGTCCCATTGTCCAGTCAGAGAGA 

CCTGTTGAGTCTCTAGTTTGCCAGTCCCCAAGAGACCTTTGGAGTTTTGCTGGAGCCAGA 

CATCCTGCTTAGAGATGAGGAAGATCCTGCTGTTCCGTGGGGAGCTCTTGAGACACCCGT 

GCCACCACCCACCTTCTCCTGATTGCCACTTGCTGCCCTTTTCCCATTACCCTCTCCTGA 

CTCCATAAACATCTTCAAGTCTTCCCTTTCTCCACCCCAAAAAATGCCCACCTTGGAAAG 

GG 

Sequence 809 

AAATTAATTGGGGTTGNGCTAACTGCCCGGTTTTCAATCNNGNAAACCTTGTGGGGCCCA 
NNTGAATTAANANAATNGGNCCACCCCCCCGGGGAAAAGGGNGGTTTTNNAANTTTTGGG
GCCTTTTTCCCCI I I I I IAAAAAA 
Sequence 810 
CCGCGGTGGCGGCCGCCCGGNCACGGTACGCGGGGATGTCTTCTGAGAGAGTCAGGGCAG

CTGAAGACTGGGTGAGGGTGAGGGAAGCCGCTGGTGTCCTCCTCAGTCACCCGTGAGAGG 

ACTCCTNTGTGGAGCTAATCAACTGCAAGGAAGATTGTTCCCAGTGTCCAGACCTGAAGG 

AGTCTGGACCCATAGTGCANTGAGATTTGGGGAAGGAAGGATTCCGGATAGGGGTGAGCT 

TTNTGNTGATAAGCAAATGTGAAC 

Sequence 811

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGTGTTTATGGGAAGCCAGTAACACTG 

TGGCCTACTATCTCTTCCGTGGTGCCATCTACATTTTTGGGACTCGGGAATTATGAGGTA 

GAGGTGGAGGCGGAGCCGGATGTCAGAGGTCCTGAAATAGTCACCATGGGGGAAAATGAT 

CCGCCTGCTGTTGMGCCCCCTTNTNATTCCGATCGCTTTTTGGCCTTGATGATTTGAAA 

ATA 

Sequence 812 

CCNTCAGGTACCAGANCTTAGCAGGGATTTTGGACAACAAAAGCTCTAAATCCTCTTGCA 

TCGACACGTTCAATTTGCACTGACCMTCTGTTGGCACAGTMCTGTTTATAAGCTAAAT 

TTCTACATTTTGGCTACAAGTATCCCAAATCCACCTTTTAAAAAATCCTAGGTAGATGCC 

ATCTGGTGTTAATGATTTGCACACCCCTTAMTTGAAAATATTTTAAATAAATCTCACGG 

TTTTATATAGTATCATTMTGTGTCTATTTTAAAAAGACAATCTGAGAATAACACTTCCC 

CTAATTGTTGTCTTAATAATGACCAAGAGCTGAGGAAAAATGATTCACACTGTTAGTTGT 

TTTGTTGTTTTGCTCACGGGGGAAGGGGGTGAAGTACTGGCTGTGCCTGGGTTTG 

Sequence 813 

CCGGGCAGGTACATCACCCTGCTGAGGGACATCCAGGACAAGGTCACCACACTCTACAAA 

GGCAGTCAACTACATGACACATTCCGCTTCTGCCTGGTCACCAACTTGACGATGGACTCC 

GTGTCGGTCACTGTCAAGGCATTGTTCTCCTCCAATTTGGACCCCAGCCTGGTGGAGCAA 

GTCTTTCTAGATAAGACCCTGAATGCCTCATTCCATTGGCTGGGCTCCACCTACCAGTTG 

GTGGACATCCATGTGGCAGAAATGGAGTCATCAGTTTATCAACCAACAAGCAGCTCCAGC 

ACCCAGCACTTCTACCTGAATTTCACCATCACCAACCTACCATATTCCCAGGACAAAGCC 

CAGCCAGGCACCACCAATTACCAGAGGAACAAA 

Sequence 814 

CCGCGGTGGCGGCCGAGGTACATTATTCATATCCAGCACTCCCTGCGGCTGCTGCTGGAG 

GAGCAGTTATCCMCAAGGACTGTTTCAACCTCATCGCGTTTGGAAGCACAATTGAAAGC 

TGGAGGCCTGAGATGGTTCCCGTGAGTCACAACAATTTACAAAGTGCCTGGCGGTAGGTT 

ATGGGCAGAGACTTCGTGGGGCTGTGTCTGAGGGMGGTTTGCAGGCATTGTTTTCTCTG 

TCCCCCTCTCCACCAAGAAGTAGCTCTCTAGAGTCCCTGACCCCAAACAGCCATGGGCAG 

AAATCAGAAAACAGCTTCCTTCTGTCTGCTGCTCTCCCCACCTGGCCATCTTCACTTTAT 

GAGAGTAATGACATCGACTCCATCACGTCTGAGATGGGAAAAAGGCTCTTCAGCTACTCC 

CAAAAGGGTATGCCCTGGGCATGGG 

Sequence 815 

CCGCGGTGGCGGCCGAGGTACTC 1 1 I I II 1 1 1 1 1 1 1 1 I I 1 1 1 1 GAGACAGGGTTTCTGTT 

GCCCAGGCTGGAGTGCAGTGGCACAATCTCAGCCCACTGCAGCCTCCGCCTCCCCAGTTC 

CAATAATTCTCATGCCTCAGCCTCCCAAAGTGGTGGGATTACAGGCGTGAGCCACTGCGC 

CCGGCCACCTTTCTATTTTTCTGGTTAACTTTCTMATGTTTGAMTGGCTTCCAGTGAA 

TTTCATTTTATTATTGGGGGAMCTTCCATACTTATTTTC 

AGTATACTCTCCTCCCAAAATTTAGATAGTTGTATTTTTCTGATTATTCCAAATAAGAGT 
GCTGAGAGGCTMTCACAAAGAGCMCAGCCAGAGATTTACAAAGTGGTTCTCTTACTAT 
TGAACATTTTCACTTAT 
Sequence 816 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCCTGCCCAGGTAAGATCAC 

TGGTGGGTMGMCATGAGGTTCTCACCCGTMGGCAGGATTTTTATAGAAGGAAGGTAG 

GTCTTTCAACCTATGTCCTCCTTCTGTTCCACAAAGTGGAAAGCCACAAGCCCTACAAAA 

GCCTTGCAAGTCCCAGAGGCTGCAGCCGTATTTATTCTTCAGGCCAAGACTCTCAGGACA 

GAGAGCACCCATGCACCCCGCAGGCTGCAGGCCATCTCCCTGCATTTGGGACTGTCCTGA 
GGATGGCGGCTTCATTTTTGTCCCTCCTACCTCTGA
Sequence 817 

GMCCTAGGGCGATTTGGAGCTACCCNCGGTGGCGGCCGAGGTACATTTTGGCAAACCGT 

GAAGGGCTTTCNTTTTNGCAGGTTGGACTTCCCCCCCCTAGTNGGCAGG A I I I 1 1 I I I AG 

GGGACCACCTGAGAMGGTCTGTTACCGTGCATMACCTCCTTTMCACCTTTTAAAAAC 

TCTTCTGGGGGCCGGACTCAGTGGCTCATGCCTGTAATCCCACCACTTTGGGAGGCTGAG 

GCAGATGGATCACCTGAAGTCAGGAGTTCAAGACCAGCCTGGCCAACATGGTGAAACCCC 

GTCTCTACTAAAAATAGAAAAATTAGCCAGGAGTGGTGGCAGGTCCCTGTAATCCCAACT 

ACTTGGGAGGCTGAGGCAGGAGAATTGCTTGAACCCAGGAGGC 

Sequence 818 

GCCAGGAAACCCGTAAAAAGGGCCCNGTTTGTTGGCGG I I I I I I I CCATAAGGGTTTCCG 
CCCCCCTTGACCGAGGCANTTAACAAAAAATNGACNGCTTCAANGTCAGAAGGTGGGC 
Sequence 819 

CCCCCCGGTGGCGGCCCGCCGGGCAGGTACTGGGAAATGAGGCAAAAGTNTNTCTCTTCA 
CTGCTAGCTCCTTGGGGACCAGCAAGCGGCTCTCAAGTTGCGTGGTGGCCCAACTGGAAA 
AAAGGCAGTTCGGTGCATCCTGGGAATATCCAGGTGAAAGTGTGAGATTTACCTAGAATA 
GCTTCTGGGCCTCTGGGGTTTTCACGCTGTCTCTGGTGMGGTGTCCATTTTAGAACTGA 
AGCAAAAAGGTTTCMTCCGTTCCGTTTTCTTTGTTTTAGCACTTACCCCAGNNNCCTCC 
ATAACAAAGGTGGNGCCCCTTCAGGGAAATTAAA I I I I I I ! I I CNTAAAGGCCTTGGCAT 
TAANCCN II 1 1 I I I I GNGGGNNGGGNAAAN I MINIUM 
Sequence 820 

TAGGGCGNTTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATCACCCTGCTGAGGGACAT 

CCAGGNCAAGGTCACCACACTCTACAAAGGCAGTCAACTACATGACGCATTCCGCTTCTG 

TCTGGTCACCAACTTGACGATGGACTCCGTGTTGGTCACTGTCAAGGCATTGTTCTCCTC 

CMTTTGGACCCCAGCCTGGTGGAGCAAGTCTTTCTAGATAAGACCCTGAATGCCTCATT 

CCATTGGCTGGGCTCCACCTACCAGTTGGTGGACATCCATGTGACAGAAATGGAGTCATC 

AGmATCAACCAACMGCAGTTCCAGCACCCAGCACTTTTTACCTGAATTTTAC 

Sequence 821 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGCATGCAAAACTCCAGATTCCTATCTTC 

TTTGGGGGAAAAGCAAATTGGAAGCTCTGACAATGCTGGGCTTTACTTTCCCACATAGCA 

ACCATCAGTTGGAGCTGAGACACCTCTGCTCTCTTTAGAAAGAATTATTAATGCTTCAGT 

CTCCATTATTGCTTCCCTAACAGTGAGGATAAGTTATTGGCATCAANCCTGGCCGGTTTA 

NCTTGGGGGTTTATTTTNTNNNTTTGGGGCCTNAAAACCCCGGGGGGNNCCTrTTTGGCN 

CNGNGGGGGGGGGGAANTNTNNNANNANGGNNGGGGGGGGGTTTNTCTCNNCCCCCCCCA 

Sequence 822 

CCGGCAGGTACGCGGGGAGGTCATGCCCGTGTGAGCCAGGAAAGGGCTGTGTTTATGGGA 

AGCCAGTAACACTGTGGCCTACTATCTCTTCCGTGGTGCCATCTACATTTTTGGGACTCG 

GGAATTATGAGGTAGAGGTGGAGGCGGAGCCGGATGTCAGAGGTCCTGAAATAGTCACCA 

TGGGGGAMATGATCCGCCTGCTGTTGAAGCCCCCTTCTCATTCCGATCGCTTTTTGGCC 

TTGATGATTTGAAAATAAGTCCTGTTGCACCANATGCAGATGCTGNTGCTGCACAGANCC 

TGTCACTGCTGCCATTGAAGTTTTTTTCCMTCATCGTCATTGGGATCATTGCATTGATA 

TTAGCACTGGCCATTGGTCTGGGCATNCACTTTCGACTGCTCAGGGAAGTACCTCGGCCG 

CT 

Sequence 823 

ACACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGTGTT 

AGCTATTATCATCACCCTCCTTGCTAGGCAGAGCAGGACAGTGGGGAATTGATGTTTCCT 

CCCCTCCATCTCACAGGTGGGGCAGGGGTGTGCTGAGAAGAGAACTTGGGACTCTTGGCC 

CCTGTTCAATTCTCTGCTTMCCTGCTAGGCAATTTGGGCCTCTGAAAATTCAGTAATCC 

TCATAGCAACTTAGACGTCACCTGGGCCTGTGGTCCCCTTCCTAGCCTAGGAGTCAGAGC 

ATGAAGCTCCATCTGTCACATTGGTTTGTTCAGAGMCTACACATGCGTTTTATTTTAGC 
AGCATACAGGTTCCCACTTAGGCATTGAGAGGACATAGGAAGCTGTTTAACTTCCTA
Sequence 824 

ATCACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTAGTTATTTTA 

AATTCCACTCATAACTTATCGGCCAAAAGTAGTCACATGGCTCCACCTAATCACAAGTGG 

AGCGGGAAGTGCAATCCTACCTTGCCTGGGGAAGGTATAGAGATAGACCAGCACTAATGA

CTACCACACTTCGCTAAGGTCACATAATAAATAAGCATCAGACATCAGGTGTGGTGGCTC 

ATGTCTATAATCCCAGCACTTTGGGAGGCTGAGGCGGGCAGATCACTTGACTACAGGAGT 

TGGAGATCAGCCCGGACAACATAGTGAAACACGTCTCTACTAAAAACACACGCAAAAAAA 

TACGAGGCATGGTGGTGCATGCCTGTAATCCCAGTTACCTGAGAGGCTGAGGCACGAGAA 

TCACCCTTGAACCCAGGAGGCAGAGGTTGCAGTGACCGATATCATGTCACTGCAGTCCAG

CCTGGGTGACAGAGCGAGACCTTGTCTNAAAAAAAAAAAAAAGAAA 

Sequence 825 

CCGCGGTGGCGGCCGAGGTACAGATGTATGGATCTCATAGCATTGAGGGGTCTTTCAGAT 

TATGTTTTCAAACCCCTCACTTTCTCTTTTCAGATAAGACCACAGCGACCTGGGAAAGTG 

CAACGTCTTAGCCAAAGACACAGAACTATTTAGCGACACTGTCTAGACTCTAGTTTCCAT 

GTCTCCTGACTTCAGTCTAGTGTTCCACCCCTGCCGCCCACCCCTGCCCCATCCTCATTC 

CTCCTGTAGGAGAGGCCAGACCTTTGCCTGCTGCAGCTTGTGGCTCTTCTCCTGCCTTCA 

GTTNTTCCATTGCCTG 

Sequence 826 

GGGNNAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGCAGGTACCTGTCTGGGCAACACT 

GTCCCGNNGGGGCCCCCATGACCAAACTAACTCTGCTTCTACCCAGAAAGGGTGCAGAGT 

GGCCACTAGACTTTTATGTGGCAAATGGGATGGTTATGCCCAGCCTGAAGCCAAGATGCC 

CTTTCTGGTTGCCTTGATTTGTGTTTAACAGCTCCAAATGCTTAATGAGGCAGTAAGAGA 

CGTCTCTCTTGGGCAGTACCT 

Sequence 827 

CNAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGGTGGTGTGGCTACTACCGTTACA 

ACTGCCTGTGCTTGGACATGGACCCTCTGCAATATGCGGCAGTTTCATTCATTGCCCCCT 

ACATTCTACACCAAGTAGAAATGGAAGGCAATTGGATACTTCACAGACAAGATCTAAGTG 

GAGAAGGAATGCGTCCTGTGGCTGCAGAGATCCTTGGAGCTTGGAGGGGAGAGCTTGAGC 

CCCACTGATGATGACCTCCCACAGCTCGCCAACTCAGCCCTCCCTAAGTCCCCATCGGGG 

GCCAATTCTCACTCTGGGGTTGGGGGGACTCCACCATAGCTCATCCATCATAGGGGATGT 

TGGTATCTACTGTGGGTTTGGGTAGGGCCCGATGTGCTGAGGATGGCTCCCCCACAAGCA 

AGAGATGTGGGATTTGG 

Sequence 828 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATCACCCTGCTGAGGGACA 

TCCAGGACAAGGTCACCACACTCTACAAAGGCAGTCAACTACATGACACATTCCGCTTCT 

GCCTGGTCACCAACTTGACGATGGACTCCGTGTTGGTCACTGTCAAGGCATTGTTCTCCT 

CCAATTTGGACCCCAGCCTGGTGGAGCANGTCTTTCNAGATANGACCCTGAATGCCTNAT 

TCCATTGGGCTGGGGCTTCCACCTACCAAGTTGGGTGGGACATCCATGTGACAGAAATGG 

AGTCATCAGTTTATCAACCAACAAGCAGCTCCAGCACCCAGCACTTNTACCTGAATTTTA 

CCATCACCAACCTACCATATTCCCAGGACAAAGCCCAGCCCGGCCCCCCCC 

Sequence 829 

CGAATTGGAGCTCCCCGCGGNGGCGGCCGAGGTACCTGATCTACTCCTCTCTACAACAAC 

CTTGTGGGTGACGTTATTATCTCCATTTCACAAATGAGGCCACAGAGGTTCTAAAGGGTA

AATGACGATGATGATGAGAGGTAAGTGATAAAACAATGTCTCCTGACCACAAATCCTGGA 

ATTTAAACATAAGNGTAGTAAACATGAACTCTAGGAAGCCTCCTGGGGCTTCTNCCTGTG 

TCTGGAGCCCCTGCACATGCCCAAAGGAAGTCCTTTTGGTTCTNCGNTCAGNAGAGAAAG 

GGNGCATTTCACTAAAAGGGAGGTGGGGAAACAAGACTGGTGGTAGGG 

Sequence 830 

CCGCGGTGGCGGCCGAGGTACATTATTCATATCCAGCACTCCCTGCGGCTGCTGCTGGAG 
GAGCAGTTATCCAACAAGGACTGTTTCAACCTCATCGCGTTTGGAAGCACAATTGAAAGC 
TGGAGGCCTGAGATGGTTCCCGTGAGTCACMCMTTTACAAAGTGCCTGGCGGTAGGTT

ATGGGCAGAGACTTCGTGGGGCTGTGTCTGAGGGMGGTTTGCAGGCATTGTTTTCTCTG 

TCCCCCTCTCCACCAAGAAGTAGCTCTCTAGAGTCCCTGACCCCAAACAGCCATGGGCAG 

AAATCAGAAAACAGCTTCCTTCTGTCTGCTGCTCTCCCCACCTGGCCATCTTCACTTTAT 

GAGAGTAATGACATCGACTCCATTCACGTCTGAGATGGAAAAGGCTCTCAGCTACTCCCA 

AAAGGTATGCCCTGGGCATGG 

Sequence 831 

CCGCGGTGGCGGCCGAGGTACGCGGGTAACAGGAGTCTTTGCTGAGTGATCATCTGTTTA 

TTCTTTTACTCCACAAATATCGAATGTTTACAGCGTGCCTGGCACTGAGCAGGGCTGGGG 

TTTCCTGACCATATGGACCTTCCTGGGTATATCTGTGGGGCTTGAATGGTGTGTGACCTT 

GTGTCCTGCCCG 

Sequence 832 

CGGGCAGGTNCGCGGGGGTGTTTATGGGAAGCCAGTAACACTGTGGCCTACTATCTCTTC

CGTGGTGCCATCTACATTTTTGGGACTCGGGAATTATGAGGTAGAGGTGGAGGCGGAGCC 

GGATGTCAGAGGTCCTGAAATAGTCACCATGGGGGAAAATGATCCGCCTGCTGTTGAAGC 

CCCCTTCTCATTCCGATCGCTTTTTGGCCTTGATGATTTGAAAATAAGTCCTGTTGCACC 

AGATGCAGATGCTGTTGCTTGCACAGATCCTGTCACTGCTGCCATTGAAAGTTTTTTNCA 

ATCATCGNCATTGGGATCATTGCATTGGATATTAACCCCTGGNCAATNGGCTTGGGCATT 

CAATTTGACTTGNTAAGGGAAGTNCCTCGGCCGNTNTANAACTAGNGGGATCCCCCGGCT 

GGANGAATTTCAATTTNAACTTATTGATACCGTCCANCCTTGNGGGGGG 

Sequence 833 

ACCGCNGTGGCGGCCGCCCGGGCAGGTACATCACCCTGCTGAGGGACTTTTNNGGACAAG 

GTCACCACACTCTACAAAGGCAGTCAACTACATGACACATTCCGCTTCTGCCTGGTCACC 

AACTTGACGATGGACTCCGTGTTGGTCACTGTCAAGGCATTGTTCTCCTCCAATTTGGAC 

CCCAGCCTGGTGGAGCAAGTCTTTCTAGATAAGACCCTGAATGCCTCATTCCATTGGCTG 

GGCTCCACCTACCAGTTGGTGGACATCCATGTGACAGAAATGGAGTCATCAGTTTTATCA 

AC 

Sequence 834 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCGCGGGCAGGTACCTTACCACCC 

CATCCCCAGAGCATTGCATGGGGTGTTTGGCACACAGTAGGTGCTCAATGTAAACGTGTG 

CACTGTGGCATGTTAGAGCCAGACAGGATCTCATCCAGCCCGTTCTCTGCACCCCTCCCT 

CCCCTCTCCAAGTAGCCCTGCTGTGGGTTCAAGTAAAGAGGGGCTGGGGCGCTGGTCTGA 

TTGTGTGGGTGATTTGGGGAGATCTCTTCCTCTTCCGGAACCCCAAAAGGTTGGGACAAA 

CACAGCAACMGCCCAGCTCCCTGAATTTCAGTGATTCATTTGTGGGGATAAAGGAGTGA 
ATG 

Sequence 835 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTAGTTATTTTAAATTCCACTCATAACTTATCG 

GCCAAAAGTAGTCACATGGCTCCACCTAATCACAAGTGGAGCGGGAAGTGCAATCCTACC 

TTGCCTGGGGAAGGTATAGAGATAGACCAGCACTAATGACTACCACACTTCGCTAAGGTC 

ACATAATAAATAAGCATCAGACATCAGGTGTGGTGGCTCATGTCTATAATCCCAGCACTT 

TGGGAGGCTGAGGCGGGCAGATCACTTGACTACAGGAGTTGGAGATCAGCCCGGACAACA 

TAGTGAAACACGTCTCTACTAAAAACACACGCAAAAAAATACGAGGCATGGTGGTGCATG 

CCTGTAATCCCAGTTACCTGAGAGGCTGAGGCACGAGAATCACCCTTGAACCCAGGAGGC 

AGAGGTTGCAGTGACCGATATCATGTCACTGCAGTCCAGCCTGGGGTGACAGAGCGAGAC

CTTTGTTTCAAAAAAAAAAGAAG 

Sequence 836 

GNGGNGGCGGCCGAGGTACTTTAACANGCCATACTCCAGTCCCAACAATGTTAAATGCCA 

AAGCAGTGTTGGTAAAAGCCTCAAATGGTGAAAAGGACAGAAACTCAAACCCGCCCTTGT 

GCCAGTAAGTAACTGTTACTTATCTCACAAAGCGCTTGGCTCTGGAAACAATCTAACTCT 

GAGCTGCACGTGGAGTCTACATGGGAATGTGCAAAGCATGTATTTCCTTTTAGGTGCAGC 

AGAGGTAACCGAAATTCAGATAAGAGAAAAAAAATCCAGATTTCAATGCAAGAGGTGGAA 
GATCCACGMGATACTCGTTACTATTTGGTTTCTAGGAGCAGGATTGCCACTAGATATGA

TGGAGAACAAAAATGAAGAGGTGTTGTGTAACAAAACAAAACAAAACAAAACAAAAAAGT 

AGAAAGAAAGAGCMCAGGCCGGGCCGCAGTANCTTCATGCCTGTAATCCCAGCACTTTT 

GGGGAGGCCCAG 

Sequence 837 

NTTGCGTTGCGCTNACTGNCCCGCTTTCCAGTNCGGNGTAAACCTGTCGTGCCCAGCCTG 
CATTMTGMATCGGCNCMCGCGCGGGTGAGAGGCCGGTTTGCGTATTTGGGCCGCTCT 
TCCCGCTTTCTTCGCTCACCTGACTCCGCTGCGCCTCGGGTCGT 
Sequence 838 

CGCGGTGGCGGCCGAGGTACTAGGCACTGAAGATACAATAAGCAATCCAGNAATNCCTCT 

TTGMGMTTTATTCTGATGAAATAGAGACAAGCCTATTAAAGTATTCAAGGCAACATTA

CTTAAATATTTTATTTTATTTTATTTAT 

CTGAAGTG 

Sequence 839 

AGGTACCTGTAGTCCCANTTACTTTGGAGGCTAAGGTGGGAGGATGGCTTGTGCCCAGGA 

GGCGGAGGTCACAGTGAGCCGAGATCACACCACTGCACTCCAGCCTGGGCAATAGAGCCA 

GGCCTTGTCTCATAAATGAAATATAAAATATAAAATAAAATAAAATAAAACGTTGTTGGC 

AAAGATGTATTCAAAATAGTATGTAGMCTTACCTGGMGGTMTTTTGCAATATTTACC

AA AATATAC I I \ I ICATCTAGTAAGGTCTCCTTCTCAGAATCAGTCTTAAAGCCATTCAA 

ATACAGATTTATGATTAAAGATTTAAGTCCAAMATGCTCATTATAGCAATATTTATAAT 

AGGAAAAATTGGGGAAAAACAATTATACATCCAACAGTAGTAAGAGTGTGACTACATTAT 

AGTATTAGTATGTAATGGGATTGTCACAGAGCMCACACATGTTTTTGAAGAATATTTAA 

GGGCGTGATAAATATTAAATGTAAATGTAMTTGAAAAAATGATATCTGTAGATTTTCAT 

TATGCATTTCTTTATGAAATTTTTNGATATACACAAMNAAAATAGTCATGCATTTGCTT 

CATGACGGGGACATATTTCTGAGAAATGTGCTGTTAGTCGGTTTC 

Sequence 840 

GGCGGCCGCCGGGCAGGNACANCTTCCGTGGGGGGGCNNAAAACCCCCCACCNAAANCAA 

AANAGCANCAAAGGAAGAATTNNTTAAGGGCAGGGGGGGGAAGCCCCNAAAAACCNGANG 

GCMANNCCAGAGCNGNGGNNNAGCNCNAGNNNCNGGGAAGAGCAANCANNACGCTTTTG

TGAGGTTTTNGGNGNNGGAGGGGGGGAACCCCCCCGANAAAAAACCCAGNAGCCCCCCCA 

CNNAACAGGGGANGAANAAGAGGAAGGNAAAGGANNNAANNAAGAGCCACAGNCCNGGAA 

ACANGAAANCANNANNNCAGACANGCCNNNGAANANGNNGCCNACNNNAAAAGAAGGNNN 

GGNGCGCCANNGGGGAGNAAANNGGAANNANNAAAAAAAAAAAGGAAAANGNGAANGAAN 

GAGGANAANANNNGGGGGAGGGNAAANGGGGCGGGGGNNNAANGCCNAANAANANNNAGN 

NNNNGGGGGAGGGCCNG 

Sequence 841 

AGGTACCTTACCACCCCATCCCCAGAGCATTGCATGGGGTGTTTGGCACACAGTAGGTGC 

TCAATGTAAACGTGTGCACTGTGGCATGTTAGAGCCAGACAGGATCTCATCCAGCCCGTT 

CTCTGCACCCCTCCCTCCCCTCTCCAAGTAGCCCTGCTGTGGGTTCAAGTAAAGAGGGGC 

TGGGGCGCTGGTCTGATTGTGTGGGTGATTTGGGGAGATCTCTTCCTCTTCCGGAACCCC 

MMGGTTGGGACAAACACAGCAACAAGCCCAGCTCCCTGAATTTCAGTGATTCATTTGT 

GGGATAAAGGAGTGAATGATAAAGTGAAGGACGACTGTCCCCGCG 

Sequence 842 

TAGGGCGAATTGGANCTCCCCGCGGTGGCGGCCGAGGTACTCCCAAAAGGCTATGAAATN 

GGGGAAAACCCCAGGTGATTCATGCCTGCTTAGCTGCAGNATNTCAGTNGCANTAGGTGG 

AACCCCAAACCCAGNGCANAGTGCCAGNGTCTGCTTNGGTGAGATATGAGTGTCAAGTCT 

CGAACCAAGCAACCTATCNAAAGCCTGNGACACTCCTGGCCACAGGCGGNTGGTANAGGC 

ATAGNANACTATTGCCCAGGTGACGTGACTTCACAGATGCTGGGAAGCCTGCTGCCCCAT 

CCAATACAATACTGCCCACTGTGCATAGAMCCAGATTCCAAAGTTAGAGCTTCGTTTTG 

GCCATGAGTGCMTTTCACTGCAATGTTTTATCTTACTCAACTGCCAGGGTCAATTTAGG 

TGGTAGGGCTAAATCTCCTTCTTTATATTGGTCCMATGATTTTTCCTGATGCTGCATTC 
CCGGA
Sequence 843 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTGTGCTTAGACCAGGAACACAGGGAGGTAGAG 

GGCAGCAGAGCAGGGACTGGCTTCAGAGCCAGACAGGTGGCTATGTGACTTAATGTGTCT 

GAACCCTGGTATCCTAGTCTATTAAATGGTATAACAGCAGCTTCTAGTATGTAAGTTCCT 

TGTCGGGAGMAMCTGTTTTGCTCATGGCTGGAGCCTTAGCATGTTGCATCATATTGAA 

CATGTMTAGATGCTCMTAMTATATTTTTAAGAATAAATAAATGTAAATGAAAATTAC 

TTCACAGTGTTTCTGTAGAGATTTTATAAGATATGGTATACACAATGCATAACATAGGAA 

CTGACGCTCAAAMTGCCAGTTACTTCCATCATTGNGTCATAGGCTTTTATGTTCATTAT 

CCTGCTGCATCATCCCAAAGAA 

Sequence 844 

CCGCGGTGGCGGCCGAGGTACGCGGGGAGGTCATGCCCGTGTGAGCCAGGAAAGGGCTGT 

GTTTATGGGAAGCCAGTAACACTGTGGCCTACTATCTCTTCCGNGGTGCCATCTACATTT 

TTGGGACTCGGGAATTATGAGGTAGAGGTGGAGGCGGAGCCGGATGTCAGAGGTCCTGAA 

NTAGTCACCATGGGGGAAAATGATCCGCCTGCTGTTGAAGCCCCCTTCTCATTCCGATCG

CTTTTTGGCCTTGATGATTTGAAAATAAGTCCTGTTGCACCAGATGCANATGCTGTTGCT 

GCACAGATCCTGTCACTGCTGCCATTGAA 

Sequence 845 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTTCTMCCCTMGGGATTCCTACAGCTTTTCT 
GCATGTTMATAGTCTGTTTTAGCTTATTCTCTTATTACTTC^ 

MGTTTGCTTMTAATCATGGGAATATTTTAGATTTTAAAATACAAAATATACAAGCTAA

ACTTGAGAGCAGTTTTTAGTTGTAGAAACTGTTTCTTGMGTAATTGACTTAGCGTTTGC

TCTGCCTCTTTCTTTTCTTACCTAGGTAGGTAGTGGGGACTCCTTCAATTATCTGAGCAA 

TTCAAATCTCAGAATGTAGTGTTGGGTAAATTGAGGGTT 

Sequence 846 

CCGCGGTGGCGGCCGAGGTACATCACCCTGCTGAGGGACATCCAGGACAAGGTCACCACA 

CTCTACAAAGGCAGTCAACTACATGACACATTCCGCTTCTGCCTGGTCACCAACTTGACG 

ATGGACTCCGTGTTGGTCACTGTCAAGGCATTGTTCTCCTCCAATTTGGACCCCAGCCTG 

GTGGAGCAAGTCTTTCTAGATAAGACCCTGAATGCCTCATTCCATTGGCTGGGCTCCACC 

TACCAGTTGGNGGACATCCATGTGACAGAAATGGAGTCATCAGTTTATCAACCAACAAGC 

AGCTCCAGCACCCAGCACTTCTACCTGAATTTCACCA 

Sequence 847 

TGGAGGTCNCCGCGGTGGCGGCCGNGGTACTCCAAGCAGTCCCAAAGTGGGAGTNCTTAA 

AACACCATGGGCAGGTGAATGGCTGACCAGGTGGAGGTGCACAGTGCACCATGACAAGAG 

CAGTGGAAAATGGGTGAATCTGAGATGCCTGGAGGCGAGGGGGAAAGAGCACATCACAGA 

GGACAACGTCCANNGGGACACCCTTTTATA 

Sequence 848 

CCGCGGTGGCGGCTGTGGACTGAAGGGTGACTGGTTCCACTGTGGTCTCCATGGGAACAA 

GTTGTTTCTGGAGTCTTCCAAGGAGAATTTCTCACAGTGGACCTGATCTCTGGGCTGATG 

CTGGGTTCCTTGGAGCTCATGATTTTGAGAGTGGTAGACATTTCTGGGCTTCCTGGGGAT 

GTGCCTGCTGGACTGCTCCCCGTCTCCTCTGCTGGGGCAGGCCACGTGGAATTTCCTTGT 

GCTGCCTGGCTTGACATCTTTA 

Sequence 849 

CCGCGGTGGCGGCCGAGGTACCTGAAGAATCTCTCTTCAGCTCTCTTCTCCTGGAAACTT 

GAGTGGGGCAGGAGGAAAAGCGGAGCTAGGTGTCATTTTAATGAGGAACATACTTGTCTC 

CTCCATTTATCTGGCCCTCCCTGATGGCACTCCAGAATTCCAATCCCACACGATTAACAA 

CATAGTTTCCCTTTTCTGCTTGAAGGTCCATTCTCCTCTCAATTTCAAATCACCTGAGAT

ACAAAGCTGCATTTCCCCACAAGAACCAGTTCCCTCTCCTTTCCTTCAGTGCTACTGTCC 

TTCTCTCAGACCACCAAGCTTAAAAACTCCAGAGGCTCAAACAGCAAAGATGGCAGCCCG 

CTCCTCCCTCTGGGGAGTTCTGGCCCAGGGAGTTTTCAAATTTCTGTAGGCGGMGAATA 

CTAGCGGGGAGTGGCTGGAGACCCCAGTTGGTAGGGNTCCACATTTGGGGGAAGTGAGCC 
CAAGCTTTTTN
Sequence 850 

CCGGGCAGGTACATGAAAGTAAGATCACAACCACAGGAACCACACAAAATTCAAGGCACC 

AGAGGAGCCCAGACTTGGCTGGCMTGCCTGTTTTGGAGCTATTCCACATTTCTGGAAGT 

CAATGGGAATACCGGAATATGAAAACACTATGAGGCCGGGCACAGTGGCTCACGCCTGTA 

ATCCCAGCACTTTGGGAGGCCGAGGCGGGCGGATCATGAGGTCAGGAGTTCGAGACTAGC 

CTGGCCAACATAGTGAAACCCCATCTTTAATAAAAATACAAAAAATTAGCCGGGCGTGGT 

GGGGGGTGCCTGTAATCCCAGCTACTCCGGCGGCTGAGGCAGGAGAATTGCTTGTACCTC 
GGC 

Sequence 851 

CCGGGCAGGTACTTTTTTCTTTTTTC I MINIMI! I GAGTGGGGGCGGGGTTTCGCCA 

TGTTGGCCAGGCTGGTCTTGAATCTCGGGTGATCTGCCCGCCTCGGCCTCCCAGGGTGCT 

GGGATTGCAGGCGTGAGCCACCACGCCCGGCCTCGATATATTCTTACAGTGGAATACTGC 

TCAGAAATACTGATGAATCTTAAAAAACATGATGTTTAGCAAAAGAACCTTGGTATAAGG 

TTCTTGGTATAAGGGATACATAGTCTATGATTCCATTATATGAAATTCTAGAACAGGAAA 

AACTATAGTGAAAAACAATCAGATTAGTGGTATCTGGGGTAGAAAGTAGGAGGAGATTGA 
TT 

Sequence 852 

CN AN AGGGC I I I I I GGGGGCAAAACCGCGGNGGCGGCCGCNCNAGAACNAGNGGANCCCT 

NTTGGGGGGGGAAAAAAACCCCAAGCCCACCGANACCGNCGACCNCGAGGGGGGTNCCGG 

NACCCAGNGNNNGNCCCCTAAANAGAGGGNNAANNGCGCGCNNGGCGNAANCANGGNCAN 

AGCNNGNNNCCNGNGNGMANNGNANCCGCNCACAATTTNTCTTTTNTAGNCGAGCCGGG 

AGCAGAAAGCCCNAAGAAAAAAGN 

Sequence 853 

AGGTACCCACAGCCCTTTCTTTTGGAATTCCCTAGAAGGGGTCTGTGCCACATACAGGAA 

GTAGGGAGGGTGTCTTTGCAGCATATTTCTTCTCTTGGAGTTAACTGCGAACGTTGCACG 

GCGACCTCTTGATCCATTCTGTGAAAGCCCCAAGCCTGTCATGCAATAAAGACGTCCAGT 

TTCACCGCAGCAGGGAGGCCGCATGAMTATTCCACTTGAACAAAACCACTTAGCAGTTT 

ACATCAATGCTTACCCTGTCGCATTGAAAGTGATGTGAACCCACACCCAAGAGCCCCCAA 

ACCAGCACGTTGATACCAAGTTTCCCCAGCTGCATCCAAATCAATTCCTTCTT 

Sequence 854 

CNTTTGGGGGCTAANCCGCGGGGGCAGCANCNNCACCGGCAGCGCGCNATTCTTTGGGGG 

GGTGNAAAACCCCCAGACCCCCGAGCNCNNCCACTCGNCNCACCANTNTAGAGACTGGGG 

CCGNTCCNCNnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnl M M M \ GGGGTTTTNAGNGAGGGGAANCG 

CNCGCAAAAAAANAATCNCGGCCANAGGGGGGCCNGGGNGAAANTGTTANCCGNGCACAA 

NNNCACACAANANACGAGCCGGGAGCAAAAAGAGGAAAAGCCNGG 

Sequence 855 

CCGGGCAGGTACGCGGGCTACACACACAGTTCGGATGCCAAGGGTGACACCCCATTCCCT 

TCACAAGAGGCGGTTCTGTCACAAATCAGCACTCCACCCCCACCACACCTCTCAGTGAAT 

GAAGTGCTGGTGGTCTCACTCCCCTGGTGACCCTTAGCCGTGGGATGGGGTGGTTACACT 

AAGGCTTCAAGCTGAGAATGGCCATCATGGCGGGAGGCTGTTTGCAAAGGCACCTTCTGT 

CATCCTGGGGTTGGCTAAGTCAACTCCACCCCTTCCCAAAAAAAAAAAAAAAAAGTACCT 

Sequence 856 

GCCAGGATTCAAACCAGGGANTTTGCTCCAGCACTCCGGCTCTTAACCTCAACCGTCTGC 

CTCTCCACAAACACCAGGATCAACCACCAAGACCAAAAAAACAGTCTCACAAACCATCAA 

ACATTGCACTTGGTGGCTCAGGACCTTAGCTTCGTCTTAAAGGTCCCTGTTATGCTTTTT

CTTTTTGCCCCAGTGTGGAGTGGTCTTCGTGTTTGTGAGTGCAGGGGTCAGGGGTTGTGT

CTTTTCTTCTTGTNCCCTTCCAAGAGGTGACATGTATCCTTGATACTGGAAGGGCCCTT 
Sequence 857 
AGG TACGC GGGCACTCCAGCCTAGGCAACAGAGCCAGATTC I 1 1 1 1 I 1 1 1 I 1 1 1 I I A AAA

AGTCTTTTAAMAATTCTTTTATTGTGCTGATTTTATTGTGTCATGAAGTGTAATATCGC 

ATGTAGGTATGTGTCAAGTATACAGAGTGTCAGGCATACGGTGTTCAGTCATAAGCAGTT 

CTGGCCTTTGGCCCTGCACTGTTTGTTGGCTTTTTAGGTAGGAACCTTCTTAGAGTAAGA 

CTGTCATGCTAAAATTGTAGCAATCAAATGTGCCCCCCATACAACTCATTTGAGGTTGAG 

ATTATGTTGCTAGAGTGGAGGAGATTGGAGTGTTCTAAATGCTAACAGTTTGTCTTGCCT 

Sequence 858 

GGCGGCCGCCCGAGAACGAGGGNAGCCTTTGGGNGGGGGGAAAAAACCCAGnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnAGGNACCCANCGCAGAGNCCCCAAAAGCGAGG 

GGCAAGNGCGCNCAGGGCGGAAANAGGGCNACAGCCGGGGCCGGAGCNAAACAGGCACCC 

GCCCACANTTNNTTACAACANACNANCCGGGAGCATAAAGCCGCAAANCCAGNGGAGCCA 

AAGGAGGGAGCGAACCCACAG 

Sequence 859 

GGCAGGTACAACCTCCATCCCCTGGGCTCAAGCGATCCTCCCATTTCAGCTCCCCCTGTA 

GCTGGGACTACAGACACACACCACCGTGGCTGGCTAATTTTTGTATTTTTTGTAGAGGCA

GGGTTTTGCCATGTTGCCCAGGCTGGTCTTGAACTCCTGAGCTCAAGTGATCTGCCTGCC 

TCGGCCTCCCAAAGTGCTGGCATTACAGGCATGAGCCACCATGCCTGGCTGGGACATTAT 

TCAAATTGAAGTGAGGACATGATGTTAAAAAGTTCTGGGCAAGTATTTTACAAGTTAAAA 

TACAGATGTAAGACTTGACTTGATCAAATGCCCAGCTCTGTAATTTCAGCTAAATTG 
Sequence 860 

AGGTACAACATTGTGAAAATTTTCCTATCATTTCCTCTAAAACTCTGCAGGCATGGAGGC 

TGTCTGCTGAGATATAGCAGGGMCMTTTAACCCTTTGTTTTGCACCCACAAGATGAGC 

ATTACCAAATTCCCAACAGAGACATGCTGGGGCTGTGCTGTGCCtCCACCCTGCCCCCTC 

CACAGCCAGCTCCGCGTGTCTCAGTCTGTAACTGATAGCATTCCACTTCCTGCCTGTACC 

TCCCCGGC 

Sequence 861 

CNGCGGCGGCCGAGGNACACNNANAANAACCTTTTAAGGGGNGGAAAAACCCAANCCCCC 

CGNCCIMACCNCAGNGACNNGANGNATNTACNGGAAACAGGGCGCAGCCCNAGGAAGGACA 

GNNGAAGNCCNNACNGNGCAAGNCNAAANNNNAAGGAAAANNAGNCCCGCGANGAGNNNC 

CNCANGCTTTNNNTNTGCGGGGACCAGNCAGGGGNCGNCCCCGACANAANCAGGGCNCGC 

CGNCGGNGGNAGAGNCACNNNGCAGGGGNNGGNGAAGCNGCNCCCANCCANGGACCNNGG 

CCGCNCNAGAACNAGNGGANCCCCCGGGCNGCAGGAANNCGANANCAAGCNGANCGANAC 

CGNCGACCNNGAGGGGGGGC 

Sequence 862 

AGGTACI I I I I I I 1 1 I I I 1 1 1 1 1 1 1 1 I 1 1 GGNAGAGATGGGGCTTTACCATGTTGGCCAG 

GCTGATCTCCAGCTCCTGACCTCAAGTGATCCACTCGCCTTGGCCTCCCAAATTGCTGGG 

ATTACAGGCGTGAGCCACCGCACCCGGCCCAGTCTTCATATTTATAAATAAAGTCTTGTG 

GGMCACAGTCACATTTATTCATTTACATTTTGTAAATAGCTGCCTTCAAGAGGTAGAGT 

TGAAAGTCTTTAAGACAAAGATCAGGTGGCTTGCAAAACCTAAAACGTCTGGTCTTTTAC 

CTAAAAAGTTTGCCAACCCCAAAACAAGAATATATTCGATTGTAATCTTTACAAAGGTGT 

TGGCTATATTTCAGTGTGT 

Sequence 863 

NCTATAGGGCNAATTGNAGCTCCCCGCGGTGGCGGCCGAGGACAAATTCAGTCCCAATAC 

TCMTACGTATTATAGATGACTATGAGTGCAAACCTTAGGATGNGATTNTCTGAATAATN 

GNTCTTTGTAGGATTTGGTTACATTATTTAAAATGAAAAAGATCTAGTTTTAGTGTGAGC 

TCAGTMTGNTAATNGGTTMGTTCATTGCGMTCTTGAGTTTTAGATAAGTAGTTATTT

TTTTCAATATCACTTCTGTTTTTAGTGATATTATATCAAGAAACAACGTATTCAAGAACC 

ATGGCTGACAGTGCCAGATATACTTAGGGATAAACATCAAAATGCAATTATAGTTGCTAT 

AACGTTAGATACTCGGAATCAAAATTTATTTGCANGCTGACTTGATAAACTAAATGAA 
Sequence 864 
TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAGAGTCAGCAG

AAATGTGTGCTTTAAGCAGAGTCACAGGGGCCTGGGGCTGAACTGAGTCATTTCTCAAAG 

ATATCCCTGCCTGGGATGATGATGGCTCTAATTGAAGCTCTGGCATCATCTGGGGCTTTA 

TGAGCCAAGGGAGATAAGAAGAGCCACAGCAAAACCCTTGGGTCTACAGTGCAGGCTGCA 

ACCAAGGCAGCATTTGCTAGAATATTTGTGATTATGTGTTCAACCTACAACCT 

Sequence 865 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCATATTAT
ACC I I II I ATTGTTGTTATAATTATTATGGGGTATTTCTAATTAATATGATGTTGAAACC 
TGTTTGGCACCTTCTGGAAGCTACCAAAAAAATGACACTCCATTGAAGTGCTTAAAAGCT 
GTTCTCATAAGAATTCTACTGGCCTATTGTAAAAAAAGAAAAAAAAAAAAGAAAAAGAAG 

AAAGACACAAAGAAAATAATCTAAACACCAAAAACTAAACACAATTCCAATCCTTTTTCT 
GTACCT 

Sequence 866 

ACTTAGGGCNATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACGGAAATCTGGACAGTG 

CTGCACAGATTGATACATTAGCCTTTGCTTTTTCTCTTTCCGGATAACCTTGTAACATAT 

TGAAACCTTTTAAGGATGCCAAGAATGCATTATTCCACAAAAAAACAGCAGACCAACATA 

TAGAGTGTTTAAAATAGCATTTCTGGGCAAATTCAAACTCTTGTGGTTCTAGGACTCACA

TCTGTTTCAGTTTTTCCTCAGTTGTATATTGACCAGTGTTCTTTATTGCAAAAACATATA

CCCGATTTAGCAGTGTCAGCGTATTTTTTCTTCTCATCCTGGAGCGTATTCAAGATCTTC 

CCAATACAAGAAAATTAATAAAAAATTTATATATAGGCAGCAGCAAAAGAGCCATGTTCA 

AAATAAGTCATTATGGGCTCAAATAGAAAGAAGACTTTTAAGTT 

Sequence 867 

CCGCGGTGGCGGCCGAGGTACATAACATGATATCAAGGAAATGCTTGAAACAAACTTTCA 

CAATAAAGTCAGAAAAAAACTGTAAAAATTGTCTGCAATCCAAGAAAAAGCACGTGCCCT 

GTGTGTAGGGGGAAAGAGGGAAAGCACTTGCAGTGTGACTTTATGTGGTCTTTCCCCAAG

TATTGCTACGTTTTGACCTTTGGCCCAACTGAACAGGTGAAATGCCCTTCACATAAGTTT 

CAATCCCCAAGAAACTAGCTGGAATGCAGGGGACTGTAGACACACTCCTGGACCAAATGG 

CATCGACTCTCAGAATCCAAAATGGGCCCTGCCCTCATTCTGAGCTTACGGCCCCAAGCA 

TATTCTAAACAAAG C I 1 1 II I AA 

Sequence 868 

CTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACAGGATATCAC 

CTGMTTATTAATGMTGCCCAGGAAGTAATTTTCTTCTCATTCTTCTAAAACTACTGCC 

TTTCAAAGNGCACACACACCGCGTNCACATACACTGCATTCGTTGCTCCAGTATAAATTA 

CATGCATGAGCACCTTTCTGGCTTTTAAGCCAATATAATGGGCTGCAAAATGAAGACACC 

ANAGTGTATGCATACAAATCTCACTGTATTAAAGATGCAGGTTTTCTAATTGTACCT 

Sequence 869 

GCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTGGCGACGCGCGGGACAAAG 

GGAAGCGAAGCCGGAGCTGCGGGCGCTTTTTCTGCCCGCGGTGTCTCAGATTCATTCTTA 

AGGAACTGAGAACTTAATCTTCCAAAATGTCAAAAAGACCATCTTATGCCCCACCTNCCA 

CCCCAGCTCCTGCANCACAAATGCCCAGCACACCANGGTTTGTGGGATACAATCCATACA 

GTCATNTNGCCTACAACAACTACAGGCTGGGAGGGAACCCGGGCACCAACAGCCGGGTCA 

CGGCATCCTCTGGTATCACGATTCCAAAACCCCCAAAGCCACCAGATAAGCCGCTGATGC 

CCTACATGAGGTACCT 

Sequence 870 

GACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCGGGCAGGTACGCGGGGG 

ACCCGGAAGTTTAAATTCCGCTCCCTCCACGAAAGAGTTGTAGTGAGTGAAAATAAATAT 

TAAAACACACGGAAATGTATTTTCCTGGCTGCAGCACCNGCCATCTTGCCTCGGNAGGAC

TCATTTTNAAAAACAGCAGCTTCTTGAAGCCCCANAACGCATTCCTGTGCTACGG 
Sequence 871 

CCGGGCACGGTACAGAGCCCAAGACAAAAGATAGGCCTGTGAGGATAACATCTGGTATAT 
CTGACCCTTCCCAGCATGGCCAGGAGGCACAGCCAGGCCAGGGAGGGCATACTGGGTTTG 
GCTTTGCCCTGCAGCTGTTGGCCTAGGTGCTGCGGTCATACATATGCCCTNAGGCCTTTC



TCCCCCANGCTGGAGTGCAGNGGCGCCATNTNAGCTNACTTGCAAGCTNCGNCTTCCGGG 
T 

Sequence 872 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCATTCCGGGTCATCCGCA 

GAAATTCCTCATAGATGGTAACTCTGTCTACTCTCCGAGCCAGTGGCGAGAAGTTACACA 

GGGAGTCCACCCCGGTGTGGTGCCTGCTTGNGGACAGACCTGAATGTTGAAACTTGACAG 

TCAGAAAAATAACTCTTGATGCTGCTGTTTCGGAAGAGTTGGTTGAGCGCATCCTCAATA

TTCCTTTTGTTCCTCTGGNAATTGGTGGTGCCTGGCTGGGCTTTGTCCTGGGAATATGGT 

AGGTTGGTGATGGTAGAAATTCAGGTACGAAGTGCTGGGTGCTGGAGCTGCTTGTTGGTT 

GATGAACTGATGACTCC 

Sequence 873 

ACTACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCCGCCCGGGCAGGTACAATGCTCACT 

GGGAACCAAAGTCAGGCATGGGGCTGGGCTTTAAGGAGCACAAACAAAAAGGAGGGACTA 

GAAAACTTCAGAAAGGTATTGGTGGGGGATGTTGCGGGGGGACAGGGGACAGCGAGGATG 

TGGGATCCCGAGATCGTCCAAATCCCTATGTGTAGACATATGTGTATAAAGGCCTTTAAG 

AGACTCAGGCTGATGGGGTATCTGTAATAAATCAAACATAATATAACAGCACGTCAAGTG 

ATAAGGGGACTCTGGAAAAACAAGCAGCAAAAGGAGCAGTATCAAACTCCAGAGAAATTC 

ACAAACATCAAGACACCAAGAAAGCTGCATTNATTTAAATCAAGGTGACAGGCTGGGCTC 

TGTAGCTCCAGCCTGTAATCCTAGCACTTTGGGAGGC 

Sequence 874 

CTACTTAGGGCGAATTG GAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACI 1 1 1 1 1 I I 
i 1 1 i 1 1 i i i I 1 1 1 I 1 1 1 1 1 GGGATGGAGTCTCGCTCTGTCACCCAGGCTGGAGTGCAATG 
GCACAATCTTGGCTCACTGTAACCTACACCTCCCGGGTTAAAGAGATTCTTCTGCCTCAN 
CCTNCTGAGAAGCTGGGACTACCAGGGGATCCCGCCCCACCCCGGGTAGGTTTTTTGTAT
I 1 1 1 I lAGNNAGAAGACAGGGTTTTCCNCCCATATTGGGCCCAGGGCTTAGGTCTCGGAA 
CCTCCTGGACCCTTGNGGATCCTGCCCACCCTTGGGCCTNCCAAAAATGCTGGGGATTAT 
AAGGNGGGGAGCCACTGTGCCCGGGCCMCMTAMTTTTTTTAAGGGTTAGTCAAACCT 
AACAACAAAANTTTAAAAGGTCAATCAGTAGTTCTAAC 1 1 I 1 1 I I 1 1 1 1 
Sequence 875 

ACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAAGACTTTGTAA 

ATGTGATTCAGGGCCCCCAGCACCCCTGTGTCTGCAGAGTGCCTTCAAAACTCAGCTGTT 

CCGGCCGGTGCCAACCTGTGAACTTCCCACCATATCCCAGAATCTGCTATTCCCCAAACC 

ACTTCCCAGTTTCCTTTCAGTAATCTTTCTGAAGGAGCCAGGACAATAGGGCCTGTTGTT

TAGTGMT TTCTTTATT ATTTTCAGCCTTTAAAATGTAATTTCCATCTCTTGCAATGAAT 

^GT TTCCCT TT TTTTTGC TTCATTTTGTTTAMTTTTCAGGTATTTAGCTCCCCTTTCA 

TATTAl I 1 1 1 AAA 1 1 1 I 1 1 AATTACCTGTTGTAGGGGTGTTCCTCCAGAAGCAAAGAGCA 

AAATTTTACTGTTGTGATGTACCT 

Sequence 876 

CTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGACGCGGGATTGA 

TCAAAAGCTTTGTAACCACAGGAAAAAATAAACTCTTCCATCCCTTAAAGAATAGAATAG 

TTTGTCCCTCTCATGGGAATTGGGCTGTATGTATATTGTTCTTCCTCCTTAGAATTTAGA 

GATACAAGAGTTCTACTTAGAACTTTTCATGGACACAATTTCCACAACCTTTCAGATGCT 

GATGTAGAGCTATTGGGAAAGAACTTCCAAACTCAGGAAGTTTGCAGAGAGCAGACAGCT 

AGAGATAACTCGGGA 

Sequence 877 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAGTCTTA 
AACTGCTCTGCTCTTTAAACCAAATACATACACACATACACAGATATAGTTAGATACAGA 
TGTGTGTGCATATAAAAATATGACACTCCTTAGTAAAATATTCCTCTAGACCTGGGGTTC 
ACACATCCCTCCTCCTGATCCGTGCTGGTGCCTACTCAGGCACTACTTTGCAGATTTCTC 



CATGGCTACCTACCTAGAACCCAGATTC 



GAGACGGAGTCTCGCTCTG 
TTCTATGAGCTMGGTTTTTCTGAGCTAAGGTCAAGCGGTGACTTAGCAAGTTGAACGTG

TMTGMCCAAACCTGTTTTTCCATGGAACCAATMTMTTAATCTAGAGTGAGCCATTT 

GGCCTCCAGAAACAAAGAGATTTCCATCACAGAGTGTTGGTGAGGGGTCATGAGTAAGGC 

GGGGGGGCAGTGAGAGCAAGCTGTTTTATTGTGAGAGTAGCAGGCAGGCTGAATGAGAAG 

GGGTAGCTGTT 

Sequence 878 

CTACTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGGTACCATTCC 

GGGTCATCCGCAGAAATTCCTCATAGATGGCAACTCTGTCTACTCTCCGAGCCAGTGGCG 

AGAAGTTACACAGGGAGTCCACCCCGGTGTGGGTGCCTGTTGGGGACAGACCTGAATGTT 

GMACTTGACAGTCAGMMATMCTCTTGATGCTGCTGTTTCGGAAGAGTTGGTTGAGC 

GCATCCTCMTATTCCTTTTGTTCCTCTGGTMTTGGTGGTGCCTGGCTGGGCTTTGTCC 

TGGGAATATGGTAGGTTGGTGATGGTGAAATTCAGGTAGGAAGTGCCTGGGTGCCCCGCG 

TACCTCGGCCGCTCTAGAACTAGTG 

Sequence 879 

CTNCT TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACI I I I I I M M I 1 I M 

I 1 1 I IGGGGAAATMCAGGGGAGAGCAAATTTCTAAAMCTTGGGGTTTTATAGTAATTT 

CTGATTTTCATGTTTAGAAAAAGAMTCACATTAAAAATATGCTTTm 

ATAGGATACACTATAATATTATTGTAGTCCAGAAAATCTGTATACTATAATTCCTAGGGA 

AAAAGAGAAAATTATTAGTGTCAAAATACCTATAATTCCCACAGTTACCATATACATTTT 

TAAAA ATTGT TTAAATACACAAACAATGATGATGCTGTCCTACTAGAAATGACAGGAGCN 

AGAGCTTTTACCTTTCTTTCAAAATGCCTTTAACCCCTTTCATTATTNCCAAGGTTCAM 

ATTTAAANATTC I I 1 1 I I I I 

Sequence 880 

CNCTACTATAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAATGGCGCAATCT 

CAGCTCACTGCAACCTCCACCTCCTGGGTTCAAGCAATTCTCCTGCCTCAGCCTCCTGAG 

TAGCTGGGATTGCAGGCATGTGCCACCATGCTCGGCTAATTTTTTGTATTTTTAGTAGAA 

ACGGGGTTTCGCCATGTTGGCCAGCTGGTCTCCAACTCCTAACCTCAGGTGATCCACCCG 

CCTCGGCCTCCCMAATGCATCTCTGGTCTTTAAATGCCCTTTGCTGTATATTCTATAAC 

ATCMGTCTCAGATCTGGTTTGACCTCAGTTGGCCTCTTAATAGTTTTCCCCTATGAACA 

TTCTGGTCTCCCAGTAAGCCTGTAAGCAGCTGAGACTGGGAAACCATCTCTTATATCCCA 

CATCGTCCCATG 

Sequence 881 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCACACTGGTAAAGAGTGGCAAGGTAGC 

CTTTGTMCCAGATATATCTGATCTCAAAATCAATTTTCTTAATTTAACCCACGTCAGTC 

AGTCAAATGCTAAGGCTCTTCAAGCTACACTTGGTTCCTCCACCCTCTAAAAGGTGAGAA 

CTCMGAGAGCTGGGTTCTTTGGGACCTTATCATATTTTTCCCCTCCCTAGGCCTTGATT 

TCCCATTTGGAMATMATCAGTGAGGGCTTTCTAGTTAAAAATGCCAGTTGAAGCCAGG 

CTTGGTGGCATATACATGTAGTTCCAGTTACTCAGGAGGCTGAAGTGGGGAGGATCGCTT 

GAGCCCAGGAGTCCAGTCCAGGCAACATTGCAAGATCTCATCTCTAAAAACTAAAAAATG 

GACCAG 

Sequence 882 

C7TAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTAATGAATTGAC 

AAATGGAAGAAAATTTAAGGAGAGTAGCCTAGAGAACTCATTCTAGAAACTAAATAACTT 

MGTCMAMTTATTTCCTATATTGCCTCAAGCCCTGCAGATAGCTTTGCTATGTTTGTG 

TATTTGCACATTGCACTCCAGCCTGGGCGACAGAGACTCTGTCTCAAAAAAATAAATGGA 

ACAATCACAACAGAAACATTCCCTTATTCACTGAACATTTCAAACCCTGAAAATGTGTAA 

TGAGAMTGACAAATTTTTAAAAGTTTAATTACTAAAGAAGACAAAAAATGTCTATTATG 

AATAGACCAATTCTCAATTGGTAGAGGAACTTTGAACTGGAAAGGAACCCTAAAGAAATC 

TCCTGTCTACCCCTGTTATTACAGATTAGAACCCGAAAGTCCAG 

Sequence 883 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTATAATTATAATGATTTCAGATAGAACATGCA 
ATTAGCCTTTTGAAATCCAACTTCTGTGCAAAATTTTAGTATCAGAAAATACGAGATTTG

CAGGGGGAMCATCAGTAMCTACCATTMTGTCMTGCCCAGTTTTGACTTTTGTTAGC 

CTGACACTCCCAAACAGTTGTAGAATCCGATAGATGACTGATGGCAAAAGATTGTGAACA 

TGTGGAAGAAAATCAGTGGGATTCGGTGCTGATGAATAGGTTGCCTTCAGAGTATTATTG 

ACAGACAGCTTGTGGAACTMTTCTTTATTTTTGATGTTGTGGGAATTAACACATCMTG 

GTGGTTATGGGAACTACCAATGGGTTCCTACAAT 

Sequence 884 

CTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTAATTCTAAATTATAAG 

AAMTAT ACATT TGCACTTATTMTATAGAAATTCATTTTGTGTATATTTAACATAGCTT 

TTAMCTATTTTACATTAGCTACTTTCATTATGGTTTCTTGAACTTCTGAAAAAAATTAG 

AAATGTATTAAACTTATCAGTAACATAAAAACTTATTTTGTTTCACCTAACGAATACTGC 

GTTTGTAAAMTAAATTTMTATAGAATATATTTTTAAATTAAATATTTGAATATAAAAT

AGCTCTAAGAAAGAAGCAAATTATCACTGAACATATTTCTTATTATTTCTGGCTTTGAAT 

TAATACGTAACTTAAATTGGCTTAAATGATCCAGAATATTGGAGGAATATGATACTTTCA 

CATAATATACTATGAACCTGTTCATATAACTCTGGATTGGCTACCTAACCTTCTGNTTTA 
ATG 

Sequence 885 

CTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTAAGTCACACCCAG 

CCAGTCAATAACTGAGAAATCAAAATAAAATAATAATTTCAAAGAATTACATAAATACAG

GGCCTTTTGAGATTTTTGGCAATTGTAMCAAAMCGAATGGTTTTTACAATTCAGTGTA 

ATTCTACGAATATTTATTTGGCACCCATGTTAGGCACTGAGGCTACACAGCAGTGAAATA 

GGCCTAGTTGTTCTCAACTAGAGAACATAGTTGGTTAATGTAGCTGCACTGAATTGTAAG 

CTGTTTAGAAGATAATATACCCTGAGGCTTTTTAAAGTATACTATTACTATAAGGAAGTA 

AAATTATTTTATACTTATAAATTTTGTTTTGGATTATTCAACTGAATTTGGAGTGTTCAG 

AATTTTATGGGCGGTTGGGGACAAGGAAGAGGTATAATGCTA I 1 1 i I 1 1 I CCTTTTCTTT 

TTT 

Sequence 886 

CTACTATAGGGCA ATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTA CI 1 1 1 I 1 1 1 

1 1 1 1 1 1 1 1 I 1 1 I 1 1 AAATTTCCACCGGTGCCGAGGCCTCAGTGGAGCCTGGCTGGCGGCT 

TGTTAGAGCCTGCAGCCTACCTGTCCTGCATAGGAATGAAGCCGGGAGGAGTTACATGAT 

ATGCCCTCGTTGCAGGCCGGGGACACAGCTACCGCATTCAGAGACCAGGAAACAGAGCAA 

AAGCTGTTCTCANAGTGCGGCTGAGCGAGGAGCTACAGGGGAATGGNGGGGGCCAAGCTG 

CATGGAAGATTGTCCCATTAACCTGGCTTTTTACCAGGGTGGTCCTNTCCCTAACCCCTA 

AGAATCACACCCTGCATCCAAACGGCAGCAACCCCAAAA 

Sequence 887 

CNCTACTATAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTA CI 1 1 1 I I 1 1 1 1 I I I 



I I H II I I I I TMAG CTGGGATATCTTACAGAGGAAGGAAAMTTMCCTTTTTTACTTT 

CTTTCTCACTTTTTAAATCAGCCAAAGTCAAAGCCCGTTTGCCAACCTGCATGTCCATGC 

CTGTAAGCCCTTCTNTTGGCCAAGGAAGAAAGGAAGAAAGAAAAAAGAAACCCAGGGGCC 

TGTATCCCCTGATTAAACACAGCACAGCACTCCAGGCAGACATGCCCGGNGGCGGCTCCT 

TTGCACCATTGACCTCAGGCCAGACACCTCAGCGCCAACAATGGGACCTCGGCCTTCCGG 

CTAGGTTTGCCCCAGGCTGGGCAGGAAACCAGCTCGGCCGCTNTAGAACTAGGTG 
Sequence 888 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGAGAGACATTGTGGCTAGCCAACCACA 

TGGTCAGCCTCAAAGTTGAGAGGCTCAGTAACCCTCCTATCCCTAGAGAATTCCAAAGTG 

TGGATGTAATTTAACCTAGGAAAGCCATTGGTGACTATCTGTGATCCTCTGGAAAGTATG 

CTATGTTGGGGTATATCTTTGCATCCAAAGCCAGAGGGGAACCACAATGGCCTAGTAAAA 

CCGGTGGGTCCTCAAATGCCCACTTAAGCCTCTGGCCTNTTGGAANTTTGACCCATAGTG 

GGCCGTTCAGCTTGATTAGAGCCGGGGAAAGAAAGAAAATATTGNCAI 1 1 1 1 I I I NTTGA 

AAAAAAATTT 

Sequence 889 
CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTGGATCTATGCTGC

TATGGGTGGAGAATCGACATCCTTTGAAACTGGCCACAGGCAGAGCTAAGAGGATGACTA 

AAAGGTCCCTTGGGTGGGTGCTAATGAGCAGGGGCCCAGGAAAACCTCTGTCTTCCCGGA

GAGCCCTCTTGCATGAGTTTCGGCTTTGCCAAGATTCCAGGGAC7TGAGGACAGCTATTG 

AGTTATGGTTACGTGACTGCCACATTGGGGCTTGGAGGCATCTGGCAGATGGTTGGGAAT 

GGGCTGGCACCACACTAATTAGGCCACGATGATCCAGTTTGACTCAGGGAAACCCAGAAG 

TCATAGTNCTCTTTGCAGAATGACACAAGGATGTCAACATGCTTTGNTTGTGTACCTCGG 

CCCGCTCTAGAACTAAGTGGGATCC 

Sequence 890 

CCGCGGTGGCGGCCGAGGTGCATATATATATATACACACACATATATATATTTATGTATC

TTTTAMCATATMTTACTCTCTTTMTTCATTTGGACTTTTCATCTAMCTTGCT^ 

TTGCACAGGTCTGTTAGGGTAAGATATGTTCCTACCTTGAGAAATGTTGTGAATATCTAG 

CGAAACACCAAACATCCTCAGCTGACTAATGTGGTATCAGACTTTCTGGTTGCAAGGTAG 

GGGTGAATAAGGCAGGATGGGGTGCGGGGGTGGTGCTGGAAGAAGACATGGCATCAGGTT 

GGGTTTTGCAGGATACTGAAATTGTCTAGGGGCCTTGGCTGTGCAMGAGCCTTTCCGTC 

Sequence 891 

ACTTAGGGCAATTGGAGCTCCCCGCGGTGGCGGCCGCGCGGGCAGGTACGCGGGATTTCT 

CAGATAGTTATGCGCAGCTCCAGGCACCAGATTCTGTGCTGGGTGCAGGCAGGACCTGGA 

GGGCGTCCTCAAGTGTTGATCTGCAGGGACTGTCTTGATCTTTCCAGCAGTGTCATTGTG 

GGCACGTGACCTGAGCTTTCTGAGCCTATTTCCGCATCTGTAAAGTGCTATCCACTTCCA 

CCTCCTGGGCTGTCGTGCAGATGTAGGAAGGAATTGCACTCACACACTCAGCATGAGACA 

GGCGCTCAGTAAAAGCCCGTCCAGGGGATATGAGATCAGTGAGGGATAGGAAAGCAAGGG 

TGGGTAGAAACAGCAAAACCCTTTCCCA 

Sequence 892 

CTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGNCAGGTACCAAGCATTG 

GACACACAAAAATACAGGCAGCTTCTTCCCTCAAGGAGGTCACAGGTGGGTGTGTCCATA 

GCAAAGCTGGGAGGAAGTTGTATGAAGGAGCCTGAAGACAATGGGGAGCTAGGGGAAAGT 

TCTGAGTAGAAAGGAACATGTGGACAAAGGTTTGAAATGATGAAGACTGATTAGGAAGTT 

CATATTATGAAGCATMTTCAAGCTTTCTCTACGATGTTCAAATCCCATCTCTCCTACTT 

ACTAGANAGGTGACATTGGGCCAAGTTACTTATCTCCTCTGCTCCTGTTTATTTGTGTTC 

AAAAACAGGGACCTCTCTCACAGTGTGATTNTGAAGACTGGACAAGAAAATGGGAGGTTT 

TG 

Sequence 893 

CTACTTAGGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACAGTTTGGAAGTTT 

AGGCAAAAGTCATTTCTTCCCTATATTTTGTCATGCTTATCTCCTGTCTCTTTCTGTTTT 

ACAGATTAGCAATAMCTCCTTMAACCCAAAAGGTTTGGGCTTCTGTTCCTTTCACTTG 

CAGTCAGACATGGAGTTAGTGGTAGAAGAAACAGAAGGGGTAACCTGCATGGTGACAGCT 

ACTGAGGGGATGGATAGGAAAGCAGGCTGAGTCCCTGGGGCCAGTGGTTACCAAAGCCAA 

GGAGAGGGCAAGGGGAGCCCAGTGGGCCTGG 

Sequence 894 

CTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCGGGCAGGTACATCACCCTGCTGA 

GGGACATCCAGGACAAGGTCACCACACTCTACAAAGGCAGTCAACTACACGACACATTCC 

GCTTCTGCCTGGTCACCAACTTGACGATGGACTCCGTGTTGGTCACTGTCAAGGCATTGT 

TCTCCTCCAATTTGGACCCCAGCCTGGTGGAGCAAGTCTTTCTAGATAAGACCCTGAATG 

CCTCATTCCATTGGCTGGGCTCCACCTACCAGTTGGTGGACATCCATGTGACAGAAATGG 

AGTCATCAGTTTATCAACCAACAAGCAGCTCCAGCACCCAGCACTTNTACCTGAATTTCA 

CCATCACCAACCTACCATATTCCCAGGACAAAGCCCAGCCAGGGCACCACCAATTACCAG 

AGGAACAAAAGGAATATTGAGGGATGCGCTCAACCAACTNTT 

Sequence 895 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAGGAGTAGTCTAAAACAA 
GTGACTTTACTACTTATTCTTCTGCATGTCCTTACCAGCTTCTTACCTTCTTCAGGTTGA
GCATGAGATCAGC TTCAC AGGGGATGGGGTCCTTMGGGTTTTTTTCCATACTAGTTTCA 
GCCTTMCAATGAGTTTTCAACCCTTAAACATGAAAAATAAATAGTGCGAAAGAGGGGAG 
GATGGTAGAAATGCTTTAAMTTACCTTTTGTAAATTT^ 

TGCCTTGCTTATCAGGGAAGTCCTACAAACAAAGAACTCCACGGCTTCTTCAAGTCTTCC 
MGGGAACAGGGTCCCCCTGGTTCCTAMMTCAATGGGMGTAGGTTTTTGGTAACCAT 
CTACT G GTCM NGGGNAACCATTTCTACCTGGCGGTTTATTACACCTTTGCTAGGCTTCT 
TTTTTCI ! I I I CATTTTTAAAAAATAA I MM 
Sequence 896 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGTTCATATCCCAGTTCTA

GMTCAGTTCATTTTCTMGGAGTCX^TGGTTCCTTTTATTGGAAACCAAMTCTGGGCAC 

CAGGTGTGCTCCCATTCTAGTCGTTTTCTGACCACATAACTGCTAACAAAGATGCTTCAC 

TCTGGCTACACTGAT GTGMCT TTGAACTTTAGCAGAAGAGCTCAGCTCTAGAGAACAAT 

GAGCTCC TACAT TACCTTTTTTCCTCAAAGAATAAGTAAGTCTAAGCAGAAAAAAAATAT 

GCMAGAATTTTCAGTATGAATGAAATAAGACAAACCATCAGGCTTGCTGTATTGTAAAC 

CAACACAATATAGTTATAACAGATCTGTAGAAGGGATCCTTAGAATAAGAGAGGCATTTG 

TCGGGGGGTCATCAGGGAGAATACTGGATAGNATCTT 

Sequence 897 

TAGGGC GAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC I I M I M M ) M M M M M 

TTTTTTTCTTC ATGGCt ACATCTGAACAGCTACTGAGGGATATATATGCCMCTTTGGGA 

GTTGCACAGCTTTTTTGAGGCCATTTTNTMNATGACTAGGGACTGTAATTTCTNTTAAT 

TT GGM TAGCCACAAGTTGTTGTAGCCMGGTTTGNGNGNGTTTTTAATACAATTCTTAA 

MTTTTAGTAGGCTTCTCATCTGTANATAGATTTGAAGGGGNGGGGTTGCCCCTCCACAC 

CTGTGGGGTGTTTNTCGTAAGGNGGGACCAGAGACTTAGGA 

Sequence 898 

CCGCGGNGGCGGCCNAGGTACACCAAATGGATTACAAGCAGCATCCAGCAGAAGACAGAC 

CCCCCAACCCTGCCCACCAGGGCTCACACTCTACAAAACCCTGAGGGCCTAGAAATCTGT 

AAATGCATCGNCAAGCACTGGGGCTGATTTGCAGTAATTCTCTAAGCAAGGCAAACATGA 

TCTAGCTTTGAAGGCAGCATGAAGGCAGCGGGTTGGNGAGAACAATCTNTCCTTAAGAGA 

AGAAGAAACCTGGGGCGGANGGAGTTTTCCCCGG 

Sequence 899 

AGCTCCCCGCGGTGGCGGCCGAGGTACATGTTANGGTCTTGAGTTAATTGCTCTGTGGCT

GTGGATTTTrATTTGATGTTCTGATCTCTTCCTTCCAGTTTGATAAATTAGTGTAGAAAG 

TGGAAGAAAAACATGCCGGCGCAGCCTiSTGCGCTTTGTCAGGTTAACAGAATGGAGTCCT 

GCTCTGGCATCAGTCAGTGCTGTTGTCCGAACCCTCTGTGGCTCCTTCCTCCCTCCCTGG 

GGCCCAGAGCTGCAGACGCTAGAGGGGTA 

Sequence 900 

GCTNCACCGCGGTGGCGGCCGCCCGGGCAGGTACCCTAAATGTTAAACTGAGGGATGAGT 

GAMCAATATCAGGATTAATAAATAAACACATTCTTGAATTCCATCACTTAATAGAAGTG 

GCCATTTGAATGCTGGCAGGTNGGAAGAAAAGAGGAGGACAAAGAACCCCAAAAGTTTGG 

CATCATAACTACTGCCACAAG 

Sequence 901 

ATAG GGCN AATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACI M M M M M M M M M 

TGTATTTTTAGTAGAGATGGGGTTTCACCGTGTTAGCCAGGATGGTCTCGATCTCCTGAC 

CTTGTGATCTGCCTGCCTCGGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACCGCG 

CCTGGCCTTTCCAGGGTATTCTTTTAACGTGGTCTTATTTGCCTTT^ 

ATCTATCAGCATCATATACCACCACTGGAATATAAATTTGAAAGAGAGTCCTGCAGATTA 

TATACATGAATCTACTTAGGCCTAATAACCAAGCAGTCCTCAGTGGCAGATCAATGAAAA 

GTGAAACTAAAGGCAAGTGAAGGGTAGGAGAGATTGGCCAGT 

Sequence 902 

GGCAGGTACCCACCTCCTCGGTCACCCACAGAGCCACCAAAGATTCCATGTCCCAGAGCT 
TCCCATAGCAGACCTGAAAAGTCCATGACCTGAGCTTTGGCCATGGTAGTGGAGTGGAAC

AGGAAATAGTCCAGCAGAGGAGTGTGGGGGAAGGGGGCAGGAGAGGCACAAGAATAAGGG

AGACCTGGACTCTGGCTTTTTGGGAAAAGGAACCAAGCTCATAGCAATTTGGCTGATNAC 

ACMTCAGATTTTTCCAGGTTAAGCTTCCTTTCTG 

Sequence 903 

CCGCGGTGGCGGCCGAGGTACCATCTACTGMTGCCCAGTTTTGATCTATTTCTAAATGG 

AGCAAACCAATTCCATCTCCTAGAGCTGGAGACTGTATCCAGGCAGTGTGTGGACAGAAC 

GGACAATCTTTTCTGCCAAGGGCCTATTTGAGTGGAGCACCCCCACACGGGTTAGACGGG 

TCGGCACGGGGCTGGTGGGTGAGGAACTCAGGGGTCAGTGCAAGCTGCAGACCCTCATTT 

GGGGAACGCTCTCAGCACAATGCTCTTACAACTACAGGGTGCACTCCAAAATGGAGTTCA 

AGGAAAAAAGGCTMTGAGAMTAAAATCTGAAAAMTMCTTAAAMGTTTTGCT 

Sequence 904 

CCGGGCAGGTACGCGGGGGCCCTTTGGATACCTGCACTCCCCATCACCGCACTCCCCATC

GTGGCACTTCCCTTGTTGCAGTTTTATGGAGTGTGCGTCTGGCTCCCCAACTAGACTTGA 

ACCGCTTGGGTGCATAACTCGGGACTTGACCATTTGCGTCTCCCTACGGCCAGCTCAGCC 

TCCGCACACAGGGACCTGCAGAGAGTGGATGTAGCCACTGCCCCAGCGTCCCTGGGCTCT 

GAAGAGAAGCCATTGCCCTTCAAGAGCCACCCTCATTTCCTGGGCACTGGTTTGGAAAAA 

ACGAAGAAAAAGAGACACCCAGCTCACCTCCA 

Sequence 905 

CTCCACCGCGGTGGCGGCCGCCCGGGCAGGTACGCNGGGCAACTCATTCATGATATTGGG 

AGAAAAGCAAAGCAAAAACTGCAACAAAATCTCAAACCCTTTCTGCAGCAGCAGATGGCA 

AACAGTGATCAGAGGAGMGGACCCTTCCAGCATTAGAAGATTTCCAAAGGCTGTTCCAG 

TAGGGGCTGTGGGCTTCTGGGAGCCCAGATGCCCCCTGATGGTATATTTGAGTTTGTGAG 

GTGGAGGCCAGGTGGCAAGANACTGCNNGCCAATGTCAATGAAAAGCCTGGGAGGAAAAA 

GAGATTTCTGGGA 

Sequence 906 

AGGTACTT TGCTM CCAGCATTTTGGCTGTGTTATTC 

TTGC CATCTTTTTGAGAGTGCATAGACATTTMTTTTMGAMTT^ 

TTTTGTCTCTATACATTT GTAG GTCAGATGCACATTTGTT^ 

AGAGCAAMAT GTAM GTTTTGTATGTAGAGGATAATTGTATGATGATGATAAACTAATT 
AGGTATTACAGTTTTCTAACGACAGAAATTTGTAATAATTAGGTAACTTGGTTCATATTA 
AAATATTTGATACATAGGCCGGGCATGGTGACTCATGCCTGTAATCCCAGCACTTTGAGA 
GGC 

Sequence 907 

GGCAGGTACCACACCCATCTTACCCTCTTCCCTCTAGGTTCTGACATTCAGCTATCTTGG 
TGGGAGGCTGGGGAGCACTATTGGGGATGAGGGTAAGGTGGAGTTTTATAAAGCTCTCCA 
GGTGACTCAGAGACCACCCTCATTCCACCTGGTCACAAATCCCTGAATGGGAAACAGGTA
Ci 1 1 I t I l l I I I I I I 1 I M I 1 J I 1 GCAAAGAATTGTAAAATTTATTGTATAAGTATTGCA 

GCTTTTCANAATGTCATCATTGCCACTAATGATTACTGATACACAACAAGCAGTTTCTTC 

AGGCCTGTGGATTGGCATC 

Sequence 908 

AGGTACTTCCCTGAGCAGTCGAAGTGGATGCCCAGACCAATGGCCAGTGCTAATATCAAT 

GCAATGATCCCAATGACGATGATTGGAAAAAACTTCAATGGCAGCAGTGACAGGATCTGT 

GCAGCAACAGCATCTGCATCTGGTGCAACAGGACTTATTTTCAAATCATCAAGGCCAAAA 

AGCGATCGGAATGAGAAGGGGGCTTCAACAGCAGGCGGATCATTTTCCCCCATGGTGACT 

ATTTCAGGACCTCTGACATCCGGCTCCGCCTCCACCTCTACCTCATAATTCCCGAGTCCC 

AAAAATGTAGATGGCACCACGGAAGAGATAGTAGGCCACAGTGTTACTGGCTTCCCATA 
Sequence 909 

AGGTACAGCAAATTAAACCCAATAACAGGAGGAGGAAAACACCTAATTAATATAAAAATT 
TCAAGGATATGTTAAACAAACAAACAATTACAAGAAACTCCTGTCAAGTTAAACAGAGAG 
AGAGAGAGACAGGGAGAACCACTAACAGGAAAGAGAAAAAGATATACAAGCTATCCCGCA 
GAAATTAMMGAGACTMGAATATTACAMCAACTTTATTTCATTTAGATGAAATGGAC

AAATTTATTTTAAAACACAATTCGCCAAAATTGACAGAAGTGGMGTAGAAMTCTATTT 

CTGTATTTATTAAAGATACTGAATCTATAATTAAAAATATCTTCACACAGAAAATTACAG 

TTTCAAATG 

Sequence 910 

AGGTACGATCTGMGAAATGAMGGCATTGMCTTTGGTGGGTAMTTGGGTCTTTTCCA 

GCAAAGGTATAAATCCTTAAAAGCCAAGATCATATTGTTTGATTTCTCTGGGCTCTCTGC 

TGGATACAGTGCCAAGTCCATAACTGTATACCCCATGGACACTCTATGTTAAATGGAGAT

TAATGTGTAAG AGGTG I I I I I I I I I I I GTTTTGTTTTG I I I I I I AATTTGG AAAAGAAGC 

TTAAAGACCACAATGGGTGTGGCATTGGCTCGACCCACAGATCTGCTTAGTCTCAGACAG 

GCACTTTGAACCAGTCTTTTAAAATTGCGTCACAACAAC 

Sequence 911

AGGTACAGATCACTATGGCTTGTCTTTTCTCCTAACTAATGTAAAATTCCCAATAATTCA 

TAACT T GTATGAGGACAAC AGTTGTGTGMTCTACCCTGGTCCTTCTGATNATTTTTAAT 

TTTTTNAI 1 1 I I I I I I I I I IGGGGACAGAGTCGTGCTGTTATCGCCCGGGCTGGAGTGCA 

GTGGCATGATCTCGGCTCACTGCAACCTCCACCTCCCAGGTTCCAGCAACTCTCCTGCCT 

CAGCTTCCCGAGTAGCTGGAATTACTGGTGCCCACTACCACACCCGGCTAATTTTTTGTA 

TTTTTAGTAGAGATGGGGTTTCACCATGTTGGCCAGGCTGGTCTTGGACTC 

Sequence 912 

AGGTA CAAATTGTCGl I I I I ATTCCTCTTATTGGG ATATCATTTTAAAAACTTTATTGGG 

I I I I I ATTGTTGTTGTTTGATCCCTAACCCTACAAAGAGCCTTCCTATTCCCCTCGCTGT 

TGGAGCAAACCATTATACCTTACTTCCAGCAAGCAMGTGCTTTGACTTCTTGCTTCAGT 

CATCAGCCAGCAAGAGGGAACAMACTGTTCTTTTGCATTTTGCCGCTGAGATATGGCAT 

TGCACTGCTTATA 

Sequence 913 

TGGCCAGNTCAAATNACAACCCCCCAACCCCCCCCCCCCCCCCCCACAAACAGACAAGGA 

CACAGNTCACCANACAATGGATGTNCAGGNANTNGATATCAGCAGATATNTTAGNCCTNT 

AGATAGGCTAATTTNANTNAGCAAAGGAAAGAGGAGGTANCATTAGNCAGATGGGNTATT 

NACCTCTGAATTAGATGGCACTTACCCANCTTCTGGNACAGNCCTGCTGGNGGCGTCTAG 

ACTAGTGATCCCGGCTGANGATCGATTAACTATCATCCGCGACCTCAGGGGGGGCCGGAC 

CCACTTTTGTCCTTA 

Sequence 914 

CGAGGTACGCGGGACACTGGTGGGGGAGAGTCCGACGCGCCTGGCTAGGAGCGCCGACCG 

CAGGGCCTCTACGGACCTTACTAGAAAAATGAAACCTGATGAAACTCCTATGTTTGACCC 

AAGTCTACTCAAAGAAGTGGACTGGAGTCAGMTACAGCTACATTTTCTCCAGCCATTTC 

CCCAACACATCCTGGAGAAGGCTTGGTTTTGAGGCTTCATGCGAGAAAGGGGAATGGGGA

ATGGCTGCTTAACGGCATGTC I I I I I I II I I I I GAGACGGAGTCTTGCTCTGT 

Sequence 915 

CGCCCGGGCAGGTACGCGGGGACTTGACTTAAACTCTGGGGCCCGGGAGGCCGCCGGTTT 
TCTCCCCGCTTGCCGGGGTGGTCCTCTTCCCTTTGTCGGACCAAAGAAGTAAACACTGTG 
TGGAGAGGGACTGCGTGTTTGGAGGGAAATGGGAATGTACCT 
Sequence 916

CCCCGCGTCCGCTCTCTGTCGGGGTCCCCTCCATCTCGCTGCTGCTGAAGGCCGCGAGGG 

CGGCGGCGATGGCGGAGGCGGCGCTGTTGCTGCTGCCTGAGGCGGCGGCGGAGCGGGACG 

CTAGGGAAAAGCTGGCTCTCTGGGATCGGAGACCGGACACGACGGCGCCGCTGACCGACA 

GGCAGACGGACTCGGTATTGGAGCTGAAGGCGGCGGCAGAGAACTTGCCGGTGCCAGCTG 

AGCTTCCAATTGMGACTTGTGCAGTTTAACATCCCAGTCACTGCCCATTGAACTGACTT 

CAGTAGTGCCTGAATCTACAGAAGACATTCTCTTGAAGGGCTTCACTTCCTTAGGAATGG 

MGMGAAAGMTTGAAACCGCACAGCAGTTTTTTCTCATGGTTTGCAAAGCTGCAAACT 

CAGATGGATCAAGATGAAGGAACTAAATATAGGAGCAGTGTGATGCTATATTGAATGATG 

TAAACAGTGCTCTTCAGCATCTGGAGTC 
Sequence 917

GGCTGTGGCCAAGAAACGCAGGGACCGCTCTCTCCCCCGGGCTTTCGAAATCTTCACAGA 

CAATAAAACCTATGTCTTTAAGGCCAAGGATGAGAAGAATGCAGAAGAATGGCTCCAGTG 

CATCAACGTGGCAGTTGCCCAAGCCAAAGAAAGGGAAAGTAGAGAAGTAACCACATATCT 

GTAGGGAATTTATAAGTCAGCCATGACAATTATACACCACAGGCATTGTATTATCATTGC 

CMTGTCMGMAMGAGCTAMTTTACCMGCCATGGTTGGNTTTTTACTAAATACCAT 

GGGMTTTGTTGGTCCTTTAAGAAGAAGGGCCTTAAAATGGCAGGGATTTCTTAGTNAAA 

TGNCAATACTCTAACAGCTTTAGTATTGACTTTAGAATATATCTGATGCCCACAAAAATT 

AAATAAAAGGGNTTNGAGGAGGTTTGCCCNAAATAAGTGNGGGGCCCGGAGGGGAA 

Sequence 918 

AGTCNCCACGCGTCCGCGGACGCGTGGGCGAGTGCCCAGTGACCCTTTACGGGGGTAGCT 

TTTACTCCGCACTCTCAGCCCCTGCCTCACCCCTCCCTCAAGGCCCGGATTGACCATTTC 

CTGCTCCAGCACTCCATCCCTGGCTGCCACCTGCTTGGGAGAGCACAGACGGCATTGGCA 

GTGATCCCTTCTTCCATTGTTCTGCCCTCTCAGAAAAGGAAGATAGAGCAGGCTGAACAT 

GTCCCAGACAGTAACTTTGGTGTAAATGCTTCCTGTTTTTCTGCCACAAGCCCTTTGGTC 

TTACCCACTACCTCAGAGCACACTGCTAAGAAAATGAAAGCCACCAATGAGCCCAGCCTG 

ACACATATGGGACTGCTCGACAGGTCCACTGTCCCACGAGCAGAAGCTGGTCACAAAGCT 
TGGGAAAT 

Sequence 919 

GGGAGTCGACCACGCGTCCGCGGACGCGTGGGCGAGTGCCCAGTGACCCTTTCACGGGGG

TAGCTTTTACTCCGCACTCTCAGCCCCTGCCTCACCCCTCCCTCAAGGCCCGGATTGACC 

ATTTCCTGCTCCAGCACTCCATCCCTGGCTGCCACCTGCTTGGGAGAGCACAGACGGCAT 

TGGCAGTGATCCCTTCTTCCATTGTTCTGCCCTCTCAGAAAAGGAAGATAGAGCAGGCTG 

MCATGTCCCAGACAGTMCTTTGGTGTAAATGCTTCCTGTTTTTCTGCCACAAGCCCTT 

TGGTCTTACCCACTACCTCAGAGCACACTGCTAAGAAAATGAAAGCCACCAATGAGCCCA 

GCCTGACACATATGGACTGCTCGACAGGTNCACTGTCCCACGAGCAGAAGCTTGTCACAA 
AGCTTGGAA 

Sequence 920 

AGTCGCCCCGCGTCCGCGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGCGGACGCG 

TGGGGGATGGATGACAGTCCACCAGAAAAAAGTTAGTGGAGCGGGGACAAGCAGGGTTGC 

AGAGTGGAAGAAAAATGTTCCTGTGAGAAGAAACTGTCCAAAGAGTNTGAAGAGAAAAGG 

GAACAGGGTGAATTTGANGCCCTACAAGAAAAACAGGAGACCATTCAACAGGAGACGCCC 

AGGGAGCAGGTGGCTTTGTGGGCCTGATGTCCAAGAAAGAAGTNCTGGTGGTAAACAGAG 

ACTTGTGGATTGCAAGCTACTGTTGTCTTTCTATTGAAA 

Sequence 921 

TCGGAGTCGCCACGCGTCCGGCCAGGCGTGCCTGGAAATCCGCTTTCGCAGCGCCCCCTC 

GTAGCCCGCCTCCGCCCGCAGAAGGCGTTCCCTGGACAGAGAAGCGGGCGCGCGGGGGCG 

GGCGCGTGGGGCCTTGCCGGAGAACCTGACTCTCCGCAGCAGCAGTGGAAGCCAGAGTGA 

CGCGTTGTGTTGAACACCAGTTTTCTGGAGCGCTGTGTGTTCTCAACAGCTGAGCAGTCT 

GTTTCTCCAATCAGGTTTCAAAGCCACTTCAACTGCACTGGCCCCTGTGGGTCACTGCTG 

CACCGCCCTGGCCCATGTGGGTCCCTGAGGAGCGACCTGCCGGGGCCACCTGGCTGGACG 

AAAAAGACACACTTTGGGACTTAAGCCGTGAGAAAAAAACTTCATCAGTAAGAAACAAGT 

CAATAGACAAGTAAAAGACTAGGAGAAAATATGCATAAAACATAAAAAGTGACTTGGATT 

CCTGATCTTGGAGTATTTAMGAATTCCTATAACTTANMAAGGTTTCAAGTTTTTTNAA 

ATGAGCAAAANGGTTTGGGTAAA 

Sequence 922 

TCCGCAGGCTGGGGGATCCCAAAGGGTGCGCTCCAGCCCCCAACCCAGGCACTGGGACTC 

TGGTGGCACCCTGGGTGGCAGGCAAGCCTTGAAATCAAGTGCACGAGCCTTGGAAAGGAG 

GACCGGGAGAGTTATGGCATTTATGAATGAAGAAGAGAAAGAGMTCACTCGGATGGGAA 

AAGTTAACTGGATTGTTCCCACCTGCATGGATCACCCGGGTAACTGCAGTGGGACCGAGG 

GGGCGAGGCTGCGGGCTGGGGGATGTGCCGGGTTTCCTTGTGTTGCCACGAACCCAGAGA 
GGGAGAGGAAGAAGATGGAAAGAAAAAGGAAAAAGGGAAGGAAAGTAAGAGGGGAGAGAG

GGGGAAAACTTGAGGATGAAAGAAGAGACAGAAGAAAGAAGAAGACCTTGAGAGAGGGGA 

GGAGAAAGGAAAGGAAGCNGGAAGGAGGAATTGGAAAGTGAAGGAAAGGGGGGAAACCAG 

GCNGAGAAAGAAAAGAGAAAGGGGGAAGGGAAAGAAGAAAGGGGAAAAGNAAGGGGGGGN 

GGTTGAAAATCAACNCGAAAAAAAGAGG 

Sequence 923 

CNCGCGTCCGGCTGTGATGAATGAGGTCTAGGAAATAATTTGCATGTGTCTTGGGGGACA 

CAACAGTAACNGAGAGGAAATACATTATTACAGCAACTTGCGACGTACTAATACCTGTCA 

GTGTTGGCCCCCGTAAGGTATGTAAGGCACCTGNGANGTGCCCAGTNAGTNCCTTGGTGN 

AAGGCCAACATGTACTAGTTATGTAAGTATTGGTGTCTGCTTTAAAAAAGGAGACCCAGA 

CTTCACCTGTCCTCTTTAMCATTTGAGAACAGTGTTACTCTGAGCAGTTGGGCCACCTT 

CACCTTATCCGACAGCTGACTGTTGGATGTGTCCATTGTCGCCAGTTTGGCTGTTGCCCG 

GACAGGACAGGACCTCCATTGGGCGCAGCAGCAGGTGGCAGGGGGTGTGGCTTGAGGGTG 

GGTGGCAAGCGT 

Sequence 924 

CCCCGCGTCCGCACAGATCCTTGAGCTCCGCTGCAGGATAGTACAGTTTTACCGCAGAGG 
GAATCTGGAACAGTGGAATCATGTGTCTGCCCTGTGTATTGCAGTTTGTATTGCCACAAG 
CTATATTTATACCAGTGTCACCCTTTTCTTGTAGAATATACTAATAAATCTGTGCCAACT 
C TACC TTCTCACTTTTACCTCTGACGTCATTCT^ 

GT11TGATAGACTCTGAGGATTATGTGAACAGGACATTTTTCATTTGTGMTTTAA 
ATACTGTCMGGTACTTGCT TGTGT CTGMCTCTAGTGCACTTATGATTTTGTAGACCCA 
TGTGAAATTTAATAAGATACG I I I I 1 I I I CCTTTCTTTGGTGTGGTAGTGCAGCAACAGT 
TTGGTCTGCATTTGTTAGAAGTTTAACTCCTAACA/^.CCCAAAGACCTATTTA 
Sequence 925 

GCGTCCGACCCCAAAGGGAGGGACCACATTGCACACACTGTAAGAAATGCACTTTCCGAG 

GAAGGGGATGGGGGAGCCCGGACACCCAGAGCTCCCCGAGTTGGGGGTGCCCGTCTGGAG 

CGCCCCCGTCAGCCCCTGGCGGTGGGAGGTGAGAGCGAGTGGTTTAAGTGCCTGATTACC 

ACCACCCGCCCCCCCCT7TGTCCAGCTGGGACACGGAATGGCCGCGGGCCTCCTCCCCCT 

CCCCTCCAGCCTCTCCACCAGCCCCTCCAGTCAACCCTCATCGCCGTGCCCCCCCAGAGC 

TAGAGAGATGGGGCCCCTGCGTGGCCCGAGGGGCAGAGCTGGGCGTCACTTCGCAAGCGT 

CCTGCCCTGCCGGGGCGCGGGGGTGGGCTCTGGGGAAGCCGGTGCGCCCCCCACGCCTNC 

GCTGCCAGTGCCTTACATTCTGGAGCGACCCCCCTCCCTGGTGCCTCCCAGCGAAGGGGG 
ACCCGC 

Sequence 926 

AGACAGCTCAAGCCTTGCCACTTCGGGCTTCTCACTGCAGCTGGGCTTGGACTTCGGAGT 

TTTGCCATTGCCAGTGGGACGTCTGAGACTTTCTCCTTCAAGTACTTGGCAGATCACTCT 

CTTAGCAGGTAGGTGCCGCAGACCCTGCGGGTTAAGAGGTGGGGTGGGGGGCAGTGCTTG 

CCAAGGCCCTAAACTGGGAGCGCTGGGTGAGGGGAACAACCCACTTTGGAGGGTTCTCTG 

AGAGATAGATACACCCCATATCCTGGGCCCAGCTCGTGCACACAGCTGGAGGTCCAGAGA 

CCCAGTCCCCTCTGCTCCGTCAGCCAAGTTCCAAGAAGTTGAGCAGAGACCCTTCTGGGA 

GCCTGGCGGGGTGCAGCGGCCTCCCCTGCGGGGCCTGTCACCCGGCCGGGCGCGTGCAAA 

CGCCTCTGGCGCCTNTNTGCGCGGGAGGGGAGATAAGCGTCTGAGCCAGGGAAAGCCGCC 

GGGCTAAAACCCGCCTTTTCCGGGGGCCCC 

Sequence 927 

CGCGTCCGGTCATATACMTGTCATTGTTTGGGACCCGTTTCTAAATACATCTGCTGCCT 

ACATTCCTGCTCACACATACGCTTGCAGCTTTGAGGCAGGAGAGGGTAGTTGTGCTTCCC 

TAGGAAGAGTGTCTTCCAAAGTGTTCTTCACTCTTTTTGCCCTGCTTGGTTTCTTCATTT 

GTTTCTTTGGACACAGATTCTGGMAACAGAATTATTCTTCATAGGCTTTATCATCATGG 

GATTCTTCTTTTATATACTGATTACAAGACTGACACCTATCAAAGTATGATGTGAATCTG 

ATTCTGACAGCTGTCACTGGAAGCGTCNGTGGAATGTTCTTGGTAGCTGTGTGGTGGCCG 

ATTTGGAATCCTCTCGATCTGCATGCTCTGTGTTGGACTAGTGCTGGGGGTCCTCATCTC 
GGTCANGTGACTTTCTTTACTCCACTGGGAMCCTAMGMTTTTTTCATGATGATTGGG
TGTATTCTGGGTCACTTTCTCTTGCCATAAGCTATNCTCATTCCAGTAGTTT 
Sequence 928 

CCACGCGTCCGGACGCTGCGTGGAAGCGGCGGAGCCGGAGGGAAGCAAAGGACCGTCTGC 

GCTGCTGTCCCCGCCCCGCGCGCTCTGCGCCCCTCGTCCCTGGCGGTCGCTCCGAAGCTC 

AGCCCTCTTGCCTGCCCCGGAGCTGTCCCGGGCTAGCCGAGAAGAGAGCGGCCGGCAAGT 

TTGGGCGCGCGCAGGCGGCGGGCCGCGGGCACTGGGCGCCTCGCTGGGGCGGGGGGAGGT 

GGCTACCGCTCCCGGCTTGGCGTCCCGCGCGCACTTCGGCGATGGCTTTTTCCGCCGCGG 

CGACGGCTGCGCCTCGGTCCCCGCGGCCTCCCGCTTCTTCTCTCGGGACTCCTGCTACCT 

NTGTGCCGCGCCTTCAACCTAAGACGTGGACAGTCCTGCCCGAGTACTCTGGCCCCGAGG 

GGAAGTTA 

Sequence 929 

CGACGGCCANGGCGCCTCCGAGTTCCCCGCCAGGACTCGGAGGGCCAGGAGGGCGCGACC

TGGGTGGATATTTTTGTTGGACGGCGCAACTCTTGGGGTGGCCCGGGAGCGGCGGAAACC 

GAGCGAGAGAACCAGGAGGCGCTGCGCAGAAGGAGGCCCGGGGGCTCCGAGGCGTTGAGG 

GGCTCGATCTGCGTTCTGGGGTTGGCAGCCGAGAGGCCGCGGTCCCTGAGTGCCAGAGGT 

GGTGGTGTTGCTTATCTTCTGGAACCCCATGCAGCCAGATCCCAGGCCTAGCGGGGCTGG 

GGCCTGCTGCCGATTCCTGCCCCTGCAGTCACAGTGCCCTGAGGGGGCAGGGGACGCGGT 

GATGTACGCCTCCACTGAGTGCAAGGCGGAGGTGACGCCCTCCCAGCATGGCAACCGCAC 

CTTCAGCTACACCCTGNAGGGATCATACCAAGCAGGCCTTTGG 

Sequence 930 

CGTCCGCTTTNAGACCGGMGACATTTAAAAGCCAGTTTACGTACANGAAGCATGGTTTT 

AGATTAACTGCCTGTTGGTACAGCTAGAMCATTGCAGCCCTATCGCTTATTTATCTTGC 

ATGTTGCTCTGCTTTGCTATGAAAMTATCGTTTTATGATAAAACTTGTTGAATTTTGAT 

ATGTATTCGGTTATACTCTTAGGGAAAATAATAGAAATTAGAGTGAGAGAAAGTGCTATG 

TATATTAGGCTTTCAGATTTTATAGATATAGGCTTAAGGGAGGGTGGAGGTTCTTTTTTT 

AAGTTGAATGACTACTTAAATTTGTTGATGTGAATTTAAGTTTTAMGATTATTATTM^ 

TAACTCTTCTCTTTGTCTTTGCATTTACCTTCCCAGATGTTCCAGCCTATCATTTTACTT

ATTCTCATTCTTGTATTATTTTCATCACTTTCTTACACAACMTATTTAAACTTGNCTTC 

CTTTTTACACTGGTTTTTGGTAC 

Sequence 931 

CACGCGTCCGTGGAGTATGTGCCATCTGCCAAAGTGGAGGTGGTGGAGGAGCGCCAGGCC 

ATCCCTCTAGACGAGAACGAGGGCATCTATGTGCAGGATGTCAAGACCGGAAAGGTGCGC 

GCTGTGATTGGAAGCACCTACATGCTGACCCAGGACGAAGTCCTGTGGGAGAAAGAGCTG 

CCTCCCGGGGTGGAGGAGCTGCTGAACAAGGGGCAGGACCCTCTGGCAGACAGGGGTGAG 

AAGGACACAGCTAAGAGCCTCCAGCCCTTGGCGCCCCGGAACAAGACCCGTGTGGTCAAG 

CTACCCGCGTGCCCCAC 

Sequence 932 

GGTTCGCCCACGCGTCCGCCCTGCTACCCTGGGAGAAGCCTCAGCTTTCTGGGCAGAGTT 

TGTCTCCCTGTCATTTATACTCTCAGGCTTTATACATTTACACAGTAAGTTCTCCCTCCT 

GGAGGGTTAAAAGGAATAATTTCAACAGGGTGAAGGCCTGGCACGGTGGCTCACAACTGT 

AATCCAAGGACTTTGGGAGGCTGAGGTGGGTGGATCACCTGAGGTCAGGAATTTGAGACC 

AGCCTGGCCAACTTGGTGAAACCCTGTCTCTACTAAAAACAAAAATTAGCCAGGTGAGGT 

GGCACACACCTATAGCCCCAGCTACTGGGGGAGGCTGAGGCAGGAGAATTGCTTGAACCT 

GGGAGGCAGAGGTTACAGTGAGCTGAGATGGCACCACTGCACTCCAGCCTAGGTGACAAA 

GCAGCAAGACGCATTCTNAAAACAAAACANCAACAACAACAAAAAACGGGAAAACA 

Sequence 933 

CNCGCGTCCGGTCCACTGTCATCTCCTGGGTTTTCTCTGCTCTTTTATTTGGTGATCCTG 
GTTCTTTCGGCCGTTCACGTCATTGTGTGCACCTCAGCTGAAAGTTCGTGCTACTTCTGT 
GGCCTCTCGTGGCTGGCGGCAGGTGGGGTGATGGTGCTGGCCTCGGCGCTGCTGTGTGTG 
ATTGTGTCTGTTCTGACCAACGTGCTCGTGGGTGGAAACACCCCAAGGAAGAACCCCATG 




CATCCCAGCTCAAGGTGGTCAGAGCTAGACCTTCTTATTCTGTTGGGGACGGCGGGCCAC 
GTCTTGAGCCTGGGCGCCAGCAGCTTCGTGGAGGAGGAGCACCAGACCTGGTACTTCCTT 
GTGAACACCCTGTGTCTAGCTCTGAGCCAAGAAACCTACAGAAACTACTTT 
Sequence 934 

TCGCCCCGCGTCCGGTTATTTTACCCAGAAGCCGGATAGAGAAAATATTACAGAGAAAAT 
CACATATCACATGGGCTCGMAGATGTAGAGGTTTTTGACAAATGAAGAACAACCATAAC 
AGGTAGAGGGAACACCATGAACCAGGGCATGAAACTGAAAGTGCATAACATATTCTAGAG 
AGAGAAGGGTGTGGGCATGAGTTAGGGCTGGAAAAACAGGTTGGAAACAGATAAGTAAGG
GTCTCAAATGCMTGTCAMGAGCTTGCAGTTTATTTTCCAGGCAATGAGTAGGCAGCCA 
AAAAAAAAAAGTAAGGATG I I I I I I I I I I 1 11 1 CCCATGGCATCATATTTAAGAGGATGG 
ATTTAMTTGTGTGAGACCAAAGCATAGAGACTAGATMGAGGGCGATCATTATTTCAM 
AAGAAATAATGAAGATCCAATGAAGGAAGTGGGAAATTAAAATAGGGGAAGAGAGGTA 
Sequence 935 

CCGTCCGGTTTTTTGTCTCAGAGTTCTTCAGGCTGTACAGGAAATGTGGTGCCGGCATCT 

GCTTCTGACGGAGTCTCACTCTGTCGCTCAGGCTGGAGTGCAGTGGCATGATCTCGGCTC 

ACTGCAACGTCCGCCTCCTGGGTTCAAGCCATTCTTCTGCCTCAGCCTCCCGAGTAGGTG 

GGACTACAGTGGCCATGTGTCTGAGATCTAACCAAGGGAACATGGGTGGAACTGATGTAA 

GCCACTTTGACACCACMAACCTCCCATGGGTTCTCTCTCTTCTTCTCTGGTGTACTTGT 

TGGATGGAGAAGATGCTGAGAAATAGTGGGAAGTCCTAGGGGATGGAAGAACCCAGGATT 

CTGAATACTCCATTGGACCTTACGTTTTGGAATCAGGNATGATGCTGGCCTTCATAAAAT 

GAGTTATGGAGAAAGTCCCTCTTTTTCTGGTGTTTGGMCANGTTTTCAGAAA 

GNTACCCAGCTTCCNTCTTTTGTACC 

Sequence 936 

CCGGTGAGCGCCCCGCCCGCTCAGCCGCCAGATCAACCTTAGCGCTGGGGCGCGGGCTGG 

GGTCGCCAGGCGGTGCGTTCTGCCCGCGCGGGGCTGAGAGTTAGGGGCCGGGGCCGGATC 

CGGGGCCGGGGGTCGCGCCGCTAGCCGCCAGCAGCGCAGTCCGGGCCGCCACCCTGCACC 

CTCCGCCCTGTTTCTGCACCCGTCTGGGTTCTTGTGCCGCCGCCCGCAAGCCTTCCCGAG 

CTCAGGGTGGTGAGCTGCGGAGACCCGTGATAATTCGTTAACTAATTCAACAAACGGGAC 

CCTTCTGTGTGCCAGAAACCGCAAGCAGTTGCTAACCCAGTGGGACAANGCGGATTGGAA 

GAGCGGGAAGGTCCTGGCCCAGAGCAAGTGTGACACTTCCCTCTTTGACCATGAAACTCT 

NGGGTGTCTGCATTGCTGATGGC 

Sequence 937 

GTCCGCCGGCATGAGCTGTCCATGAAGGATGAGCTGCTTCAGTTCTACACCAGCGCTGCG 

GAGGAGAGTGAGCCCGAGTCCGTTTGCTCAACCCCGTTGAAGAGGAATGAGTCGTCCTCC 

TCAGTCCAGAATTACTTTCATTTGGATTCTCTTCAAAAGAAGCTGAAAGACCTTGAAGAG 

GAGAATGTTGTACTTCGATCCGAGGCCAGCCAGCTGAAGACAGAGACCATCACCTATGAG 

GAGAAGGAGCAGCAGCTGGTCAATGACTGCGTGAAGGAGCTGAGGGATGCCAATGTCCAG 

ATTGCTAGTATCTCAGAGGAACTGGCCAAGAAGACGGAAGATGCTNTCCGCCAGCAATGA 

GGAGATCACACA 

Sequence 938 

CCCGCGTCCGGAATTCCAGTTGTGGATGAAGGAAATGGTGTTATGACTGCCTCAAGGTTT 

TGTAGCAAGTCATAGGGMCCAAAAGAGGAATCTTGTTTTCCTCAGAGGTCATGCCAACT 

CCAACTCCCGTTCCCTAAACTGTCTCTGAGCCATAGACTAGTAATGGACTCTTCAAGCTC

TACCATTAGGTATCTTTTAAAGAAAGCTGGTTATTACTATTTATTC A I I I I I 1 ICTCTTC 

TGTGCAGTGCAAAAGATATGAAACATCGGCTAGGTTTCCTGCTGCAAAAATCTGATTCCT 

GTGAACACAATTCTTCCCACAACAAGAAGGACAAAGTGGTTATTTGCCAGAGAGTGAGCC 

AAGAGGAAGTCAAGAAATGGGCTGAATCACTGGAAAACCTGATTAGTCATGGAATGTGGG 

CTGGCAGCTTTCAAAGGTTTCTTGAAGTCTGAATATAGTGAGGAGGAATATTGACTTCTG 

GATCAAGCTGTGNAAGAGTACAAGAAAATC 

Sequence 939 

CGTCCGGCCGGCGACGGCGGCAGTGGCGGCCCGGCCTGCAGGAGCCCGACGGGGTCTCTG 
CCATGGGGGAGTGACGCGCCTGCACCCGCTGTTCCGCGGCAGCGGCGAGACATGAGGAGA

CCCCGCGACAGGGGCAGCGGCGGCGGCTCGTGAGCCCCGGGATGGAGGAGAAATACGGCG 

GGGACGTGCTGGCCGGCCCCGGCGGCGGCGGCGGCCTTGGGCCGGTGGACGTACCCAGCG 

CTCGATTAACAAAATATATTGTGTTACTATGTTTCACTAAATTTTTGAAGGCTGTGGGAC 

TTTTCGAATCATATGATCTCCTAAMGCTGTTCACATTGTTCA 

AMCTTGGGACTGCATTTTTTATGGTTTTGTTTCAMAGCCATTTTCTTCTGGGM 

ATTACCAAACACCAGATAATTGGATCACTAAAAATTCCTGGTAGAAAAGAATTTAAAGAC 

AAA 

Sequence 940 

TCCGAAAGNGTACTGCCATGANCCGAGATAGGAGACACATAAGAGGACAGCAGAAGCCCT 
GGCCCTGGGGAGGCTTCTCGGAAGGCCTGGCTTCACAGGCAGGCCACAGAAGGATATCGC 
GGGCACCGTGCACCCAMGCMGATAGTGGCTTCCCTTTTATATCCAATCTAATCCTGAT
TGGATGTCCCTGAGGCCCCTGCTGGAAACAGCCATAGGAGAGGGCCCATGGCAGTAGGGG 
MAGAAGGAAGAAATTCCCTGCAACAAMCTTCAGCTAMCTTTGATTTGTGTATTGTTT 
ACATAA TAAT TTTAAAGGGTACATAATGTGTAAAGAGTTTGGATAGAACCTCTCTTCATA 
CTATGGTTTTCGTAAAGGATCTGTTGTTGTtACGGATTC A I I I I I I CCCTCTATTTTTAT 
AAAGAGCAGCAGAGTTGTCTTCTCAAAACGGCTGCCAAGCTCTGCTTCTTGGGAAGAT 
Sequence 941 

CCCGGTCGTGCGGGTCGGGCGCGGGCGGGCGCGGCGGCAGTGGCGCTNTCAGGTGATTGA 

CTGGCCAGCTGCCTGAAGGAGCGCCAGGTCCTCCTTGCTGGCAGGTGGCGAAGCCCATTG 

GGGCGGCGGTGCAGACCCGCGGCGGCNGCTGCGGCGGTCTGGCTCGGGAGGCGTTCCTGG 

GGCCAAGGCCATGGCCCCGCGGCTGCAGCTGGAGAAGGCGGCCTGGCGCTGGGCGGAGAC 

GGTGCGGCCCGAGGAGGTGTCNCAGGAGCACATCGAGACCGCTTACCGCATCTGGCTGGA 

GCCCTGCATTCGCGGCGTTGTGCAGACGAAACTGCAAAGGAAATCCGAATTGCTTGGTTG 
G 

Sequence 942 

CACCCACCCCAGATGCCGCCTGGCACCAAGCGCAGCCGCCAGCTGCCGCACTTTCCACTT 

GTATTGATCACCTATNANNCCCGCGCANAACGGCTACGNCCGAGCGGACCGCGGCCAGCG 

CGCCAGCCCTTGGCACNCCCTNGGAGCAGAAAGGGCTCCGGGAGGAAACTCCTTGGGAGC 

GCCCTGTCCGGANTGCCCTTTGCTCTCTGCAGTGTGATTTCTTTCTGTTCTGGGAGGAGG 

AGGAGGAGGANGAAGAGGAGGANGAGGNAGAACGANANNCTGCCCTTCCAGAGGTTGGTG 

AGGGAGATCGCGCATGGATTTNAAAACCNACCTGAGGTTTCAGAGCGCAGCCATCGGTTG 

CNCTGCANGAGGCTAGCGAAGCGTACCTGGTGGNTCTGTTCGAAAGA 

Sequence 943 

GTCCGGTTTTGAAACAGAAATGTAGGCATTAGACTTCCTGGGCGGCAGACAAACCAAAGA 

GCGGAAATTCATGCAGCCTGCAAAGCCATTGAACAAGCAAAGACTCAAAACATCAATAAA 

CTGGTTCTGTATACAGACAGTATGTTTACGATMATGGTAAGCTTTCACATTTGATTTCT 

TCTGTTTTTCCAGTAACTGTGAAGGGAAATTGGTAGGAGGTGTTGTAACAGGGCAGGACC 

CAAATGGGMCGGGGGGATGACATTGGTTTGTCAGGTACCGAGCAAAGAGTGAGGATTTT 

GGAGTCTCCCTTCTGCTGCTCTGATGTTTTCCACATGCTTATTTCTTTGCCAGGCACTGG 

AGATGCAGTCAAGAAGTGGGAAGTGGCTCTTACTTCTAGTCTGTGTGTGTATAAGTCACT 

TAAGATGGCCGTGTTGACTGCTTCTTTGGGMATGCCCTGAATAGGAGCATGTAGGGGAT 

GCTTACCGAGGCTGGGGAAGG 

Sequence 944 

GCGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGCGGGCCCGACCCCGAGCCCGACC 

CCGAGCCCAAGCCCGAGCCGGAGCCCGAGCGAGACCCCGAGCCCGAGCCCGAGCGCGACC 

CCCGGTGCGGCGCGGCTACCCCGGCGGAGGCGGNGGGCGCGGGGCGCGCTCTGAGGCCCG 

GGGGATGCGCCGCCGCCTCGACCATGGGCGCCGCCGCCTCCAGGAGGAGGGCGCTGAGGA 

GCGAGGCCATGTCCTCGGTGGCGGCCAAAGTGCGAGCAGCCCGAGCGTTTGGAGAGNACC 

TGTTCCCAGAGTCACCCTGAGAACCGCAACGGCCGCAGATCACCTGCTGGCTGATGCCTA 

CTCTGGCCACGACGGGTCCCCCGAGATGCAGCCGGCCCCCCAGAACAAGCGCCGCCTGTC

CCTCGTTCTCCAACGGCTGCTACGAGGGCAAGCCTTCTCAGAGGAAGCCCAAGCATTAGG

AAGCCCGCAGGC 

Sequence 945 

CGCGTCCGGCACGGGGGAGTCTGTGGTGGCCNGTTTACCTGGGCATCTGGCTGAGAGGAA 

GAAAGGCCAACCTGATCCTGAGGGGACCCAGACATATCCTTTGCACTGTCCCTAGAGGGG 

CGATGAGCTTTGCAGCATTAAAAAATGGTGAAGGGGGGAAATATTTTGAACCAAAGACCA 

MTGTTAGGCCGCCGTTATATTTGCAGAAGCTTTGAGAACCATGCGTATAGCCTCCTGCA 

TTCTCCCCTCTCCTAGGAGCTCTTTTGTCTCTGTCCTTACGAGGCGTCATACAGAGGCAG 

TGGGGTGGGCACAGATGAGCAGAGTGGATGGTTCGGTGGGTCCCCACGAGGGCGAGTGGT 

GGTCATATGTGATGGCACCGTGTTCACACACCCTCCTGTGTACCCCCCCAGGGTCACCCG 

AAGTCCCCACACGCTGGCTCTCCACACCCCTCCTGTTCCAGAAAGCATGTCCCG 

Sequence 946 

TCGACCNCGCGTCCGGCACTCCCTCTGGCCGGCCCAGGGCGCCTTCAGCCCAACCTCCCC 

AGCCCCACGGGCGCCACGGAACCCGCTCGATCTCGCCGCCAACTGGTAGACATGGAGACC 

CCTGCCTGGCCCCGGGTCCCGCGCCCCGAGACCGCCGTCGCTCGGACGCTCCTGCTCGGC 

TGGGTCTTCGCCCAGGTGGCCGGCGCTTCAGGCACTACAAATACTGTGGCAGCATATAAT 

TTMCTTGGAAATCAACTAATTTCAAGACAATTTTGGAGTGGGAACCCAAACCCGTCAAT 

CMGTCTACACTGTTCAMTMGCACTMGTCAGGGAGATTGGAAAAGCAAATGCTTTTA 

CAC 

Sequence 947 

ACCCCGCGTCCGCTTTTGCATCTGGATCATTTTTCTCTTGCCCCACCATGTAAGAAGTGC 

CTTTCACCTCCCACCATGAACCTGAGGCCTCCCGAGTCATGTGGAAATCGCCCCCAGCCA 

CCCCACCCCAGAGGGCTACGTCCTGGCAGAGCTGGGTTTGGTTAGTTCTGAGGGCTGAGC 

TGGCCCAGCAGCTCCAGACCTCCAGACCTTGCACTCACCTGTGAACTTGACTCTGCAAAC 

TCCTCCAAGATGCGCCCACCACACTCCCAGTGAACAACACCTACAGGAGCTTGGAGTTCT 

ATTCTCAGATACATCAGCTTCCACATTCCTGTGTGTCCCAGCTGGAGAAGCAAGAAGTCC 

CAGACCATGTGCTAAGCACACGTTGGGGTGGGGATGAAATCCAATTGGTGGTGTGTGAAT 

CCATGCTGGATTCATGAAGCTGAGGCCCAGAGGAGGMGCTTTCTTAATCAACTTCTTAA 

CATG 

Sequence 948 

TAMAGCCATGGTNATTTGTGCACTGTGCAGTTTCTTATTAGCAAAGGTGCCAATGTTAA 

CAGGGCTACAGCCAATAATGATCATACAGTAGTGTCGCTGGCATGTGCAGGAGGCCACCT 

GGCAGTTGTTGAGCTTCTCTTGGCTCATGGGGCTGACCCTACTCATCGACTCAAGGATGG 

TTCMCAATGCTCATTGAAGCTGCAMGGGTGGCCATACTAATGTAGTTTCTTATCTGTT 

GGATTATCCAAATAATGTTCTGTCAGTTCCCACCACAGATGTGTCTCAGCTCCCTCCACC 

TTCTCAAGATCAGTCTCAGGTGCCACGTGTGCCAACGCATACACTTGCCATGGTTGTACC

TNCCCAGGMCCTGACAGAACTTCACAGGAGAACTCTCCTGCCCTTTTAGGAGTGCAAAA 

A 

Sequence 949 

CCACGCGTNCGGTCGGCCTGTGCGGCGCTGCGGCGGAGCGGGCCATGGCAGTGGGGAGGG 

GGCGAGTGTAGTGCTGCGCGGGGCAGGCGGGAGGTGATCGAGAGAGGCAGGGATGGGGGC 

GCCGGAGTGGAGCGGTTGCGGCGGNCTGGGCTGCTGACTGCGCACTTGGAATAGTAGCAG 

GCGGCGGCGGCGGAACGCCAGGCAGTGTATGTTTTAACTGGAAAAAGTCTTCCATGAAAA 

CCGTCACTTTTAAMAAATMGGTAATGCCCATTCTGTTTTTTCCTAAAAAAGACCTGAA 

AATGGGGGGGCCGAACACATTCCTTAGGGGCCCCCGGTGGNTTATTGAAATGTGCCTTTC

MGTTTTTCATTAAATGCNCTCCTGGCTTATTGGGCAGGACCATTCCCTTTGMCAATCC 

TGGGGGCCGGGCTGGGATTTCAACAAGAATTAGGCAATTCTTGGAATGGGCCTTCCAATA

ACCCTGNTGGGGAATTTTTCCCNTTTTNGCCCCCAACCTTGGGGGMTTTNATTATTTNC 

MGNTTTGGGGMGGGTTACCCTTCNGGGGGAAANGCTTAACCCAATTTTTC 

Sequence 950 

TTNGGGAGTCGCCACGCGTCCGGCCGGCGACGGCGGCAAGTGGCGGCCCGGCCTGCAGGA
GCCCGACGGGGTCTCTGCCATGGGGGAGTGACGCGCCTGCACCCGCTGTTCCGCGGCAGC

GGCGAGACATGAGGAGACCCCGCGACAGGGGCAGCGGCGGCGGCTCGTGAGCCCCGGGAT 

GGAGGAGAAATACGGCGGGGACGTGCTGGCCQGCCCCGGCGGCGGCGGCGGCCTTGGGCC 

GGTGGACGTACCCAGCGCTCGATTMCAAAATATATTGTGTTACTATGTTTCACTAAATT 

TTTGMGGCTGTGGGACTTTTCGMTCATATGATCTCCTAAMGCTGTTCACATTGTTCA 

GTTCATTTTTATATTAAMCTTGGGACTGCATTTTTC 

TTTCTTCTGGGAAAAC 

Sequence 951 

NNTCCGGAGTCGACCNCGCGTCCGCGGCTGCTGCCTGCTCTGGAGGCAGGCTGGGCGGTG 

GCGGCCGAGACTGGCGGGGGTGGACGCCCGGGCCGGGCTGCGCCCGCTTCTTGCAGCTGT 

GAATTCCTTTGGACAATTGATGATATTTATCATTGTGCCCAGTTTCTACAAATAAAAGAT 

GGGTGGATTATTTTCTCGATGGAGGACAAAACCTTCAACTGTAGAAGTTCTAGAAAGTAT 

AGATAAGGAMTTCMGCATTGGAAGAATTTAGGGAAAAAAATCAGAGATTACAAAAATT 

ATGGGTTGGMGATTAATTCTGTATTCCTCAGTTCTCTATCTGTTTACATGCTTAATTGT 

ATAT1TGTGGTATCTTCCTGATGAATTTACAGCAAGACTTGCCATGACACTCCCATTTTT 

TGCTTTTCCATTGATCATCTGGAGCATMGMCAGTAATTATTTTCTTCTTTTCCAAGAG 

AACAGAAAGAAATAATGAAGCATTGGATGGA 

Sequence 952 

TCNCCCCGCGCCGGTTTTGATACAGMTGAAAGTGCGTAGTATTTTCATTTTGTTTATTT 

TTGCCTTATACATAf AGCAAGCCCTCAATAAATAAATATTGAATGAATGAATGAGTGAGT 

GAAGAATTTGTTTATAACAGTCTGTCATCTTGATMCACTGGMTGTCTTTGGTTCTTCC

ACTTCATCCTTTATGTTTTAMCTTACACACACCATTCTTACACGTCACTAAAGGAAAAT 

ACCAGTATATATTGGCTAAAA I I II I I I I I I GTTGTTCAAAACTGAAACTCAAATGCCTA 

ATTGGGCTAGGGGTCCTCTTAMGGAGGTTGATGTTTGTCAAATGGGTTATTTTTTAAAA 

GCAGTAGATMTTGCTTATTTCMGGCMGTAMTGMTTTAGACTMGCTGTTCATAGG 

ATTCATCATT1TTTCCCCTCTCCCAAAGTAATTTGTAAGCCGTAAAC 

Sequence 953 

TCGCCCCGCGTCCGTGATTTCTCAGTGTTCTCCTTAGATACCAAATACAAAGGACGAGGG 

ATCAAGCTCAGCGAAAGTATCAGGCATTTAAGGTATCAGGCAGCAATGCGGGGAAAGGTG

AATTTTCTTCMTCAGCATAGGATGGTTAGGGMGAGCATTTATCACTTTGGTTCTTATC 

CTTCAAGCCAGGGGAAAAGCAACAGTGAGGACATCAGAGACAAAAGCATTTATAGAACTA 

ACAAACACAAACGTTTGACAAGTGAGAAAGCTTTATTAAAGCACACATACATGTCAGGGG 

GGTGGGAAACAAAAGAGCAAGTTACAGCCCGGGATCCCAAGTTATGCCTTCCATTACAAT 

TGCMTCCACACCAAATCMTCTTTGAAAACATTCCTCCATTCGGTTCATACATACAGTA 

GAAACCACTGTGGCTGCCCTTAATCCAGTGTGCTTATAGGAAATCAGTTAGCAGCTGACT 

CTGTTGAAAG 

Sequence 954 

CGTCCGGACCCTTATTAAGAATATCCCAGGAAGATGGTGATGAACAGCCTCAGTTTACTT 

TTCCACCAGATGAATTCACTAGCAAAAAAATTACAACAAAAATATTACAGCAGATTGAGG 

AACCATTGGCACTGGTGTGAACMTTMCCAGCAMTGTCCTTTTCTAATACCATTTGAA 

ACTAGACAGCTTTATTTCACATGTACAGCATTTGGCGCCTCAAGAGCAATAGTATGGTTA 

CAGAACCGACGTGAAGCCACTGTGGAGCGAACGAGAACCACAAGCAGTGTTAGGCGAGAT 

GACCCTGGAGAGTTTCGAGTTGGTCG1 CTCAAGCATGAAAGAGTAAAAGTTCCACGTGGC 

GAGTCACTGATGGAATGGGCTGAGAATGTCATGCAAATACATGCAGATCGGAAATCAGTT 

CTTGAGGTTGMTTTTTAGGAGMGAAGGAACTGGCTTGGGACCCACATTAGAGTTTTAT 

GCTCTGGTG 

Sequence 955 

ACCACGCGTNCGGGCAGAAATACGGCGGCATGTTCTGCAACGTGGAGGGCGCCTTCGAGA 
GCAAGGACGCTGGATTTCGATGCCCTCAGCGTGGGGCAGCGGGGCGCGAAGACTCCTCGG 
AGCGGCCAGGGCAGCGACCGAGGATCGGGGAGTCGGCCCGGGATCGAGGGGGACACCCCG 
CGCAGGGGCCAAGGCCGGNAAGAGAGCAGGGAGCCCGCGCCCGCCTCCCCCGCCCCCGCC
GGGGTAGAGATCCGGAGCGCCACCGGCAAAGAGGTGTTGCAGAACCTCGGCCCCAAGGAC

AAGAGTGACCGTCTNCTTATCAAGGGAGGCAGAATCGTCAATGATGATCAGTCCTTTTAT 

GCTGATATTTACATGGAAGATGGCTTAATAAAACAAATTGGAGACAATCTGATTGTTCCT 

GGAGGAGTGAAGACCATTGAAGCC 

Sequence 956 

CCCGCGTCCGCTACTGTACTTTGCAGTTTGATGTTTATTAACATTCTTTGGGCACCTAGC

TACMTATAACTCMTTTTCTGTGAAAMCTATTAATCATCCTATTTTTTCCTTGTCTTT

MTATGAGATAMTTTTATACCACTGTTTCTCAAACCATCTGTTGTGAGGGACAGTTTTG

CTTTTTAATTTCCAATTGTCAGAGACCAATACTTTTGTAAAATATAATTAAAAACAAACA 

TAAAAATAAACTTATTAGAAAAATGAAATAAAAGAGAAATGAAATAAGAATAATTTTATT 

ATTAGATTTAACAGATCAAATATTATTTCAATACTCAGATCAAATGTGCAATAAGACAGG 

GTTGCAAAAAATGCACACTTTTTTTATTAAAATCATTTATATAAGTAATTTATATAAAAA 

TAATATTACAGTTGCAACTTTCTGGTGNTTCTCAACTATGACCAAACAGGAGGGTACAAG 

TAAAGGAGCAATCCCAA 

Sequence 957 

GTCGACCACGCGTCCGCAGCAAAAGTGCCTGGCTGAAGGACACTGTTGACCCAAAACTGG 

TGACCCTCAACCACCGCATTGCTGCCCTCACAGGCCTTGATGTCCGGCCTCCCTATGCAG 

AGTATCTGCAGGTGGTGAACTATGGCATCGGAGGACACTATGAGCCTCACTTTGACCATG 

CTACGTCACCAAGCAGCCCCCTCTACAGAATGAAGTCAGGAAACCGAGTTGCAACATTTA 

TGATCTATCTGAGCTCGGTGGAAGCTGGAGGAGCCACAGCCTTCATCTATGCCAACCTCA 

GCGTGCCTGTGGTTAGGAATGCAGCACTGTTTTGGTGGAACCTGCACAGGAGTGGTGAAG 

GGGACAGTGACACACTTCATGCTGGCTGTCCTGTCCTGGTGGGAGATAAGTTGGGTGGCC 

AACAAGTGGATACATGAGTATGGACAGGAATTCCGCAGACCCTGCAGCTNCAGC 

Sequence 958 

GTCGACCACGCGTCCGCGCCAACTCCGGAGGCGCGGTGCTCGGCCCGGGAGCGCGAGCGG 

GAGGAGCAGAGACCCGCAGCCGGGAGCCCGAGCGCGGGCGATGCAGGCTCCGCGAGCGGC 

ACCTGCGGCTCCTCTAAGCTACGACCGTCGTCTCCGCGGCAGCAGCGCGGGCCCCAGCAG 

CCTCGGCAGCCACAGCCGCTGCAGCCGGGGCAGCCTCCGCTGCTGTCGCCTCCTCTGATG 

CGCTTGCCCTCTCCCGGCCCCGGGACTCCGGGAGAATGTGGGTCCTAGGCATCGCGGCAA

CTTTTTGCGGATTGTTCTTGCTTCCAGGCTTTGCGCTGCAAATCCAGTGCTACCAGTGTG 

AAGAATTCCAGCTGAACAACGACTGCTCCTCCCCCGAGTTCATTGTGAATTGCACGGTGA 

ACGTTCAAGAC 

Sequence 959 

CCACGCGTCCGAGGGTGGGGAAAGGAGGAGAGGAAGAGCACTCCCTTCCCTGGCCCCTCA 

TCCAGCCTCCGGTGCTGTAAAACGCAGGCGCTGGGCCGCGGGCGGAGCTGAGGACAGGCC 

TTGGCTGGTCCCAGGATGAGCGACGAGTTTGGTTTTAGCTGGGGATTGTGCTGGCATCCT 

GCGAAGCTCCTCCCAGCCGGTCTCTCTGTGCTCGGTTGTCTTGGGGTGGGGCCCATCCGC 

CGAGGTGGGGACCGATAGGAGAAGCCGGTGGGTTGTACCCTTACACTTGTGGAGTCTCCT 

CTTGCCTCTACCTACTCCGCCTTTGTCCTTAAGGTTTTTGCAGGCCAGTGCCAAACACAC 

ACTAACTGTCCTGGCCTCTCCGTGACACAAGTCTCTTCCCAGCCTTCCTC 

Sequence 960 

CCACGCGTCCGCGGACGCGTGGGGCCGGGACAACTGGTCTTATCACGGAGGCTGGGGCCA 

NGGCAGCCCTTCGGTTCGGGTGGGCCCATGGACCCCAGTCCAACGCCGAGGGAATAGGAC 

CATCCAAAAGCGGAACCTTCGCCTCAGAAAAAGGGTGCGGGACCCCTCCTCACCGTGCGG 

TCACGCGTGGACCCTGCCAGCAGCCAGGCCATGGAGCTCTCTGATGTCACCCTCATTGAG 

GGTGTGGGTAATGAGGTGATGGTGGTGGCAGGTGTNGGTTGGTNGCTGATTCTAGCCTTG 

GTCCTAGCTTGGCTCTCTACCTACTTAGCAGACAGCGGTAGCAACCAGCTCCTGGGCGCT 

NTTGTGTCAAGCAGGCGACACATCCGTCCTNCACCTGGGGCATGTGGACCACCTGNTGGG 

CAGGCCAAGGCNNCCCCGAAGCCAACTGA 

Sequence 961 

NCCCCGCGTCCGGGAGGCTCCATGTTGTCCCCTCAGCGAGTGGCAGCAGCTGCCTCAAGA
GGAGCAGATGATGCCATGGAGAGCAGCMGCCTGGTCCAGTGCAGGTTGTTTTGGTTCAG
AAAGATCAACATTCCTTTGAGCTAGATGAGAAAGCCTTGGCCAGCATCCTCTTGCAGGAC 
CACATCCGAGATCTTGATGTGGTGGTGGTTTCAGTGGCTGGTGCCTTCCGAAAGGGCAAG 
TCCTTCATTCTGGATTTTATGCTACGATACTTATATTCTCAGAAGGAAAGTGGCCATTCA 
MTTGGTTGGGTGACCCAGMGMCCGTTAACAGGATTTTCCTGGAGAGGGGGATCTGAT 
CCAGAAACCACTGGGATTCAAATCTGGAGTGAAGTTTTCACTGTGGAGAAGCCAGGTGGG 
AAGAAGGTTTGCAGTTGTTTCTGATGGATACCCAGGGGGCAT 
Sequence 962 

GCCCCGCGTCCGCTTCTCCGAATATAGCAACGTCCAGCAGTGTCCACACTGTGGGAACCT 

GGACTACCACTTCGTGMGCCATTTTCCTCCTTCAAAGTTCTCGAAGCTTATTGATGAAA 

GCTTTGCTTTAGTMTAGCTATTTTATTGATATTATTACTTTATTA 

GGAMCATTCTGTGACATTAATTTCCTTTCTAATTTAMGGAGAGTTACTTTGTTGTATG 

TGTGCCACTAAAATAGGGGCTGCCCTTGCCCTGTCTTGATTCCCGAGTGTTAATCTGTGG

TTTTGACCAGAGCCCAGATGGGTMTCCTGTGCATTTGGGTTGGGGGTTCACTCTTACCA 

AGAATCTTTGATGCAGCTTTAAGATGGTGGGGAGATGGGGGTTGAATTTAGGGAAAGAAT 

NTTTGTGGGTTATAAACTAAGAGCTTGATAGGAGTTGGAAGGAAACTCTTACTAAAATGT 

TAACTTTCTAAAAACCTTCTTTTANATCTTNCTTGGGCCTTTGGAAAA 

Sequence 963 

GTGTTTTGGGGATGCCTTTCCTTACCAGATTCTTCTAAAGCCCAGCTGCACCCACCCTTA 

AGTGGGAGATAAGGCTTCTGCCCGCGGGCTCTGCGTTCGTCCACCCGGCCCCACGTTTGC 

TGTGGACTAAACAGGAGCCACTGGACTAGAGTACACTTGACTCTCGGCTCTGCGGACCAA 

AAATTCCAGGACTAAGGAATAGCAAGGTTAGGCTGAAACAGTCCACACAGGGCTTGCGGT 

AAACGTCTTTTCAGGAGCCACTCGCCCAGTGCAGTAAGTCGTGTACTTAGTTGACTCGAG 

CGCTCCAGGGAGACGCCCGACCCTACTCTGCGCCGCCCCGGGGCACCAGCTCTGCTTCCT 

CCAGGTCCACTGAGGCAGGCACGCCCAGCTCTGGGACAGi3TCAGTAAACAAGCCACGAAC 

CGCGCCAGGGATCAGAGAACCCANAGTCCCCGCCAGCTGCCGGCACAAGCCAATCGCAGC 

GCANCCAGGCGGC 

Sequence 964 

GTCTAAGGGATCCAGGTCCTGTGTCTCAGGGACCTCTGATGGGATTGAATCCAAGAGGAA 

TGCAGGGGCCTCCAGGCCCNCGGGAGAACCAGGGTCCTGCTCCCCAAGGGATGATTATGG 

GCCACCCGCCTCAAGAGATGAGAGGACCTCACCCTCCAGGTGGACTACTGGGACACGGCC 

CTCAGGAAATGAGAGGTCCTCAGGAGATCCGAGGCATGCAGGGGCCTCCACCCCAAGGAT 

CAATGCTGGGACCTCCCCAGGAATTGCGAGGGCCTCCAGGCTCACAAAGTCAGCAGGGGC 

CGCCCCAGGGCTCTTTAGGACCTCCACCCCAGGGTGGCATGCAAGGACCCCCCGGACCTC 

AGGGACAGCAGAACCCAGCAAGAGGGCCACATCCATCTCAAGGGCCAATACCATTCCAGC 

AACAGAAAACGCCTCTGCTAGGTGATGGGCCCCGGGCCCCCTTCAACCAGGAAGGACAGA 

GCACAGGCCCCCCACC 

Sequence 965 

TGCGCATGCGCGGAGCGCGGCGCGCGCGGCGGTTGGGCCGTTGGCTGTTCGGCCCTGGGA 

TCCGCCGCCACTCCGCGATCAGACCGCTCTGTGCCGCGAGCCGCCGTGAGCACTCGGATT 

CAAGCCGGCGCCAACGAGTCCGGGGGCATCGCCCGCAGCGGCCAAGCTCATGGCCGGCTG 

AGCGGGACGCCGCCTNCGCCTCAGCCACCGCCGCCGCCGCCGNCTTCTTCTCCTCAGCCG 

GCGGCGGCCCGGGCCCAGCAACCATGGCTGAAGACTACTGGGACGGGCGCCTGCGGCGAA 

CAGGAGGAGAAAGGGAGGTCGCGCGGCCTCATTCCGGGCCGCCGCCCCAGGCGCCGCCGC 

GCCGGCCCCGCGGCTCTGAGGTTGCTCGCGCGCCCCC 

Sequence 966 

TGGAAMTNTTTTGGAAAAAATTACCCTTGGGACCTTGNTTTTNAANCCCNAGGTTCCCN 
GTTNNGGCAAATAMANAATGNNNGACCCGGGATTTNGGGNTTNNAAACCGGGGGTTTTT
AATTTCCCCNNNNCNNGGGNCC I I I I I I I I I NCCNCCCCCNCAAGGGGGNTTTGGGAAAN 
NAAAN CCCCCCCCI I I I I I I I I NGGGGGNGAAANTTCCCCGGGTNNNNGCCN I 1 1 I I I 1 1 
I I I I I AAA
Sequence 967

GTCCGCGAGGCTCCGCACCAGCCGCGCTTCTGTCCGCCTGCAGGGCATTCCAGAAAGATG 

AGGATATTTGCTGTCTTTATATTCATGACCTACTGGCATTTGCTGAACGCATTTACTGTC 

ACGGTTCCCAAGGACCTATATGTGGTAGAGTATGGTAGCAATATGACAATTGAATGCAAA 

TTCCCAGTAGAAAAACAATTAGACCTGGCTGCACTAATTGTCTATTGGGAAATGGAGGAT 

AAGMCATTATTCAATTTGTGCATGGAGAGGAAGACCTGAAGGTTCAGCATAGTAGCTAC 

AGACAGAGGGCCCGGCTGTTGAAGGACCAGCTCTCCCTGGGAAATGCTGCACTTCAGATC 

ACAGATGTGAAATTGCAGGATGCAGGGGGTGTACCGCTGCATGATCAGCTATGGGTGGTG 

CCGACTACAAGCGAATTACTGTGAAAGTCAATGCCCCATACAACAAAATCAACCAAAGA 

Sequence 968 

CGTCCGGGAACTCAGCAACGGTTTCTTCATCCAGGACCCGATTGCTCTGGTGGAGAGGGG 

GGGCTGCTCCTTCCTCTCCAAGACTCGGGTGGTCCAGGAGCACGGCGGGCGGGCGGTGAT 

CATCTCTGACAACGCAGTTGACAATGACAGCTTCTACGTGGAGATGATCCAGGACAGTAC 

CCAGCGCACAGCTGACATCCCCGCCCTCTTCCTGCTCGGCCGAGACGGCTACATGATCCG 

CCGCTCTCTGGAACAGCATGGGCTGCCATGGGCCATCATTTCCATCCCAGTCAATGTCAC 

CAGCATCCCCACCTTTGAGCTGCTGCAACCGCCCTGGACCTTCTGGTAGAAGAGTTTGTC 

CCACATTCCAGCCATAAGTGACTCTGAGCTGGGAAGGGGAAACCCAGGAATTTTGCTACT 

TGGAATTTGGAGATAGCATCTGGGGACAAGTGGAGCCAGGTAGAGGAAAAGGGTTTGGGG 

CCGTTGCTAGGCTGAAAGGGAAGCCACACCACTGGCCTTCCTTCCCCAGGG 

Sequence 969 

GATTGGAGGAGTCACATCCCCTCTTCAGCCGCAGCACCCCTCCCTCCCATCCTCTAGCTC 

TTCCCGCGGTGGTCGCCTCCCTCCGACCCTGCTCTCCCCTCCTGGGCCCCGCGCAAAGCC 

CCCTCTGTTCCAGCTCCCGGGCCTCGGCTGCCTCCCCGCCCTCCCATCCCTTCCTCTTCC 

CAGGGCCTGGAGCGCTCCCTTACATTCCTGAGATGCCCTTCCTCGGGGCTGTCCCCCTTT 

GCCTCCCCAGCATCCCATTTCTAGGCCTTTTTCAAGACCCTTCCAGAGCGGCCCCTTTCC 

AGCTCCCTTTCTCGTTTCCATTTC

TCCMGCTCCCCTCCCTCTTTCCCTTCGTCTTACCCTGCTTTGATCTACGCAGCCCCAAA 
CTCAAGCTCCCCGCTTTCAAGGTGGTGCGAGGTTGTTGGGGGGTGCGGAAGGGCCTGCCA 
AGTCCATTTTTCGAGGGG 
Sequence 970 

GTCCGAGATCGCGAGCCGCCGCCC I I I I I II I I I I I I I I ATAAGATTATTAGTATAAAAN 

GGGGAGACGAGGTTAGGGCCCTGGGAAAGGTGGGAGATCAGCCAGAGACAGGTTTCCCAG 

AACAGAATGTCTGGCCTTTGTGGTGAGGAGGGACTGTGGTATGAGCCGCAGAAGCGGGCC 

AGGGGTAAACCCTCCTGTGCGTCCTTCCTTCAGCCTGGTCCTGAGGGTGACCCTTTGATC 

CTGGGTTCTCCAGGTAGGGCTGTGAGCTGTGAGTTGGATCCTTTTGGTGAAATGGTCTCT 

CTCATCTGGCCTGTCACTCAATGTGGAATAGAGTGAGTGAGTTCTATGGGTTCTAAGTCC 

TGCTCTGGAACCATAAGTAAGTTATCCTCTCTGGGCTTCAGTTTTTCATGGAAAGTTGCG 

TTAAGAATCTAGTTTAAGGCCAGGCATGGTGGCTCACCGCCTTGTAATCCCAGCACTTTG 

GGGAGGCCAAGGAAGGTGGATCATGANGTCAGGAGATCGAGACCATCCTNGCTAACATGA 

TNAAACCCGTGTCTTTACTTAAAAAATAC 

Sequence 971 

CCTGCCAGTGGTGAGCACCTTCGGCCTCCAGGTGCCTTTCTTCTTCTTCGCGGCCATNTG 

CTTGGTGAGCCTGGTGTTCACAGGCTGCTGTGTGCCCGAAACCAAAGGGACGTCCNTGGA 

GCMATCCGAGTCCTTTTTCCGCACGGGGAGAAGGTCCTTCTTGCGCTAGGTCAAGGTCC 

CCGCCTGGAGGGGGCCAAACCCCCA 

Sequence 972 

GCGTCCGCGGACGCGTGGGCGGACGCGTGGGTCAGCCTCCACCTGGAAGAGAGCTANGGG 
CCGGGCAGGCCGGGCAGCTGCCACCCCGCCCGGCCCGACGCCCCGCATGCCCCGAAGTCC 
CTGGCGCCCACCCGGCCGCGGCCCTGCGTGTGACCCGCGGGTCGATACCTGGCAGCCCCA 
GTGCTGGGGCGCCGCGGCCCTGCTCGCCCAGGAGGAGAGCGAGGGCCCCACACTGAGTCT 
CTTGAAGCCTCACGTTTCCCTGGGGGGGTGCTGCATCGTCGGGTGTCCCTCACCCCACCT
GGGGAACCTCTGTCTTCAGGTCACCCCTTTTCAGGGGCCTGG
Sequence 973 

CGTCCGGGACCCTGCTCATGGAGAACATCAGCAGCTGGCTGCTCCTTCGCTGACGCCCTG 

GGCTACGTGAACCTGCCGCTCACCTTTTTCTGCCGGGCAGAGCTGGATAGTGAGCCCGAG 

CGGGTGGCGTCCGTCCTGGAAAAGCTGAAGGAGGACTGNAACAACACTGAGAACAAAGAA 

CGGAAGTCCTTNCAGAAGGAGCTTGTGATGGCCCTACTGAAGATGGACTGCCAGGGCCTG 

GTGGTCAGACTCATCCAGGACTTTGTGCTCCTGACCACGGCTGTAGAGGTGGCCCAGCGC 

TGGCGGGAGCTGGCTGAGAAGCTGGCCAAGGTCTNCAAGCAGCAGATGGACGCCTACNAG 

TCTCCCCACCGGTGACAGGAACGGGGTTGTGGACAGCGAGGCCATGTGGAAGCCTGCGTA 

TGACTTCTTAC 

Sequence 974 

TCACCACGCGTCCGCGAAGCGTGCACCGCTGCGCCCCCGCCGGTGAGCGCGGGGAGCGCC 

GCAAGCCCAACGCCGGGGGCAGCCCCGCTCCGGTGCGCCGCCGGNCGGAGGCCTCGCCGG 

TGCAGAAAAAGGAGAAGAAGGACAAGGAGCCGGGAAAACGAGAAGGAGAAGAGTGCCCTA 

GCCCGGGAGCGCAGCCTCAAGAAGCGCCAGTCGCTGCCCGCCTCCCCACGTGCCCGCCTC 

TCTGCCAGCACCGCCTCTGAGCTCAGCCCCAAATCCAAGGCCAGGCCATCCTCTCCCTCC 

ACATCCTGGCACAGGCCTGCCTCCCCCTGCCCCAGCCCAGGGCCAGGCCACACTCTGTCT 

CCAAAGCCACCGTNCCCCCGAGGCACCACTGCATCCCCCAAGGGGCCGGGTTCGGAGGAA 

GGAGGAGGCAAAGGAGAGCCCCAGCGCCGCANGGCCCGAGGACAA 

Sequence 975 

TCCGCAGAMCGGACTTTCTCATCATGCTTTCCTATGGTGGGTATGAGGGGCCAGCTGAT 

ACCAACCAACTGGCCTGTATCTATCTATCTGGATTTGACTTGAATTTTTAAAATGTGTAT 

CGTTTAAAAMAMMTGTTTGCAAATTTTGCACATAGGATCTTGCACTGTTCATTTTCA 

GTGGGGTGAGTCTTCACACTAAAAACACAAGCAGAGCTCCTGGGAAAAGAGACTGGAAGT 

GGTTCAGGATAAAGAGATCCATGGTGGGCAGGGCTCTTAGGTCACAGAGCTCTAGAAGCA 

GCTGGACTTGAACCCACMTGGCTTGTGTAMTTCGTAMTTTCATGGTTTCTAGGAAAA 

GCTGCATTG 

Sequence 976 

GCGTCCGGAAGMCTGTGGMCTGCGGTTGGCAGACMTTTTGTTACAAATATTAAAAGG 

AAAGATTTTGCCAATATGACCAGCTTGGTGGACCTGACTCTATCCAGGAATACAATAAGT 

TTTATTACACCTCATGCTTTCGCTGACCTACGMATTTGAGGGCTTTGCATTTGAATAGC 

AACAGATTGACTAAAATTACAAATGATATGTTCAGTGGTCTTTCCAATCTTCATCATTTG 

ATACTGAACAACAATCAGCTGACTTTAATTTCCTCTACAGCGTTTGATGATGTCTTTGCC 

CTTGAGGAGCTGGATCTGTCCTATAATAATCTAGAAACCATTCCTTGGGATGCTGTTGAG 

AAGATGGTTAGCTTGCATACCCTTAGTTTGGATCACAATATGATTGATAACATTCCTAAG 

GGGACCTTCTCCCATTTGCACAAGATGACTCGGTTAGATGTGACATCAAATAAATTGCAG 

AAGCTACCACCTGACCCTCTCTTTCAGCGAGCTCAGGTACTAGCAACCTCAGGAATCATA 

AGCCCATCTACTTTTGCATTAAGTTTT 

Sequence 977 

NCTCCMCMTTATGGCTCATCCTTCCTTTTACTCTGTCTCACCTCCTTTAGGTGAGTAC 

TTCCTTAMTAAGTGCTMACATACATANACGGAACTNGAAAGCTTTGGTTAGCCTTGCC 

TTAGGTMTCAGCCTAGTTTACACTGTTTCCAGGGAGTAGTTGAATTACTATAAACCATT 

AGCCACTTGTCTCTGCACCATTTATCACACCAGGACAGGGTCTCTCAACCTGGGCGCTAC 

TGTCATTTGGGGCCAGGTGATTCTTCCTTGCAGGGGCTGTCCTGTACCTTGTAGGACAGC 

AGCCCTGTCCTAGAAGGTATGTTTAGCAGCATTCCTGGCCTCTAGCTACCCGATGCCAGA 

GCATGCTCCCCCCGCAGTCATGACAATCAAAAAATGTCTCCAGACATTGTCAAATGCCTC 

CTGGGGGGCAGTATTTCTCAAGCACTTTTMGCAMGGTAAGTATTCATACAAGAAATTT 

AGGGGGAMAAACATTGGTTAAATAAAAGCTATGTGTTCCTATTCAACAATATTTTT 

Sequence 978 

CCCCGCGTCCGGGTCCCCGCGACTCCCGGACTGGAGAAAACGGCTCTTGCGATGGGGCGA 
AGTCCGAGCTGCGGCGGGCGTTGGTCCGTGCAGGGAAGTGGGAATCGTTAGGTTCGTTCT
GGACCCGCCGCCCCATGGCCCAGGCGTCTCGCTCAGGTAGCCTGCCTCCACTCGTTATCG

TGCCCCCGCTGAGGGCGCAACCCGGGGGCACTGGGGAGGAGCAGTGGGAGAGAAGTCGAA 

CGGNCGGNCTTCCGCTGGGCAGAGCTCAGCAGTACTTGGCAGCATGGGACCCAGCTTCCT 

TCCTGCTCCTGATCCAAAAGGACTTACCTNCTCTGTTGCATGAGGCAGAAGCTTTGTATA 

GCCTGGCCTCAGAGGAAAGCTTAGCTCTGGAAGTGGAGCAGCAGCTGGGCCTGGAGATCC 

AGAANCTGACTGCACAGATCCAGC 

Sequence 979 

AGGCTGNTACGAAGCGAGCTTGGGAGGAGCAGCTGGCCTGCGGGGCAGAGGAGCATCCCG 

TCTACCANGTCCCAAGCGGTGTGGCCCGCGGGTCATGGNCAAAGGAGAAGGCNCCGANAG 

CGGCTCCNCGGCGGGGCTGNTACCCACCAGCATCCTCCAAAGCACTGAACGCCCGGCCCA 

GGTGAAGAAAGAACCGMAAAGMGAMCAACAGTTGTCTGTTTGCAACAAGCTTTGCTA 

TGCACTTGGGGGAGCCCCCTACCAGGTGACGGGCTGTGCCCTGGGTTTCTTCCTTCANAT 

CTACCTATTGGATGTGGCTCAGGTGGGCCCTTTCTCTGCCTTCATCATCCTGNTTGTGGG 

CCGANCCTGGGATGCCATCACAGACCCCCTGGTGGGCCTCTGCATCAGCAAATNCCC 

Sequence 980 

ACCCCGCGTCCGGAAAAGAAGAGTGGCCNGTTCCAGGGGTAGCTCCAAAAGAGACTGCAG 

AGCTGTCCGAGACCCTGACAAGGGAGGCCCAAGGCAACAGTTCCGCAGGAGTGGAGGCAG 

CAGAGCAGAGGCCTGTGGAAGATGGCGAGAGGGGCATGAAGCCAACAGAAGGGTGGAAAT 

GGACCCTGAACTCCGGGAAGGCTCGAGAATGGACACCCAGGGACATAGAGGCTCAAACTC 

AGAAACCAGAACCTCCAGAGTCAGCAGAGAAGCTTCTGGAATCTCCCGGTGTGGAGGCTG 

GAGAAGGGGAGGCTGAGAAGGAGGAGGCGGGGGCTCAGGGCAGGCCTCTGAGAGCCCTGC 

AGAACTGCTGCTCTGTGCCCTCCCCCCTCCCACCAGAGGACGCTGGGACTGGAGGCCTGA 

GACAGCAGGAAGAGGAAGCAGTGGAGCTTTCAAGCCCCCACCACCAGCCCCTCTGTCTCC 

CCCACCCCCAGCCCCAACTGCCCCCCAACCTTCTGG 

Sequence 981 

GCCCCGCGTCCGAAAAGAATGGGTGAACCMTCGGCCTTTGTGMTTTATTCAGTGCCTT 
CTCTGTACCAAGCACTGGGTAAGGCACTTTTGTGGAGCATTAGACAGTAACCCTCAAGGA 
GCTAGAGMCCGGATGGGAGACATGAGCGGTMTTAACTCACTTGTTCCCCAGAGTTTCT 
ATTTGTTTTGATTTTCTTTTTCTGTGACTTATT

TTCACTATGGCCCMCTMTATAMCACCTGGAAATTACAAGGAAAAAAAATTCTTCCTC 

TAATAACTTTCCAMTTTGTGGMTATTTATTTGTMTAGCAGTTATCAGTTATGCTTAT 

ATAGCATTAAAAATTCTCCTCCTTTGACTACACACACAACCACAGTGTGGTTCTAATCAT

GGAGATATCAGTMTTTTTAGTAACTGMTTTTGAGGACATTTCTNTGTTTAGCATGTAT 

GCAAACTGATATGTAATCTGAGG7TCCAAAGTCAA I ITTTTTCTTTTT1 

Sequence 982 

TNGGGAGTCGACCCCGCGTCCGGTTTTTGTGAGGCAGTGAGACCTAAGGTAACCTTTATC 

AAAAGGATGGAGTTGGGAAAAGGAAAACTACTCAGGACTGGACTGAATGCGTTGCATCAA 

GCAGTGCATCCGATCCATGGCCTTGCCTGGACCGATGGGAATCAAGTTGTCCTAACTGAT 

TTGCGGCTTCACAGTGGAGAGGTCMGTTTGGGGACTCCAAAGTCATTGGACAGTTTGAA 

TGTGTCTGTGGGTTGTCCTGGGCCCCACCTGTTGCAGATGATACACCTGTTCTACTCGCT 

GTCCAGCATGAGAAGCATGTCACTGTGTGGCAGCTGTGTCCCAGCCCTATGGAGTCAAGC 

AAATGGCTTGACGTCTCAGACTTGTGAGATTAGGAGGGATCACTACCTATCCTTCCCCAG 

GGCTGTGTGTGGCACCCAAA 

Sequence 983 

GTGTCGACCCCGCGTCCGCGCCCTGCCTGCAGTTGAGATTCAGATGCCTTCTGACAGAGT 

TCAGCCTCTTGGAGAGTCTTGGGGATTGTTGGCACCTAAACAGAATCAGNGACCCGGGTG 

CTTTGTGGCCAGCAGCACAGAATCAAACCCGCATCCCAGCATTGGGCCACCCATCTGAGG 

GAGGCCAAAATCATCACAGATGCTGCTGTGCTGCAGACAGATACATGCTAGTCCAGAGAG 

CCGCCCCTGAGATGGCTGTGAGAACCATGTGTCTAAGGCGTAAGATAAGGATGGAAGGCT 

GTCCAAGTTATTTGGAAGGCCTCGGCAGCTTGGGATTAGCTTGGGAGCGCAGCGCTGCAA 

AGTGGAAAATATGAAAAGACCACACAGGCCCAAGCAGTCCAGAAACTGGGCAAAAATATT
CTGCAGTGGGGATTT A I 1 1 1 I I
Sequence 984 

CACGCGTCCGGAGTACGGAGTTGTTCCTTTACTGGCTGAAAGATATATTCGAATTGTAAA

GATGCTTTTTCTCATGCATTGAMTTATACATTATTTGTAGGGMTTGCATGCTTTTTTT

1 1 1 1 1 I ICTCCCGAGACAGGGTCTTGCTCTGGCGCCCAGGCTGGAGTACAGNGGCATGAT 

CTTGGCTCACTTCAGCCTTGACTTGGGCTCAAGTGATCCTCCTACCTGAGCCTTCTGAGT 

MCTGGGACTACAGGTGTGCACTCCTCGCCTGGCTMTTTTTTATTTTTTGTACAGGCAG 

GATCTTGCCACCTTGCCCAGGCTGGTCTTGAACTCCTGAGCTCATGCCATCTGCCTGCCT 

TAGTCTCCCAAAATGCTGGGATTACAGGAGTGAGCCACCATGCCCGGCTGGCAGTTGCAT 

GGMGAGMCACCTNTTTATGGCTTACCCTCTAGAATTTCTAATTTATGNGNNCTGTTGA 

AATTTTTGG I I 1 1 1 1 1 ACCT 

Sequence 985 

GTCGACCACGCGTCCGCTCGGCTTCCTGCTGATGGTCAGGGTTTTGGCAACTCCCCGGTG 

TGAGAGGGGTAGGGAGTGCTCCCGGCGGCGACGGGGCCGAGTTCACCAGCCGCCGGGGCA 

GTAGTCGAAGGCCCGGCGCGGCATGTCCTGGGTGCCGCGGTGCGGGCAGTGAACGCGCGC 

CGGGCGGGATGGGCCGGCGCCGGGCGCCAGAGCTGTACCGGGCTCCGTTCCCGTTGTACG 

CGCTTCAGGTCGACCCCAGCACTGGGCTGCTCATCGCTGCGGGCGGAGGAGGCGCCGCCA 

AGACAGGCATAAAGAATGGCGTGCACTTTCTGCAGCTAGAGCTGATTAATGGGCGCTTGA 

GTGCCTCCTTGCTGCACTCCCATGACACAGAGACACGGGCCACCATGAACTTGGCACTGG 

CTGGTGACATCCTTGCTGCAGGGGCAGGATGCCCACTGTCAGCTTTCTGCGCTTTCAGGC 
ACATTAACAGCA 

Sequence 986 

CGCCACGCGTCCGCGTACGCGTGGGCGCGACCGAGCGTGCGGACTGGCCTCCCAAGCGTG 

GGGCGACAAGCTGCCGGAGCTGCAATGGGCCGCGGCTGGGGATTCTTGTTTGGCCTCCTG 

GGCGCCGTGTGGCTGCTCAGCTCGGGCCACGGAGAGGAGCAGCCCCCGGAGACAGCGGCA 

CAGAGGTGCTTCTGCCAGGTTAGTGGTTACTTGGATGATTGTACCTGTGATGTTGAAACC 

ATTGATAGATTTAATAACTACAGGCTTTTCCCAAGACTACAAAAACTTCTTGAAAGTGAC 

TACTTTAGGTATTACAAGGTAAACCTGAAGAGGCCCGTGTCCTTTCTGGAATGACATCAG 

CCAGTGTGGAAGAAGGGACT 

Sequence 987 

GGTCGCCCCGCGTCCGTAGCAGTTACATCTACGAGGCTATTATGGATTGGAGGATGAGAA 

GGGAACTGCATGTACCTCAACAAGGCGTCGGTCAACACCGCGAAGTTTGGCAGGCTTGAC

AAGTGGAGTTTTTGAATCTATAATGGTTCAAGTTTTGAGACAGGAAGAACAGCTGAGAGC 

AAAAGAAGAAAAAAGGCTTCGGGAGCAGGAAAGAAAAGAAGCAGAAGAAGCTAGTCAAAA 

GGAAATAGAAGAATGGGAAAGAAAACTTCTAGCTCAAGCAGCTCCAACTTGTATGGAGAC 

CATGTGGGAMTTCCAGCTATTGGGCATTTCCTTTGTTTAGCTCAGCAAATTCTAAATTT

GCCAGAAATAGTCTTTTACCGAACTGGAACCGTTGTCTTCTGATGCCTCAGTGTAATGCT 

TTTCTATCGAAAATAATGACTTCTTATTTAAGTCCTCCCCATCGCAGA 

Sequence 988 

NCCCCGCGTCCGAGTCCCCTGTCTGTGGCACCAGACACTCCCGACTGTGCGCTGACTCTC 

CCCGCCCAGCCAGCAGCCTTTTCCAGAGAGGCTGTGGTCCATAGCCTCTGTTCGTTTTCA 

CTGCAGGACCAGGCACGAMGTTMAACAAAATGAAGATTTTTTCTGAATCTCATAAAAC 

AGTGTTTGTTGTGGATCACTGCCCTTATATGGCAGAATCTTGCAGGCAGCATGTCGAGTT 

TGATATGCTGGTGAAGAATAGAACCCAAGGAATCATTCCTTTGGCCCCCATATCTAAATC 

ATTGTGGACTTGCTCAGTAGAATCTTCCATGGAATATTGTAGAATAATGTATGATATATT

TCCTTTCAAAAAGCTGGTGAATTTTATTGTGAGTGACTCTGGAGCACATGTTTTAAATTC 

TTGGACTCAAGAAGACCAAAATTTACAGGAGCTAATGGCAGCATTAGCCCGCTGTTGGGC 

CTCCTAATCCTCGGGC 

Sequence 989 

GTCGCCCACGCGTCCGTTCGTTGTCTGATGGACCTGCTTGCAAAAGGCCAGCTCTGTTGC
ATTCCCAATTTTTGACACCACCTCAAACACCAACGCCCGGGGAGAGCATGGAAGATGTTC
ATCTCAATGAACCCAAACAGGAGAGCAGTGCTGATCTGCTTCAGAACATTATCAACATTA

AGAATGAATGCAGCCCCGTTTCCCTGAACACAGTTCAAGTTAGCTGGCTGAACCCCGTGG 

TGGTCCCTCAGAGCTCCCCCGCAGAGCAGTGTCAGGACTTCCATGGAGGGCAGGTCTTTT 

CTCCACCTCAGAAATGCCAACCATTCCAAGTCAGGGGCTCCCAACAAATGATAGACCAGG 

CTTCCCTGTACCAGTATTCTCCACAGAACCAGCATGTANAGCAGCAGCCACACTACACCC 

ACAMCCAACTCTGGAATACAGTCCTTTTCCCATACCTCCCCAGTCCCCCGCTT 

Sequence 990 

GTCCGGCTGGGACCTCCTCCTGTTGGGGTCCCCATGAACCCTTCCCAGTTCAACCTTTCA 

GGACGGAACCCCCAGAAACAGGCCCGGACCTCCTNCTCTACCACCCCCAATCGAAAGACA 

ATGCCTGTGGAAGACAAGTCAGACCCCCCAGAGGGGTCTGAGGAAGCCGCAGAGCCCCGG 

ATGGACACACCAGAAGACCAAGATTTACCGCCCTGCCCAGAGGACATCGCCAAGGAAAAA 

CGCACTCCAGCACCTGAGCCTGAGCCTTGTGAGGCGTCCGAGCTGCCAGCAAAGAGATTG 

AGGAGCTCAGAAGAGCCCACAGAGAAGGAACCTCCAGGGCAGTTACAGGTGAAGGCCCAG 
CCGCAGGCC 

Sequence 991 

NCGCGTCCGCTTAMTGACTCGTTATCATTTTGCAATGAATGGAAAATCATTCTCAGTGA

TACTGGAGCATTTTCAAGACCTTGTTCCTMGTTGATGTTGCATGGCACCGTGTTTGCCC 

GTATGGCACCTGATCAGMGACACAGTTGATAGMGCATTGCAAMTGTTGATTATTTTG 

TTGGGATGTGTGGTGATGGCGCAAATGATTGTGGTGCTTTGAAGAGGGCACACGGAGGCA 

TTTCCTTATCGGAGCTCGAAGCTTCAGTGGCATCTCCCTTTACCTCTAAGACTCCTAGTA 

TTTCCTGTGTGCCAAACCTTATCAGGGMGGCCGTGCTGCTTTAATAACTTCCTTCTGGT 

GTGTTTAMTTCATGGCATTGTACAAGCATTATCCAGTCTTCCAAGTGTTACTCTGCTGT 

ATTCTATCTTTMGTAACCTAGGAGACTTTCCAGTTTCTCTTMTTTGATCTGGCAATCT 

TTTGGGTAAGTGGGTATTTANAAT 

Sequence 992 

TTTTCACTGCAGGACCAGGCACGAMGTTAAAACMAATTGMGATTTTTTCTGAATCTC 

ATAAAACAGTGTTTGTTGTGGATCACTGCCCTTATATGGCAGAATCTTGCAGGCAGCATG 

TCGAGTTTGATATGCTGGTGAAGAATAGAACCCAAGGAATCATTCCTTTGGCCCCCATAT 

CTAAATCATTGTGGACTTGCTCAGTAGAATCTTCCATGGAATATTGTAGAATAATGTATG 

ATATATTTCCTTTCMAAAGCTGGTGMTTTTATTGTGAGTGACTCTGGAGCACATGTTT 

TMATTCTTGdACTCAAGAAGACCAAAATTTACAGGGAGCTAATGGCAGCATTAGCCGCT 

GTTGGGCCTCCTAATCC 

Sequence 993 

CGCGTCCGGGCAGGAGCACCACTCAAGGAGCTACACCCCTTGCATCGGCTTGACCGCCTT 

ACCTCAGGGGTGCTTATGTTTGCCAAGACAGCTGCAGTCTNTGAGAGAATTCACGAGCAG 

GTTCGGGACCGGCAGCTGGAGAAGGAGTACGTGTGCCGGGTGGAAGGGGAGTTCCCCACT 

GAGGAAGTGACCTGTAAAGAACCCATCTTAGTGGTGTCTTACAAAGTAGGGGTGTGCCGT 

GTAGATCCCCGGGGCAAGCCCTGTGAGACAGTGTTCCAGAGGCTAAGCTACAATGGCCAG 

TCCAGTGTGGTACGGTGCCGGCCACTCACAGGCCGCACACACCAGATTCGAGTCCACCTT 

CAGTTCTTGGGCCATCCCATTCTCAACGACCCCATCTACAACTCAGTTGCCTTGGGGTCC 

TTCTCGAGGCCGGGGCGGCTACATTCCCAAGACAAACGAGGAGTTGCTACGGGACCTGG 

Sequence 994 

ACGCGTCCGCGACCGCTGGGCATGCGGGTGTTGGCGCGGTATCCCCGCCCTGCCCAGCAT 

CTGCCCCACGTTTCTTCAGGCTAAACTACCGGGATCCCGGGCTTCTTCCTAAAGTAAAAC 

TCGCTCCGGAAAGGCCAACAGTCCAGCGGCCAGACGGGCACCTGGGAACGCGGGCCTAAC 

GCGTACTGGAGACGGAGTGGCGCCCGGCACTGCGCGCCTCCTCCCCGCCGGGAGACTGCG 

TGCTAAGCTCAGCAAAGCCCCGCTGTGGAGACGGAGCCATGTCGCCCATTACCTAATGAA 

ACTGAGAAGGGAGACTCAGTCTCTCTTCTAGCCCCGAGCGCAAGCTCTGCTGGACTTGGC 

ATCGTCCGCCCTCCACGATCCCACACTCCGGGTTTTCCCCATTCCCAGCTCGGCTGCAAC 

CGAGAGACAGACGGAAGAAAC 

Sequence 995

TCCTCCTGGCCCTGTTAATGTCGGGGCCNGGCCGGGGGAGGATGGCGCCCTAGAACCCGG

CCTTGCTGGGGTAGGGGCGGGAGGGGACGGGGTGGGGACCGGCCATGTCGGAGGTGACCC 

GGAGTCTGCTGCAGCGCTGGGGCGCCAGTTNTAGGAGAGGCGCCNNACTTCGACTCTTGG 

GGCCAGCTGGTGGAGGCGATAGACGAGTATCAGATATTAGCAAGACATCTACAAAAGGAG 

GCCCAAGCTCAACACAATAATTCTGAATTCACAGAAGAACAAAAGAAAACCATAGGCAAA 

ATTGCAACATGCTTGGAAT 

Sequence 996 

CGCGTCCGGCCTGAGCCGGCGGGTCCCCTGTGTCCGCCGCGGCTGTCGTCCCCCGCTCCC 

GCCACTTCCGGGGTCGCAGTCCCGGGCATGGAGCCGCGACCGTGAGGCGCCGCTGGACCC 

GGGACGACCTGCCCAGTCCGGCCGCCGCCCCACGTCCCGGTCTGTGTCCCACGCCTGCAG 

CTGGAATGGAGGCTCTCTGGACCCTTTAGAAGGCACCCCTGCCCTCCTGAGGTCAGCTGA 

GCGGTTAATGCGGAAGGTTAAGAAACTGCGCCTGGACAAGGAGAACACCGGAAGTTGGAG 

AAGCTTCTCGCTGAA7TCCGAGGGGGCTGAGAGGATGGCCACCACCGGGACCCCAACGGC 

CGACCGAGGCGACGCAGCCGCCACAGATGACCCGGCCGCCCGCTTTCAGGTGCAGAAGCA 

CTCGTGGGACGGGCTCCGGAGCATCATCCACGGCAGCCGCAAGTACTCGGGCC7TATTGT 

CAACAAGGCGCCCCACGACTTTCAAG 

Sequence 997 

GTCCGGCCAGGAGCCAGGCCGAGCGGGAGCTGACCANGGCTTGACTCGGGTACAGAACGA 

GGCACCAGTCCCCTTGCGAACCGAAGGGCCTCGCAGTGGATGGAGGAGGCCCAGCCCTGA 

GGTCAACGCCAACCAGGCTAGCCTGGCACGGGGCCTACAGGGTGGGTAGGCGGGCGTGCC 

GCAGCCGTCCAGGGCCTTCCCTCAGGTCCCGGGCCGAGGGGCCTACGCTGCGGCCCGGCA 

ACAAGGCCCGACTCGGCCCCTCGGGACCAGAGCCCCACCCGATCGGAAGGCGGATCCTTT 

ACCAGGGCCATAGGCCAGTGACTGGGCGGGCCCCTTNGGGCCTCCCATTCGGGGCCCGGA 

CTANGGAACNAGGCCCGGNNGAGGCCCCTTGGCCTACCAGACCCTTTTNTNANGCCGACA 

GCCGNCANGGAAAGAT 

Sequence 998 

CGTCCGGCCCAGAGCCCGCAGCACGCCGCCGCCGCAGCCTAGGTCACCTCCAGCATCTAG 

CACAACGTCTGCAATGGAACAGGCGAGCTGTGAATATTTGTGGAATGCATGGGTGGACTA 

AAGACCTATCACCTCACTCTAGAATGCCCAGCATGTTGGAGCATGAGGACCAAGAACCAT 

GGTGTTCCTCCACTCATCAGAGCCGTATCATTTTGATGCATGCGCCAAGAAAGAAAATTC 

AATCATCAGACTGAAGCAATCAAACTCCAAATGGTGCTGTAAACTGAACCACACATAGAC 

ATGCCATTCTTCTAAGGACCCTTAAGATCCACCCCAGGAGGAGCGCTAGCTGCTGTTCCC 

CATTCGATGCCCCTTTTCGGCCCGGAAGTAGCCGGAAAGATTCGCCCGCCAAAATTCCCC 

TAACCAGCAAGTTAGGTGTGGCATCTTCCACAAGCANGGAGCCGTTGTAGGAAAAAGGNG 

GTCTTGGGGAAGGTTTTTCG 

Sequence 999 

CCCGCGCCGGCAGTTTCNATGGTGTGTACTAATTTGAGAAAATGAATGTGTATACATACA 

AGAGTMGTCAGATTGTTAGACTCATCCCTCAGTATTCCATATGTTTTGTGACTGATTTT 

ACAGTTCTCTCTACCTTTCTCATTTACAMMMAAAAAGAAMGMAATTTGATTCAGC 

AATTCCTAAAAGTATTGTATTCAGTGACATCTTTGGAAACACCAGTTTCTGTTATCAACT 

TCAMTAATAGTCMGTTTTATGTATGATCTMAGGGAAMCMGTTTGTTTTTCAATCC

TGTGATAA I I I I I CTTTTAGAATGAGGTGTTGCAAGAAATGGAAAATTAAAAAAACCACT 

CTGTMCAATTTTGCTGTGCTTCTTTG ATTTTTCTCTG I I I [ I GTTTAATGGGTACCTTA 

TATTTGTACCTTTACATATTGMTTCATGAGGAGAGGTTATGCACAGCCTAGTTATTTGA 

CATTCCAGGGGGTTTAAAAAAAA 

Sequence 1000 

CCCGCGTCCGGCGGTGGCGGTGGTGGCGGTGGCGGCGGTGGCGGCGGCGGCGAAGGGGGC 

GGAGAGGAAGGAGCGCGGCGGGACCGGGCCGGGACAGCGCGTACTTTGGGCTCCGGGAGT 

CGCTCCGCGCCCGCGGTTGTAGCAGCTGCCGCTGCAGCCATAGCAGCAGGTCAGTCATTG 

GCACCATGAACTGGAATAMGGTGGTCCTGGCACTAAGCGAGGATTTGGCTTTGGAGGTT 

TTGCCATCAGTGCTGGGAAAAAGGAGGAACCCAAACTCCCACAGCAGTCCCACAGTGCCT
TTGGGGCMCCAGCTCTTCTTCTGGATTTGGAAAGTCAGCTCCACCACAGCTTCCTTCTT
TCTACAAAATTGGATCTAAGCGGGCCAACTTTGATGAAGAAAATGCCTATTTTGAAGATG 
AGGAAGAAGATTCTAGCAACGTTTGATTTACCTTACATTCCTGCT 
Sequence 1001 

CCGGCCGCCGCCGCCGCCCCGCCGCCACCGCCTGGGGGTTGGTTGAGGCGGACGGCGGGG 

TCCGGGCCGGAGTACGTCGTTCCCGCTGCGCTAGGGGAAGCGGGCAGTCAGAAAAATGGG 

TAAGAAGAGTCGAGTAAAAACTCAGAAATCTGGCACTGGTGCTACAGCAACTGTGTCACC 

AAAGGAAATCTTGAACCTGACCAGTGAGCTGCTGCAGAAATGCAGCAGTCCGGCGCCTGG 

CCAGGAAGAGTGGGAAGAGTATGTGCAGATCCGGACTCTGGTTGAGAAAATACGAAAAAG

CAAAAAGGTCTTGTCCGTTACTTTTGATGGAAAAAGAGAAGATTACTTTCCTGATCTAAT 

GAAATGGGCCTCTGAAMTGGGGCTTCTGTCGAGGGTTTTTGAAATGGGTTAACTTCAAA 

GAAGAGGGGCTTTTG 

Sequence 1002 

GTCGACCACGCGTCCGACGCACCAAAGGGCAAATACTCGGTAGCGACTCAGAGGGAAAGT 

GGGGTCTCTCCTGGGAGAGCAGGAGGCTGCCAGAAAAGAACTCAGGTCAGGGGTGCATAG 

GCGGCTGAGGAGTGCGGGACGGGCTGAGAGTTGGGGTGCCTCCCCGCCCGCAGGTGGGTC 

CGCAGATCCCGCGGGCCGCATTGGCCCGGCTGCTGCGGGATGCCGAGGGGCTGCAGGAGC 

TGGCACTGGCGCCGTGTCACGAATGGCTGTCAGACGAGGACCTGGTGCCGGTGCTGGCGC 

GGAATCCGCAGCTGCGGGAGTGTTGGCGTTGGGCGGCTGCGGGCAACTGAGTCGCCGGGC 

GCTTGGGGCTTTGGCCGAGGGCTTGCCCACGCCTGCAGCGCCTGT 

Sequence 1003 

CGCGTCCGCTTTNCCTTCTTGGTTCCACCTCAAACATCCCTTCCGAAGTGAGGCTTTCCC 

TGACTGGGGAGCATAAAGTAGCATCTCTCACATNCCATACACCCCTACAACGAATCTATG 

CAATGGCCCTGCTCTGCCATCGCCACCTGAAACCATCTCAATAAACACATTTTGGATAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGG 

Sequence 1004 

ACGCGTCCGTTGGCTGCGAGGAGCGCCCGAAAGGTCAGAGGAAGGAGCTGTGGGAAGCTC 

GCAGCAGGTATCGGAGCTTAAGCCAGTGGATTTGGGGGCCCTGGGCTCCCTAGCCGGCTG 

CGGTGTGAGAATGGAGTGGGCAGGAAAGCAGCGGGACTTTCAGGGCTCTGGGTTTGCAGA 

GAGCCGAAATGACCATGACTGCCAACAAGAATTCCAGCATCACCCACGGAGCTGGTGGCA 

CTAAAGCCCCTCGGGGGACTCTGAGCAGGTCTCAGTCAGTCTCTCCACCTCCAGTTCTCT 

CCCCACCAAGGAGTCCCATCTACCCGCTCAGTGATAGTGAAACCTCAGCCTGGAGGTACC 

CCAGCCACTCCAGCTCCCGGGGTGCTCCTTAAGGGACCGGCACCCCCCACT 

Sequence 1005 

NCCACGCGTCCGGCAGCGCTGCGACGGGACCGCGCGATTCCTCTCCCACGCATCTGGCCC 

GCGTTCCTGGGCCTCGGCACCGGATCCCGGCGGGGGTGTGGACCCAGGGCCCACTCTCCC 

CGGCGCGGCCAGGGCCCCCCAGCGTGCGAGCGCCTAGGGGATGCCGAGCTGCTCAAGATG 

AGGAGGTGCGCGGGGCCGGGGCGGAGCAGTCGCAGTTCCCCGCGTGTGAGCCCCCACCCA 

TCCCTGGCGCCAGCGCTTTCCCGACCACTCGGGTTCGGCTATGCGGGAGCCGNGAGGAGG 

AGGCTTGCAGTCGTGGACCTGAGACCTCGGGAGGTCATGCTGTCTTGCTTTTAAGTGGCT 

TNGGGGAAAGTGAAAGAAAACNCCNAAAATTGGAGGACTTTGCTACCAGGGACCTAACGG 
CACCAGTGG 

Sequence 1006

ACCACGCGTCCGGGAAAAGCCCGGAAGTGCCACGGGACTTCCTGTCTAAGGAAGAGCCTC 

GTGMGCTCCTCCACTGGGGAGTCAGTGGCCTTCGTTGTATCTGCCCCGCTTGTCCACCT 

CCTAGAGTGAATCCCCGCCTGGAGGCTGGGACACTAACCAAGAAGTGGCACATGGCATAT 

CACGGGAGCAATGTTGCCGCTGTACGGAGAGTGCTGGACCGAGGGGAGCTGGGAGCAGGT 

ACTGCCTCCATCTGANGCCGTCCTTTGAAGGGAGAACCTGGGGTAGGGTTCGAGGAGCCN 

GCGAGAACTGTGCACCTCCTCGGGAGGAGCAGCCCCCTCCTGTGCTGCTTTCCCCCTCCC 

TTCAATATGCTGGGGGCGGAGACCCTGGCCTCCAAAGTGCAATTCCGGGACCCCAAATCC 

CAGCGGACGCACCAGGCTTTAGGTGGGCGTCAAGTTGNTGTGTGCCCCCTGGCTTCTACA
CCCCGGGACCCCCTTCCG
Sequence 1007 

TCGCCACGCGTCCGGAAAAATTTATGCCTTTTTATTCATAACCCAGCTGTGGACCACTGC 

CTGAAAGGTTTGTACAGATGCATGCCACAGTAGATGTCCACATAATAAAATTCATAGTTA

CCMTGCAGTTTTGATATATCATTGGATTCTGTCTTTGAGTTGTAGGTTATTTCTTAGCT 

GCATGTTTTAAACTGAATTTGCATAGAGTTGTATGTTAATGTTTCAGTTAAGAGAAAAAC 

TTAAGATACATGAGTCATTACATAATGGGTATGAAATCTTTATAATCACCCTTCCACCCT 

CTATGGTGTCAGTACACATCACGTGTCATAGATACTTAAAATGTAAATGTTAACACTTTT 

CCTTCCTGCTGAGGATGTTTAGAGCCTAGTGCCAGACCCATTCATTTCCTTTTGATT 

Sequence 1008 

GCGTCCGGGCNGGGGAGTTTTGTCCATAACGTGGGCAACCGCGCAGCTGGAGGATGGCCT 

CACTCGGGCCTGCCGCAGCTGGGGAGCAGGCGTCGGGGGCTGAGGCGGAGCCGGGCCCCG 

CGGGGCCGCCGCCGCCGCCCTCACCGTCCTCTCTGGGGCCCCTGCTCCCCCTGCAGCGGG 

AACCTCTCTACAACTGGCAGGCGACCAAGGCGTCGCTGAAGGAGCGCTTCGCCTTCCTCT 

TCAACTCGGACTGCTGCGATGTGCGCTTCGTACTGGGCAAGTTGGCGGNGCCGCCGCCCG 

CTGGGGGCCCGCAGCGCATCCCCGCCCACCGCTTCGTGCTGGCGGCCGGCAGACGCCGTC 
TTTG 

Sequence 1009 

GCNCCCCGCGTCCGTTAGAGCTCAGGAAGTTATTAGGTGCAGCCTCTGGAGCCATACTCA 

CGCTGCAGTGCATAATGGGAAAATTAGGAGCATTAATAAGAAATTTCAGTAGTGTTTGTA 

AGGAAAATAAGCTACTTACTGAGATCTGTTTCTTCTATTGCATGTTTGCTTTTGAGGGAC 

AGCTTCTGTCAAAAGTGAAATCATCACCAGAACTGGGCCTGTTAGGAAGAATAGGGTTTT

ATTTACI I 1 1 I ATGTCAATTAACTTCAACAAAAAGGCCACGCTGGCTGCTGTCATGCCAT 

CTGGGTATGCATTAAACATTMTGATGATCAGCCTTGAGGTTCTATTTATCTTGATTTGG 

CTTTATAAAGTTTGTCAGAATGGTGCTGGAGGNCCAGAAGTGCTAAGGAGAAAGAAGCTA 

TGGGCCAAGTTAAAGAATTTGAATGCAAAGGCCAGGNATGGGAGTTTTCATAA 

Sequence 1010 

CGCCNCGCGTCCGGTGAGCCCCAGCAAGGAGATCAAGATCGTGTCTGCCTGAGGAAGCAG 

AGCCATGACAATCGGAAATCTACCAGCTCAATGTCCTGCATGTAGACTACCGGACCGTGA 

GCAATCTGATTCTGACGGGCCCACGGACGATTGTCATGGAAGTCATGGAGGAGTTAGAGT 

GCTGAGCTCCTGGGCCTCCCAGCCCTCCAGTGGCCTGTGGGTGAGGGAAGCCAGAATGAC 

ACAAAGCAATGCAAAGACAAGATTGCCATGCAAATGGATGGTTTTGGACATACGAGTCTT 

CTCCGCACATACATGTCTAAAGTTGAGTTTTATACACTGGAATGTGGAAGAACCCGGGTA 

TCATATCTTTTTTAAAAAATGTCCAGTGTAGAAAACATTTGGGAAAC 

Sequence 1011 

ATTTTTCTAACATGGGTTTGMCGCTTATAACCAGTTTTATAAACCCCTTGAACACTGCA 

GTGAGTTATCAAAGCCACTGCCTGCAAAGTGGATGATTTAAGATTTTACACGCATGAAAA 

TGAGTGTGCCATCTCCTGACCAGTGCCTTTTGACTTAGGTACCCAGATGCCACTTGTCAG

CAGCAGGATACTTTTTACAACACGAAAGCATAATTATTTTAGAAGAAGAGAGTAGAAGGG 

CAGAATAGAATTCAACTTACAGAAGCACCGGAGTAGTGTTGTGGTTGGCTGTTATCTGTC

CCCCTGGGAGGAGGGACTGTTTTGCTCCCTTGTTTTNGATGTTAAACAGTAGCTTAAAGG 

CTTTCCCCCCCATACCAACTTACAGNCAAATGACAAAGAACCGGTGGNGGT7TTCAACAG 

ATTCTACAAACATGCATTTTTCCCTTCCCACTAAATGGG 

Sequence 1012 

GTCACCNCGCGTCCGCTCGTCCTCCGTGGGCACTGATGTCACCGAGGGCCCTGCTCACCC 

AGCCCCCCACACTAGGCTGTTCCATGCAAATGAGGAGGAGGAGCCAGAGAAGAAGGAGGT 

ATCGGAGCTGCGCTCTGAGCTATGGGAGAAGGAAATGAAGCTTACAGACATCCGCTTGGA 

GGCCCTCAACTCTGCCCACCAACTGGATCAGCTTCGGGAGACCATGCACAACATGCAGTT 

GGAGGTGGACCTGCTGAAAGCAGAGAATGACCGACTGAAGGTAGCCCCAGGCCCCTCATC 

AGGCTCCACTCCAGGGCAGGTCCCTGGATCATCTGCATTATCTTCCCCACGCCGCTCCCT 

AGGCCTGGCACTCACCCATTCCTTCGGCCCCAGTCTTGCAGACACAGACCTGTCACCCAT 
TGGATGGCATCAGTACTTTGTGGGTCCAAAGAGGGGAAGTGACCTTCGGGTGGGTGGTGA
AGG 

Sequence 1013 

CGCGTCCGAAGAAAATGGGATCCATGAAGAACAAGACCAAGAGCCACAGGATCTCTTTGC 

AGGGGATGGTATGAATGCATATGTAGCCTACAAAGTTACMCACAGACAAGCTTACCATT 

GTCTCAGAAGCAMCAGTTTTGCAGGTAMAAAGNAAAGATNTTAGTGACTTTCTTGGGT 

CTTTATTGAGAAGCTTTCCNGAGNAAGCCACTCTCAGAAATTGGCTTCATATGTTCCCTC 

CCGCCCCCGGAGTAAGNAGCCCTCATAGNGGGATTGACATAAAAGTTGAAAAGNTTGNGG 

MCGGAAAGAATTCNriTCTTTCTGGCAGNMTTTTCTTTGAAAAAAACCNGAGGGGCCCG 

CTTTAGAAAAAGGGTACCCTTTCAGGAGGGATTTGTTAAAATTCANTCCTACCCCATGGT 

TTMCAAGGGACTCCTTGACNGTTCAGTMGTAGGTTTCTTTGNNAAAAAAGGAAAGGAA 

GCTGCCCACCGTGCCCGTNGGGGATACCCCNAAGACAATTGANGTTGGGTGGCTTGGNTC 

CTTTNCTCAAAGGMTGGTTTTCAAANCAAAAAGCCCACCAAGATTGCCCNGTCCANGCC 

AAAAAAT 

Sequence 1014 

GTCGCCCCGCGTCCGCGGNCGCGTGGGGTGCTNGTCACCAGACTGCACCCTTGCCAGCAG 

CTTCGCAGCTCTCGAAGTAANTTATCGCANGATGGCCGGCGCCTCACCTAGGAGAACCAG 

GMGGCAGGCCNCGCTAGMCGACGGNATTGMTTTTACTATTGNCAAAACAATCACATT 

CAAATTCATTCCACTTAAAACCTGAAAACATTGGACCACACAA 

Sequence 1015 

AGTCGACCACGCGTCCGGGCGGAGGGAGCGTGACTGCGCTGCGCAGGGCGCTAGGAGGCA 
TTGTCGCCGCTCAGGCCCTTTTGTGAGAAGCAGACCAGCCTGGGGGCTGGCGGCAGGACA 
CCTGTGTCTGCATGCTGAAGAAGATGGGTGAGGCCGTGGCCAGAGTAGCAAGGAAGGTCA 
ACGAGACGGTGGAGAGCGGCTCTGACACTCTGGACCTGGCCGAGTGCAAGCTGGTCTCCT
TTCCCATTGGCATCTACAAGGTCCTGCGGAATGTCTCTGGCCAGATCCACCTCATCACCC 
TGGCTAACAACGAGCTTAAGTCCCTCACCAGCAAGTTCATGACCACATTCAGTCAGCTCC 
GAGAGCTCCACCTGGAGGGGAACTTCCTACACCGCCTCCCCAGCGAGGGTCAGTGCCCTG 
CAGCACCTCAAGGCCATTGACCTGTCCCGGAAACCAAGTTCCAAGGACTTTCCT 
Sequence 1016 

CGCGTCCGCTTTCAGTGAAGAAAAGGGAATTACACATNGAATCGACACATCAGTAATACC 

GATACAGTGMATGGGCCTCTMTMGMTTTNAGCGNGTTTTCTGATGTGCCATTTm 

TTGTCTTTTTTAAAMTATACCATANTTATAAMNTGGNAAATANNTTTTTGNACACCAT 

TTAAATTGACCCCCTTANAGNACNCTTGCCGTNATGNTGAAANGCTAGACCTATNGAAGC 

TGNCTTGANGATATNTGTTTTTTTAAAAAMTTTTTTACAACTNTACTTGTTGG 

TAATATGCACTATAAAATATGATCNTATATCCTATTATCTATNATCTAAAAACACTTCCT 

T GGACNC ATTTANACGTAAAATTAAAAATGGGTCTTTAANGAAGANTAATGGGGAGGCCC 

CI I I I I I AAAACCTATGGN NCMTCTTTTTATGNCAMGGGGNGG ACCATTTTATTAAAA 

Sequence 1017 

GCGTNCGCTGCGCCCGTGGGACCGGTGAAGTTCTGGCGACCCGGTACAGAGGGGCCAGGT 

GTAAGCATCTCTGAAGAGAGACAAAGTCTGGCTGAAAACTCTGGGACAACGGTTGTTTAC 

MCCCTTATGCTGCCCTTTCCATAGAGCAGCAGAGGCAGAAGCTGCCGGTATTCAAGCTT 

AGGAATCATATTTTATACTTGATAGAAAATTATCAGACAGTGGTGATTGTTGGTGAAACA 

GGATGTGGGAAGAGCACACAGATTCCTCAGTACCTTGCAGAAGCCGGCTGGACAGCTGAA 

GGAAGAGTGGTAGGAGTGACCCAGCCTCGAAGAGTGGCTGCTGTTACACATGATCTTTCT 

TNCCAAAGGTTGCAGGGAGAGTAGCTGAAGAAAGGGGTGCAGTGCTGGGCCACCAGGTGG 

GCTACTGCATCCGCT 

Sequence 1018 

AGTCGCCCCGCGTCCGGTGGGAATCTTTCNACTTCTTGATCCATCTGGGAGAGAAGACGT 

ACCATTATGTGCCCGAATTCCGAAAAGTGTCCATAGCAGCTACCATCATCTATGCCTATG 

CCTGGCTGGTTCCTCTTGCACTCTGGGGTTTCCTCATGTGGAGAAACAGCAAAGTTATGA 
ACATCGTCTCCTATTCATTTCTGGAGATTGTGTGTGTCTATGGATATTCCCTCTTCATTT
ATATCCCCACCGCAATACTGTGGATTATCCCCCAGAAAGCTGTTCGTTGGATTCTAGT 
Sequence 1019 

GGAGTCGCCACGCGTCCGGTGGCACGATCTTGGCCCACTGCAAGCTCCGCCTCCCAGGTT 

CACGCTATTCTCCCGCCTCGGGAGCTGGGACAACAGGTGCCCGCCACCACGCTCGGCTAA 

TTTTTTGTATTTTTAGTACAGACGGAGTTTCACCGTGTTGGCCAGGATGGTCTCGATCT^ 

CTGACCTCGTGATGCACCTGCCTTGACCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCA 

CTGCGCCCGGCCAATAATTTTTTTAGTTTMGTTCATTTTTGTCCCCGCTGATGAMG^ 

ACAGCAGCTGCCCAGTGCCTCTGTCCACACCCACCTCCCTGGTGTACTTGCCCCCTACAG 

CAGCAGCCCAGATCCTCCCCTGGATTAAAATTGCAACTGGTGCCCTAACCCAAAACTTGA 

GAAAAAATTCTCTATCACATCCACTCTTCTGGCATTTGTAGAATCTC 

Sequence 1020 

GTCCGGTNGATATATATATTTACCTCCTTAGTAATGCAAGAAGTGTTTGTGGGAAGCAGA 

GMGCAAGCAACTGTATTTCTTGTTCTCACCTAAGCATTACTGGAGGATAAGCCACATCA 

GTCTACAAAGAGGTTTTCATACAAACATAATAAGATGTAAAATGGACCAAAAGTTGAAAG 

CACATTCTTGCAAGTAAGCACCTGTTACTCTCCMGCMCCATGGGTTTACCATATTTGG

GGATTTTTTGAACACTTAGNCACTTTCTTGCTCCCNMGGGGACNTTTACAAAAAGTGNA

NACATTTTTGTANTGTNNCCCGTTATTAM 

CAAMGGGCTTGGTNTTTTTGGACAMTTCAAAMTGGMAAATGGATTTTCCACCGTTT 
TMGCCTCMTTTTAAGCCCCAAGGTTC^^ 

GTATTAGGTNGGNCCTGGGTTAAAAAACCAAGAANAAAACCATTNAAAACAAAGCCCATT 

CAAAAATTCTTNTGGAAAAAAA 

Sequence 1021 

CAGGGAGTCGACCCCGCGTCCGAGCATCTTGGGGAATTTATATTCCTTTGTGAGAAATGT 

TTTGATCATAAGCCTAGAATGATAAGTAAAAGAAATAAAGATAATTCTACTGCTTGTTCT 

CACCCGGTTACAAAGCATGAGTTTGMGACAATMGTGCCTTGTCCACATTTTGCGAGAG 

ACAACAGTAMATACTCCAAAATACGTTCTTTTCATGGTCAGTGTCAGCTTGATTTATGT 

CGACATGAAGTTCGGTATGGCTGTTTAAGGGMGATGAGTGCTTTTATGCCCATAGTCTT 

GTGGAACTGAAAGTCTGGAf AATGCAAAATGAAACAGGTATCTCACATGATGCTATTGCT 

CAAGAGTCTAAACGATATTGGCAGAATTTGGAAGCAAATGTACCTGGAGCGCAGGTCTTG 

GTAATCAAATAATGC 

Sequence 1022

CNCGCGTCCGCAAAGCAAACGACAGCTCAGGGGGCTCCAAAGACCTCATTCATAGCAGCA 

AAAGGAACTCAGGTAGTCAAAATATCAGTACACATGGGACGTGTCAGTTTAAAACAGGAG 

CCCCGGAAGAGTCATAGTCCCAGCAGTGACACATCAAAACTAGCAGCTGAAAGGGACTTG 

AATGTGACCATCAGTCTTAGTACTGATAGACCAAAGCAGCGATCACAGGCAGTAGCAAAC 

GAGAGGGCACACCCTGCCAGCACAGCAGTGNCGAAGTCTGGGGAAGCCATGGCCTTAAAC 

AAAACTAAGACTCAGAGCAAAGAAGTCAATGCAAATAAACACAAAGCCAATACGAGTCTT

CCTTTTCCTMGTTCACTGTCMTTCAAATCGCTTAAGGAAGCAATCTATTAATGAGACA 

CCTTTGGGAAGTTTGTCAAAGGATGATGGAGCTAGAGGGGCTCATGGGG 

Sequence 1023 

CNCGCGTCCGGCCAACCGCCGAGGAGCAGTGCCCTATTCAGCACAGTGCGCCAGGGCCAC 

TGGCAGATTGTTGATCTTTTACTCACCCATGGAGCTGATGTCAACATGGCAGACAAGCAG 

GGCCGCACTCCCCTGATGATGGCTGCTTCCGAAGGCCATCTAGGAACCGTGGACTTTCTG 

CTTGCACAAGGTGCCTCCATTGCTCTTATGGACAAAGAAGGATTGACAGCCCTCAGCTGG 

GCTTGTTTGAAGGGCCATCTCTCAGTAGTACGTTCTCTGGTGGATAACGGAGCTGCCACA 

GACCATGCTGACAAGAATGGCCGTACCCCACTGGATCTGGCAGCTTTCTATGGCGATGCT 

GAGGTGGTCCAGTTCCTGGTAGATCATGGGGCCATGATCGAGCACGTTGACTACAGTGGA 

ATGCGCCCTTTGGATAGGGCAGTGGGGTG 

Sequence 1024 

GTCGCCCCGCGTCCGAGAAGTCAGGGAGTGGAGGTTCTATAAGGAATTAACAGCTGAGGA 
CGGMGGGTTTGTTTCCCGTTTGAACCTAAACGCAAGTGGAAAAGAATACTCAGAATGTA

I 1 1 1 1 CTACTTTACATCTGCTGGGGAAGGAAATGTGTCAGGAAGCCGCTGCATCTGGTCA 

TTTCATCGCATCAGAATCACAGCAGACGTGGAAGATTCCATGTGGTGGGGAATAAAGAAA 

TAACTTTATGCTCTCCTGAAAAACAGCGGGAGCCTATGTGTGTGTGCGACACTGTAATCT 

CAAGGAGATTCACTCAGAGCTGTCTCAGTCCAACTCCTGCATGACCAGATCTTCCCTTAG

CATCTTTTCTGTGATGAAATATTATCTTGTGTTAGAGTTAGGAATAGGAACTAACCTGTA 

GGAGCATGTCCCCAAATGGACATTTGAATGGACTAACAAAAACAACTGGAAAGACTGAAT 

TTCCGACACAAAGGAATGATGGGATCAAAAAGAAAGC 

Sequence 1025 

GGAGTCGACCCACGCGTCCGGTCGACAGCCTCCGCCACATCCTCCACCTCTCTTGGTCCA 

GCGAGCGTTGCCGGGCCAGGGTCAAGCGGAGGGCTCCGACGGCGCGGACGGAGCGAAGCG 

CCGAGCCATGGCGCACCAAACGGGCATNCACGCCACGGGAAGAAGCTGAAGGAATTCTTT 

GCCAAGGCACGGGCTGGCTCTGTGCGGCTCATCAAGGTTGTGATTNGAGGACCGAGCAGT 

NTCGTGCCTGGGTGCNCTTCGCAAGGGAGCCAGNTAAGGCNCGCTNGGGGATCAGGGACT 

ATTGAACAGGNGGCCCGTGGCTTGCNCACCTGCNTGGGACCGCCCCAGGCAGGCCCTGCN 

TACCTGGCTCTACCGCGCTTCGACTNACAAGAAATGGCTCAGGGGNCTTTCGAAATGGGG 

CTTCTTTCCTTCGCCCTTGGTTCCGCNCTNGAATAAACCTCCCCCCCGTGGCGGCTTGAA 

AGANTGCCTGTACCGCCGGGNCAATGCNGGGCCCCACAAGTGGAAAAAAGGGAAG 

Sequence 1026 

AGGGAGTCGACCCACGCGTCCGCTCCCGCCAGGCGCTTTCTCGGACGCCTTGCCCAGCGG 

GCCGCCCGACCCCCTGCACCATGGACCCCGCTCGCCCCCTGGGGCTGGNGATTCTGCTGC 

TTTTCCTGACGGAGGCTGCACTGGGCGATGCTGCTCAGGAGCCAACAGGAAATTAACCGC 

GGAGATCTGTTCTCCTGCCGCTAGGACTACGGACCCTGCCGGGCCCTACNTTCTCCGTTT 

ACNTACTACGACAGGGTACACCGCAGTAGCNTGCTCGCCAGTNTCCTGTTACNGGNGGGC 

CTGCNGAGGGGGCMCCGCCCMCCAATTTTCTTACACCCTGGGGNAGGGCTTGCCGAAC 

GMTGCCTTGCTTGGGAGGGATTAGNAAAMAAGGTTTCCCCAAAAGTTTGCCCGGCTGG 

CAAGGATGGAAGTGTTGGGACCGAACCCAGGATGTGAAGGGGGGGTTCCACCAGAAAAAA

AGGTTAi I I MCTTTTAATCTTAAAGTTTCCANTGGMCATGGTNGAAAAAAMTTTCTT 

TTTTNCGGGTGGGGGTGGTTCACCCGGGAAC 

Sequence 1027 

CGTCCGTAGTCTCTCTCGTGGCCCGGAAAAAAAAMGAAAGAAGGTTGGGGCCAGTCACC 

CCCACATCCCTTTATGGAGGCTTCCAGATCATGGATCCTGTCACGCGCATCCCGGTGAAG 

AGAGTCCACCAACAGGCTTTGTATGGGGTCTCGCTCTGTTGCCCAGGCTGGAGTGCAGTG 

GTTCGATATGGCTCACTGCAGCCTCAGCCTCCCTGGGATCAAGTGATCCTNTCACCTCAG 

CCTCCCAAGTGGCAGGGACCGCAGGAAGGCCTGGACGACGGCCCGGACTTCCTCTCAGAA 

GAGGACCGCGGACTTAAAGCAATAAA

Sequence 1028 

CGCGTCCAACCCCTTCCTCAGCCTGTCGGGAGCAGAGGGCAGTGGCGGTGGCCCCAAAGG 

AGGGACCGCTGACAAAGGAGCCTCAGCCAACCAGGAAAAAGGCTAAATCCACCCTTACCC 

CTCCTGACCCCCCCAAGTGGAGGGAACAGATCCTGGCCTGAGGGGTCCTAGCCTGGAGCA 

GGCGCCTGCGCCCAGACCCTGGAGAGCCTTGACCCAGAGCCTGTGCTGAGGTCCAGGGAG 

TGTGGAGAGCTCCTGGTGTCGAGGACTGAGACTGACAGGGGAGCCCCCTCCATCTGGCCC 

CCTTCCCTTTCCGCACTGTCCGCTTTGTGAGGCTCAGAGGAAGGACAGTCTGCAAGCCCG 

CCTAGGAGGTCCATCCCCAGCAAATGTTTTGGAGGTCCCCCCAGAGAGCAAAGTGGGCCA 

TGGCAAGAAGTAGGGGGTTGGTTGGACCTGTCACATGAAATGGATCAACACTTGAATGGG 
GA 

Sequence 1029 

CGTCCGGAGAAGATGGGCCTCCCGGGCTCAGACTCACAGAAAGAGCTGGCCTGACCACCA 
GGCACCTCACTGGCACTGCTGACCCATCCCAGAAACACAATCTCAGGGACCCGAGCAGCT 
CCAAGGACGAGAGGATACAGCAGACACAACCTAATAGAGAGGGCGCCTGCAGCCTTAACC 
TCCACGGCCTTCGATACTTATGCAAGCCTGGTGTTGCTCCTGTCCTCAGAGTCATCCTGC 







GCTCATGCCTTTTCCCGMTGGGTTCACCTCTGGCAGTTGCCGCTTCAGTCTTGGCCTTA 
GCCTCATCTTGAAGTGGGTAGCTGGCGGGAGAGGGTGGGCTGCGCCCCCTGCTGGCCCTG 
AGGCTGCAGAGTTGGGAGCAGGACACCTCACCTGAGTTTCA I 11 I I I I I CATGTCCAAAC 
CATGCACATACTATAGTCCAGAATCAAAGCACTTTTGAAAA 
Sequence 1030 

CCCCCCCGCTTTTTTNNANNCNCGGGGGGTGNGGCAAAACTTTTTCCTAGAATTNGCGCC 

ATGTTGGAAACNNTNTCNCAGCANCCCGGCTGCACAGGGNGTAGCCCANCCTCCAGGTCC 

ACGGAGTGGTGTGGACCTCCCACCTCACAGCTGCCTCTGGCAGCCAAGCCTCTTTTCGCC 

CGGCCCCAGCCCCTCTGGTTGATAAACGGGTGGGCCTCCTCAGCAGCGTGGCTGCCTTTC 

ACCTTGATTTCCCCAGGGCTCTCGGCAACATCGATAAACCAAGCCTCGCCCACCAGCTGG 

GCCCTCCCCCACCCAGTCTGCCAGGCTGGGAGCTGGAGCTTGCTGAGTCTTGAATGCCCT 

TCTAGATGGCTTCTCTAGAGGCTCTCCTGGCAAGAGAGGGTCCCAAGGGGAGCCCTGCAA 

AGCAAAGGCTCCTTGTCTGGGGCGGGATAGAGAATCTCGCCTCTGTCTGGTGTGTTACCT 

ACTGGGGGCACAGGAACAATTTCCTCAAGGAGACAGTGGCATGGAGCTTTGAAAGACGAA 

GTANGTGTTAGCAAGGAAATAAGGAGGAACGGGGGTTACGGGCAGAGGAGAAAGCACATG 

CCAAGTCAGCAMGAAMGTAGAATTCGAAAAACTTTTTTA 

Sequence 1031 

GGCCAGAGCTACTACGCCGGCCGATGGCGAGGAGCCCGCCCCGGAGGCTGAGGCTCTGGC 

CGCAGCCCGGGAGCGGAGCAGCCGGTTCTTGAGCGGCCTGGAGCTGGTGAAGCAGGGTGC 

CGAGGCGCGCGTGTTCCGTGGCCGCTTCCAGGGCCGCGCGGCGGTGATCAAGCACCGCTT 

CCCCAAGGGCTACCGGCACCCGGCGCTGGAGGCGCGGCTTGGCAGACGGCGGACGGTGCA 

GGAGGCCCGGGCGCTCCTCCGCTGTCGCCGCGCTGGAATATCTGCCCCAGTTGTCTTTTT 

TGTGGACTATGCTTCCAACTGCTTATATATGGAAGAAATTGAAGGCTCAGTGACTGTTCG 

AGATTATATTCAGTCCACTATGGAGACTGAAAAAACTCCCCAGGGTCTCTCCAACTTAGC 

CAAGACAATTGGGCAGGTTTTGGCTCGAATGCACGATGAAGACCTCATTCATGGTGATCT 

CACCACCTCCAACATGCTNCTGAAACCCCCCTGGAACAAGCTGAACATTGTGCTTATAGA 

CT 

Sequence 1032 

TCGCCCCGCGTCCGCMTTTCTTTTGMTTTCGATCACTTCTACATTCCCAGCTTGCCAC

ACTCI MM IGATGAAGTTGTGAAGCAGATGGTAGCTGCCTTTGAAAGAAGAGCATGTAA 

GCTGTATGGTCCAGAAACAAATATACCTCGGGAGTTAATGCTTCATGAAGTCCATCACAC 

ATAMGGCMAAAAGAACTGGTGCCACCTGCTTCTGACTTTAGTTTGTTCACTTTTAGGA

AGTATTTTCATGACATGTTTTCAGMGCCAGAAAGCATTTGTTAMCGCAGCTTTGGTTA 

TAMCCTGCACCATTGAAAATTTGCACATAGAATATAGACTCACTTGTACATAGAATTAT 

TTCTTCAAGTATAATTCAAAATAATATGGACATTATCATGTTCTGCATTACAATAATGGG 

ATGTCATCACCATTGCTAGAATACTGGCATGATTCTTCTGAGCAGAAGTTGAAACTGTAA 

ATTTAAACCTTTTAATTATCACCTTACCT 

Sequence 1033 

NCGCGTCCGGCCTTTTGTTCAGCTTGCCGGGCAACCGGCCCCTGCTGGGGACTACAAGTC 

CCGTMGCCTCCGCGGCGGCACGTCCTACCCTACACTGTCCAGCCGGCTCCCTTTTTCCC 

CCTCCCCGGGGGCCAAGGGCTCCGGCTGCTGCCTGGCGGCCAACGGGCCAGGTAGGATTT 

CCGGGAGAGGCTGCTGTGGAGGCTGAGGAGGCGGCGGCGGAGATCTGGAATGAATTTTCT 

ATCTGTGAAATTGTGAAGACGAAAAAAGAAAATTGTGCTAGTTTTGCTGTTGTACTCCAC 

ATTGCAGAAGTTACAATCACAGTGCACAAAACAGTATCTCACCTCCCTAAACTGGTTAAT 

AGTGGCATGGAAGATCCATTTGAGGAAGCAGACCAGCCCACTACAGAGCCAGGCATGGTC 

CTGGACAGTGTGGAAGCAGGAGACACAACACCTTCTACCAAAAGGAAGAGCAAGTTCTNA 

GGCTTTGGCAAGATCTTTAAGCCCTGGAAATGGAGGAAAAAAAAAAGTAGTGATAAATTT 
A 

Sequence 1034 

CGTTTTNNTTCTTCCCTTGGATNAATATTTTCTACTGAAAAGAAATGAAATCTCAGTTCC 
ACTATGATAAAAGAAACGAATCACCAGTGTCTGTCAGTGCCCCCACTGCTCCTCCTCATA 
CCAGCATTGACACTGGTAGCTGAGGAAGGGGACTGCTCAATTTCTAATGTGATCTATTCA

AGMGCCACATATAMAAGGCATTGAGGGCTCACTGTCCAGAGAATTGCTTTTGTAAATG 

CTCCACAACGTATGGGCAAAAGTTTAAGGACTACTGCCTGATGTACCAGGAATCCTGAGT 

TCTGTGAAMTCTGTTCATTCCAAATCTGTTAAGCATTTCCAAACATCCAGAGAATGGTT 

TCACTGTTGAGGGTGCATGTGGCAGAATCTGTCTCTCTCTGCACCTGTCTTCTGTTACCA 

TCCCTGGACAGTGACAGATTTTMGCCAGCCACCAGAATTCTTTTCGAGTTAACCATTTC 

TTAGTTTCTGTAAA 

Sequence 1035 

GTCGACCACGCGTCCGCTCTGACCGCCGCCGGCTTTCACCCACCTGCCCGGCTCATCACC 

TCTGACTCCTGCGGGCCTTCCAGCCGGCGCTATCTCGCAGCCCCCGACCAGCTCGGCCTG 

CTGTCGGGCATCTCTGCTGTCCTCCGGCTGTAGGGCTACCTGCCTCCCTCCCGGGGGGTG 

CACTGCGAGTCCGACTCTAGTGGGGGCAGCGCCTAGCGGACTCCCAAACCCGCTCAGGTC 

CTGCCGGCCTCACCGGCACCGCCCACCCTCCAGCCGGCCGCGGCTCTGCGCTTGCGCGCC 

GGCAGCACACAGTGTTAGGGGCGCGCCCTCTGCTCTGGACATGCGCCGCCGTCG 

Sequence 1036 

GTCCGCCCCGCGTCCGGGAAAATGCCGCAGTTTGTTGCCTTGAAACCTAAGAGCAATCCT
TGGTTTTGTTGCTACATTATT1TTCCAG 

CTTTMTTTCATAATAAAGTTAGTAGAGTCACTCAACTTACAAACTTTATTTATGNTGGC 

TTTGGGCMAAAMTCACTTATAAGGCAGCTCTAAATTTGCCTTGATAAGCTAAATAAAT

TACTTTTATAACTTACTAMGCAGAACAMCAGTGAAACTTTCTAAAATATTCTATNCTG 

GAAATAGNGGACAGGGGGATCTTTTATTTATAATNCTCATCAAGATGAGTGAGGTTGTTC

ACCAGGATATTTTTTATGG I I I I I I I I I MTTTTTCTCCAAAGNAATTATTTTTAATAGG 

AATTCCCAAAAGNAATCAAGAAATTAGTTTTTCAAAAATAMTTTTTTCCAGNTNGATAA 

AAAGGAAGGTNGTTTGTAAMTTAAATCCATTATTTTACCACCTTAAAAAATTTGGGGGG 

AATACCATTCTAAAGGGMCCTTTTAATTCTTTACCTAATTCANGNTAGGGGNGTCTTGG 

CAATTTNGAATANTTTTCNTTTT 

Sequence 1037 

GCGTCCGAAAATATTTAGGTMTGTCATAAAATTTATTTTACCTTTCTCATTT^ 

AAATMATGAAAMMCCCTAGATATTGCTTTATTACCAACAGTGTGTAGGTTTTTGTAC 

ATATGGAMTTTGACACAMMMTAGGGAATTTGTATAGAGMGTTTCCCTCTTATAAA 

AGGACTCCCATTTGATTGTTCGAMCTATAAMTGCACTTTTACTTTACCATATCTTGAA 

ATGACAAMTATCGCCCTTTGGAAMCCTGACTCTTTGCACCGTTGTAAATTCCCAGAGT 

CTACCTCAGTTAACCAGGCCTTAGTTTTAGGCMGGAATGMTTGAATTAAMTTTCAGT

TCMTCATTTTATGCCAGMTTTTGTTmCTTTTTM 

CTGGNTGGCCCTTCCCTCCCCATTTAACTTTTCTTTTTTTMTNCTTTGAA

TTAMAATATTTCCMTCCTTTTCCTTTTCCTCTAGCCAAAGTNGGTTTTGTNATTT 

AAATAAAAGGGCCCTCTGGTGGAAAANGGGNCTGGAAATTAACCTT 

Sequence 1038 

CGCCNCGCGTCCGCAAGACTTTGAAAAATNNGATCATGGTTCTTCTCAAAATACCAGCAT 

GTCTAGCATCTATCAGAATTGTGCMTGGAGGTTTTGATGTCCAGTTGTTCACAGTGTAG 

AGCTTGTGGAGCTTTAGTTTATGATGAAGAAATTATGGCTGGATGGACAGCAGATGACTC 

AAATTTGAATACAGCTTGTCCATTCTGTAAAAGCAACTTCTTGCCTCTTCTCAATATAGA 

ATTCAAAGATTTGAGAGGTTCTGCAAGCTTTTTCCTGAAACCAAGTACCTCTGGTGACAG 

TTTACAAAGTGGAAGCATTCCATTGGCAAATGAATCCTTGGAGCACAAACCTGTATCCAG 

TTTAGCAGAACCTGACTTGATCAACTTTATGGACTTCCCAAAACATAACCAGGATCATAA 

CTGAAGAAACAGGCTCTTGCAGTTGACCCAAGTGATGAAATAAAGAGAGCCAGTGGGAGA 

TGTCCAAACTATTGMAATTTCATCTGTGNCTTAATAGGTTTATC 

Sequence 1039 

TGANATTAGCATCACTTCGTCTACTMGAATCTTAATAGATGTAMAATATCTTTTAAAA" 

CATATGGTAGGATGGGTAAAATTTGGCAATACTATCCAGGMGTCACTAAGTACAGATGA 

ACTGATTTAGTCCTMTTCCAAGAAGTGTGATTCCACCTACTTGACTAGAAATTTATACC 







TGGTAATMCTCCTTGTCCTTGAAGATTTTCAACTMGGAAMCTGTTTTTCAGCAGGAC 

CTGATTATGCACTGCTATCTAGGTAGGGTCACTTATGGTTTTATMTATATTTAATTGGA

TTATAATATTCC I Mil I CTTGTCTCTTG G ACAA AATCCT AG CTTT ACTGT AATTT AAAA 

AGATGAGTTTAAMTTTCAGGCTTTAAAAACATACCAAACATTGATAAAAATGAAATCTA

GATAAAAGTATTTTATCAATGTTCAGTTGCCTGGATTCAATAACTGTATTATGGTTATGT

AAGGATAATATCTTAGGAAATCACATTATGGTATTAA 

Sequence 1040 

GTCGACCCCGCGTCCGGTAGCTTAGTTGAGTAGATAATCTTTTGTTGTTTCCTCCTTGTA 

ATATACAAGCCTTGGCTTCTGTGACATCATACTCTCCTAGATTTCCCCCTGTCACTGTGG 

CTTCTTCTCAGTCTCTGTCCATCCCTGGTGCTCCTGMGGTTCTGTTCTCAGCCTTACAC 

ACATTACCTGGGTGATCTCATTCTCTGCCATGACTTCACTTGCCATATATGTGCTGATTT 

TCCCCAAATTCCTATTTCTCCCGACCTTTACATCTATT7TATTTGCAGGTCATATATCTA 

ATAAGGAATTGATATCCAGTGTACATGTAGAACTCCTGTAATTCAATACAGAAACCAAAC 

AGTCCAATTAATAMTGGAGAAGAGATTTGAATGAACATTTTTCTAAAGAACATCTCAAG

CTCAAGATTTCCCAGATAACTTTTCTTTCTTCAAMTCTGCTTCTGTGTTTCCTCATCTG 

TAGGTGGCACAGCATACATCTGATTTCCCAAGCCAGAAACCTCATAGTTATTCTTGACTC 

CAGGAAGAAATATTATTGAGTTTTTAAAAACTC 

Sequence 1041 

CGACCCCGCGTCCGTGCTGAACTGAGCTCAGGTGTGTTTTTCTTCCAAGCTTTCTAGCAA 

GGTTTCTACTTAAAATCACCTGTGTGCAAGCCCAAAGGACATTTCATCTATTCTAAGCAG

MAGGCTGTTTTGTTCATTACAGTGAGTGCTGTTCATCTCATGGAGTGGGAGGAGCACTA 

AACCAGGAGACAGAGGACATGGATTTGGTTTCCAGCTTAACCAGTTAGGACTCTGTCCTC 

TGCATTCTGGAACCATGATGCCTGCCTGCCTGCCTCACAGGGCTGTTGTGAGGACCAGAT

GAGATGATGTATGTTCATACTTTTGGAATCTCTMTTTAMGTCTTAATATTTTGTCTTC 

TGAGTGTGAGGGGATAAACCTGGATGTAGACTATTAAGCAGCATAGGAGAAAAGAACAAT 

AGAATCTAATGGACTGGGTTTGCAATCTCTCTCTAAATGCACTGCTTCAGACAAAGTGAA 

ATCCAAAGGTGTGAAAAAGTATAGCTGCAAATTGGAAAAATGTGTTTCAAGAGT 

Sequence 1042 

AGTCGACCACGCGTCCGCTCTGACCGCCGCCGGCTTTCACCACCTGCCCGGCTCATCACC 

TCTGACTCCTGCGGGCCTTCCAGCCGGCGCTATCTCGCAGCCCCCGACCAGCTCGGCCTG 

CTGTCGGGCATCTCTGCTGTCCTCCGGCTGTAGGGCTACCTGCTCCCTCCCGGGGGGTGC 

ACTGCGAGTCCGACTCTAGTGGGGGCAGCGCCTAGCGGACTCCCAAACCCGCTCAGGTCC 

TGCCGGCCTCACCGGCACCGCCCACCCTCCAGCCGGCCGCGGCTCTGCGCTTGCGCGCCG 

GCAGCACACAAGTGTTAGGGGCGCGCCCTCTGCTCTGGACATGCGCGCCGTCGCGAGCGT 

CTCTGGGACCGGAAGTGCGGGCGAGCGCGGNTCCCCGGGTCTGACAGGAGCAAGCTGTGG 

GCACCGNGGCGGTAGTTGGAGGCGGNAGAGGGTNCGTAGCCGNGCCGNCTGCCCGNCATG 
GGCCTNC 

Sequence 1043 

AGTCGACCCCGCGTCCGCAGGGGCGTGTTGGCCCCGCACAGATTGAGCCGAGTTGTCGCC 

CCGCTGGGAGAAGTGACCCTCCTGCGCCTGAAAAGAAATTTCATTTCAATATAGGTGACT 

ATGCAGCCTGCAATTCAAGTATGGTTTGGAGAAGATCTGCCTCTAAGTCGTCGGAGTCCT 

CTGACTCCCAGACACGGACCAGGATTGGCTAATGTTTGTCAGTACGATGAGTGGATAGCT 

GTGAGGCATGAAGCCACTTTGTTGCCCATGCAAGAAGATCTGTCAATCTGGTTATCTGGT 

TTATTAGGTATTAMGTTMGGCAGAMMTTATTGGAAGAACTTGATAATGGAGTACTA 

TTATGTCAACTGATTGATGTTCTTCAAAACATGGTGAAAACATGCAACTCTGAAGAATCA

GGGMTTTTCCAATGAGMMGTGCCCTGTMGAMGATGCTGCATCAGGTTCATTCTTT 

GCTCGGGACAATACCCGCAAACTTCCTTCACTGGTGTAGGGACATTGGGGTTGATGAAAC 
TTA 

Sequence 1044

ACGCGTCCGCCCACGCGTCCGGAGTCCTCCTCCCCGGGTGCCTGCCCGCAGCCCGCTCGG 
CCCAGAGGGTGGGCGCGGGGCTGCCTCTACCGGCTGGCGGCTGTAACTCAGCGACCTTGG 
CCCGAAGGCTCTAGCAAGGACCCACCGACCCCAGCCGCGGCGGCCGGACTTTGCCCGGTG

TGTGGGGCGGAGCGGACTGCGTGTCCGCGGACGGGCAGCGAAGATGTTAGCCTTCGCTGC 

AGGACCGTGGTGAAGCCTCTGGGCTTCCTGAAGCCCTTCTCCTTGATGAAGGCTTCCAGC 

CCGCTTCAAGGCACACCAGGATGCACTTGCCACGGNTTGCCGTGCCCCCTNTTCAGCAGT 

CCCT 

Sequence 1045 

GTCCGCAGMTTGACNMTTCAGGAGGTGTAAAAATAMCAGTGTTCTCTTCTCTACCCC 

AAAGCCACTACTGACCAAGGTCTCTTCAGTGCACTCGCTCCCTCTCTGGCTAAGGCATGC 

ATTAGCCACTACACAAGTCATTAGTGAAAGTGGTCTTTTATGCCTCCCAGCAGACAGACA 

TCAAGGATGAGTTAACCAGGAGACTACTCCTGTNACTGTGGAGCTCTGGAAGGCTTGGTG 

GGAGTGMTTTGCCCACACCTTACMTTGTGGCAGGATCCAGMGAGCCTGTCTTTTTAT 

ATCCATTCCTTGATGTCATTGGCCTTTNCCACCGATTTCATTACGGTGCCACGCANTCAT 

G 

Sequence 1046 

ACCACGCGTCCGCCCACGCGTCCGGGCGGGGGCATGGACTACTGACCCATGCGGGGCAGC 

GTCCCTGTGACCTGGCCGATGAGGAAGTACTGAGCCTGTTGGAGGAACTGGCCCGGAAAC 

AGGAGGACCTTCGGAACCAAAAAGAAGCTTCCCAGAGCCGGGGCCAGAGCCCCAAGCGCC 

CTCTAGCAGCAAACACAGAAGGAGCTCTGTGTGTCGTCTGAGCAGTCGCGAGAAGATTTC 

CCTCCAGGACTTGTCCAAGGAGCGCCGGCCTGGTGGGGCTGGGGGGCCCCCCATCCAGGA 

CGAGGATGAGGGGGAAGAAGGTCCCACCGAACCACCCCGTGCAGAACCCAGAACCCTCAA 

TGGCGTCTCCTCCCCGCCGCACCCCAGCCCTAAGAGTCCCGTGCAGCTTGAAGAGGCCCC 

CTTCTCCAGGCGCTTTGGCCTCCTGAAGACAGGGAGTTCTGGTGCCCTGGGTCCCCCTGA 

AAGGCGGACAGCGGAGGGAGCCCCTGGGGCTGGGCTTGCAACGCTCGGCTTTCTTCTCCT 

GGCTGGAAGGGACCTTCACTTANGCCAAGGAGCTTCGTNTTGGCAGAATTACCCCGACCC 

CCTTCCCGAAGCTTGCCGGAGCCCTTNTGTCCTTGTCTGAGGGTCACCAAGCCTTCTTCC 

TTTGCTTGGGAGMCTTCCTTGNCTTNCCTTCAGGAATTCCNGAGCCTGGATTCCCAGCG 

AANCCNAACGTTCCCACAGGCTTTCACGGGGC 

Sequence 1047 

TGTCGACCCACGCGTCCGCTCCCCGCCGAGGCCTCCTGCACCACCTAGAGCCCCACCCCC 

GACCCCACCCCGGGAGGGCAGAGCCAGAAGAAGGCTCATTAGACCTGGGGGACCCAAAGG 

GTCTGGCCTCTTTGGGCAGCCCCAGAGATGAGGGGTCAGCAGAGGAGAGCTCTGGGGTTG 

GGGATGGGTTAGGGACGCAAGCTTGAGTTCTAGCCCTTGCTCTCATTCAGCTGTTGTGTG 

ACCCTGGGTMGACCCTTCCTTGTTTGACCCTCAGCTTTCCCATCTGTTTAATGGTGGCT 

TTGGCCAAGGCAATCCACAAACGTCAAAATTCCCCTTCCCATCAGTACACACACCGATGC 

ACACACACTCTCTCTTTCTCTCTCTCTCTCTCTCTCTCACACACACACACACACACACAC 

ACACACACACACACTAGTTAGTGCCTTGGATGAGGCGGGGCAGTGTGTATATGGACCCCT 

GGACTTGCTACCTTCAGGGTTCCATACTCGTCCCTCCCCTCCTGGCTCTGCTGTCTGGAG 

TCTGGCAAGCGGG 

Sequence 1048 

CGCGTCCGCCACGCGTCCGCCGNCCGCCGCCTGCCTGGGCCGGGCCCGAGGATGCGGCGC 

AGCGCCTCGGCGGCCAGGCTTGCTCCCCTCCGGCACGCCTGCTAACTTCCCCCGGTACGT 

CCCCGTTCGCCCGCCGGGCCGCCCCGTCTCQCCGCGCCCTACGGTCGGGTCCTCCAGGAG 

CGCCAGGCGCTGNCGCCGTGTGCCNTCCGCCGNTCGCCCGCGCGCCCGCGCTNCCCGCCT 

GCGCCCAGCGCCCCGCGCCCGCGCCCAGTCCTCGGGCGGTCATGCTGCCCCTCTGCCTCG 

TGGCCGACCTGCTGNTGGCCGCCGGGCCCGGGCCGAGCCTGGG 

Sequence 1049 

NCCACGCGTCCGCAAGGCGCTACGTTTATTGCCTCGTCTTATTCACTGACCTTTGTAATG 

ATACACAGTGMTTCTTTTTGACAAAGAGAAATGCAGTGTAGTATGCAGAGCTGCTGTTT 

TAATGCCTATGCATTTACTCTTTCCTGATTTAGGCAGAGGTGGCATTTTCTTTATTGCAT

TTCTCTATTTTTTTAATGTACCCTACCTTCAGTATTCTCTTTGTAAGTTGGTGACTTGCA 

TCTGTGGCCTTGAATATTTTATTATCACATGTGGCATAACAGTATCCACACTTTTTAGTT 
CTTTAI I I I I I I I I I I I ATTTTGAGCAATTCTCCTGCCTCAGCCTCCCAAATAGCTGGG

ATTACAGGGTGCATGCCACCACACCCAGCTAATT 

Sequence 1050 

CGCCCCGCGTCCGGGATGGACAAGACAMTCTCTTGTAAAAAAMTTACMGTMTTTTT 

ATAGAAGCTCTGCCCTGAGGGAGGGGGAGCGTGACTTCTCACTCCTTCAGTGTGGGCTGC 

ACAAGTGACTTCCTTCCATATGGGATCGTTATAACAAAAGACTGTAACAAGGGCTATGGG 

AGTTATAAGACAGGAATTGTGGACAAAAACCAGTGTATATCATAACATCACACCTTGTAA 

TGTTGGCAGTACAGTCACTGACCTTTGATAAATGTTGATGACATGTTGAGTAAAGGAATG 

AGAGAAAGAGGATTGTTTATCTCTGTTTTTATCCTTCTCAGAGAAACTTAGAGTAACAAG 

GTGTGTTATCAGCCATGCTGATGCCTTTGGTAAACTATTGTGTGANATNGGGTGGTTTGA 

ATTGGTCAAGTAGAAACTGGGGCTGCCAGGCGCAGCCGGTAAGCATTTNCATGTGAGCCT 

TAGGGAANAAGTGCATTTTGGTAGGAGCCATCAAMATAGCTTCTTGATATTTCATTAAA 

AG 

Sequence 1051 

GACCACGCGTCCGGGGCTCCTGGGTGTGCCCGCGGCCTCTGGCGCGCAGCGACTCAGAGA 

ACGTCTACGAGGTCATCCAGGACTTGCACGTCCCGCCGCGGGAGGAGAGCGCAGAGCAGG 

TGGACGACCCACCGGAGCCCGTGTACGCGAACATAGAGAGGCAGCCCCGGGCCACTTCAC 

CGGGCGCCGCTGCAGCCCCCCTTCCCAGCCCGGTGTGGGAGACGCACACGGACGCGGGCA 

CCGGGCGCCCCTACTACTACAACCCAGACACGGGAGTTACCACCTGGGAGTCGCCCTTTG 

AGGCTGCCGAGGGTGCCGTCAGCCCAGCCACCTCCCCTGCCTCGGTGGACAGCCACGTGA 

GCCTTGAGACCGAGTGGGGCCAGTACTGGGATGAGGAGAGCCGCAGGGTGTTCTTCTACA 

ACCCGCTGACGGGCGAGACGGCCTGGGAGGACGAGGCCGAGAACGAGCCCGAGGAGGAGT 

TGGAGATGCAGCCGGGCCTGAAGCCCTGGCAGCCCAGGGGGACCCGNGGNCCC 

Sequence 1052 

CGGCTTTGCCGCMCATGCTCMTTCCCGATCATCGCTCAMGTGCTAAATTTTCAGGAG 

TGAAAAGAAAAAGAGGAAGGAAGAAACCCCTCTCAGGCAATCATGTACAGCCACCCGAAA 

CAATGAAATGTAATACATTCATAAGACAAGTGAAAGAAGAGCATGGCAGACACACAGATG 

CMCTGTGMAGTTCCTTTTCTTAAGAMTGCAAGGAAGCAGGACTTCTTAATTACTTAC 

TTGAAGAAATATTAGACAAAGTTCATTCAATTCCAGAAAAACTCATGGATGAGACTACTT 

CAGAATCAG ApTAT GMGAAATCGGGAGTGCACTTTTTGACTGTAGATTGTTCGAAGACA 

CATTTGTAAATTTTCAAGCAGCAATAGAGAAAAAAATTCATGCATCTCAACAAAGGTGGC 

AGCAGTTGAAGGAAGAGATTGAGCTACTTCAGGACTTAAMCAMCCTTGTGCTCTTTTC 

AAGAAAATAGGAGATCTTATGTCAAGTTCTACATCAATATCATCCCTGTCTTATTAGGGA 

TTACCGTTTCCTAAGCCAAGAGTCATGTCAAATTGCAATCAGGC 

Sequence 1053 

GACCCCGCGTCCGGGAGGTTGAACGTTCAGGCTAAGACCGTTACTGAATTGGTTACTAAG 

AAGAAGCCAAAGGCTGAAGGCTATGCTGAGGGTGACCTCACTCTCTATCACCGTACCTCA 

GTCACTGACTTCCTCCGAGCTGCCAACCCTGTTGACTTCCTCTCCAAGGCCAGCGAAATC 

ATGGTAGATGATGAAGAGTTGGCACAGCATCCAGCTACCACTGAGGACATACGGGTGTGC 

TGTCAGGACATCAGAGTGTTGGGGCGCAAGGAGCTCAGGTCGCTACTAAACTGGAGAACA 

AAACTTCGGCGATATGTGGCCAAGAAGCTGAAAGAACAAGCAAAGGCACTGGACATCAGC 

CTCAGCTCTGGAGAGGAAGATGAAGGTGATGAGGAGGACTCAACAGCTGGAACCACAAAG 

CAGCCCTCTAAGGAGGAGGAGGAAGAGGAGGAGGAGGAACAACTGAACCAGACCTTGGCA 

GAAATGAAGGCCCAGGANGTGGCGGAATTGAAGAGGA 

Sequence 1054 

GTCGACCCCGCGTCCGCAAGGACCATGTTGTACCACAGCCTCTGCTGAGCTGAGGGACAC 

ATGTCCTTGGTGAAGACCTGCACCCCTGGAACCTCCCACCATCATCACAACTGNAGTCTC 

ATTTGCAGTGGAGAAAAGAACCCGACGTCCCACAGCCAGATATACACCCAGCTCCATGCC 

AGCCCTTCATGTTTACCTTTTGCTTTGTTAATTACATGTCAGACTCCTAGAGGGCCTCCA 

GACTAATAGGAAGCATTTCTGTAACCAACCTGCCACCCACTGATTCAGAAATGGAAATCA 

CATTCCACAATCTATGGCTTCCACCAGCTAGCCCAGGAAATACTTGAAATCAGCATTCCT 
NAAAAAAAAAAAAAAAAAAAANAAAAANNNANAAAAANNAANAAANAAAANNAAAAANGG

GGNnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnGGTNCAAAT 

AAAGCAATAGCNTCCAAATTTCCAAATAAAGCA I 1 1 1 1 I 1 1 CACTGNATTCTAGTTGGGG 

GGTTGG 

Sequence 1055 

CCACGCGTCCGCTGTTCCCGACAGCATGGATTAGCTTCCGTGTTCTGAAGTTGTTCTTTT 

CATGGTGTCTGACACCGAGGGCCGTTGTTCGTCCATCAGGCGGGATTGGATGGAGTCTTG 

GTGTTTTGCCTTCTCAGGGACCAAAAATGTATCATTGACTCCTTAACAGTGACCTTCCTC 

CCMGGACATATCCGTGTTCATTTTTCATAGGTTTTACTCATATTCATAGGTAGATTCTG 

TTAATGTGAGTTGGAAAGAAAAGACCAATTTGTACACCAGTCACACCACAAGACAGTTTA

TCATATAAMTACCTCAATTTTTTGTATTCCTCATTTCCACCTCACAATTGTACTGGTGA 

TGAATTTTAAGGGTCTG 

Sequence 1056 

TCGCCNCGCGTCCGGGCAAGGAAGGCTCTGTTAATTTATAGCTGTTTAGAGGGGAAAGCA 

GTGCAGACCACTTATTAAGCCCGCTGAGGACTAGCTTTCTGTCTTTCATACATTTGGGAA 

AGATAGGAATGACTGTTTCAAAGAAGAGAGGTGCACATAATTTATGCAGGCAAGTATGAT

ACTTATTTCATTGGTTTTGTGAGTCATATATATACATATATATATATACACATATATATA 

TGACTTGAAMTTAAGATTTAATACTTTMTGTTTTAAGTGTGGGGGGTTTGGGAAGGAA 

GGMTGTAATATNTGGGA1TTAGCCTTAGGCTTTAAGTTTTAGGCTGGCAAAAGAAATGT 

TATTCAGTGGGTTTGAGGTTTGGACTACTTTCTTCAAACTTAGAGAATTATAACAGGATG 

GTGTTTACCTTTGTTCACCTGGGATGTCCCCAAGACTCTAGCTTCTTCTATCAAGTGGTT 

GGGTCTGATAGAAGAGGTAAAAATTGCTCTTGAAAATNNTCTACAATTATGCAGGTTCTT 

TGATAAAATTTTCTGGTTAAATCC 

Sequence 1057 

CCNCGCGTCCGCCNCGCGTCCGCCCAGGCCTGGGCCGGGCCGTGACGGCGGCGCTAGGAC 

CCGGCGGGCCGCGGGTGCGGCGAGGCCTGGGCGGCCTGAGGAGCGCGGACCCCGGCGCTC 

GGCTCCCGGCGCCATGTGAGGGGGCTCGGGGGCCGCGGGGGGCCGGGCGCTCCCCGCCGG 

AGGTGTGAACCCACATCCCTGCCCCCAGGGCCACCTGCAGGACGCCGACACCTACCCCTC 

AGCAGACGCCGGAGAGAAATGAGTAGCAACAAAGAGCAGCGGTCAGCANTGTTCGTGATC 

CTNTTTGCCCTCATCACCATCCTCATCCTNTACAGCTCCAACAGTGCCAATGAGGTNTTN 

CATTACGGNTNCCTGCGGGGCCGTANCCGCCGNCCTGTCAACCTCAAGAAGTGGAGCAT 
Sequence 1058 

TGGGGCCGGGAGGAAGTCTAACCTTTGGGAGACTCCAAGACCGCAGCTCCGAGGTCGGCG 
GGGGTCTGGGTGGCCATGGAGGAGCCCCCTGTGCGAGAAGAGGAAGAGGAGGAGGGAGAG 
GAGGACGAGGAGAGGGACTAGGTTGGGCCCGAGGGGGCGCTTGGCAAAGAAGCCCCTTCC 
AGCTGACCGCCGAGGACNGTGTATNACATCTNCTACCTG
Sequence 1059 

TGGCGATCGCTGAGAGGCACGGAGGGCCGAGGCGGNCCTGGGAGGCGGCCCGGAGGTGGG 

GCGCCGCTGGGGCCGGCCCGCACGGGCTTCATNTGAGGGCGCACGGNCCGCGACCGAGCG 

TGCGGACTGGCCTCCCAAGCGTGGGGCNGACAAGCTGNCGGAGCTGCAATGGGCCGCGGC 

TGGGGATTCTTGNTTGGCCTCCTGGGCGCCGTGTGGCTGCTCAGCTCGGGCCACGGAGAG 

GAGCAGCCCCCGGAGACAGNGGCACANAGGTGCTTCTGCCAGGTTAGTGGTTACTTGGAT 

GATTGTACCTGTGATGTTTGAAACCATTGATAGATTTAATAACTACAGGCTTTTTCCCAA 

GACTACAAAAACTTCTTGAAAGTGACTACTTTAGGTATTACMGGTAAACCTGAAGAGGC

CCGTGTCCTTTTTGGAATGACATCAGCCAGTGTGGAAGA 

Sequence 1060

CCACGCGTCCGGGGTGTTGGTGCGGCGCTGTTGGGGTCCTCCGCTGGCTCATGGCGCCAG 
GCGTGGGAGGCCGAGTCCCCAGTGGCGAGCACTGGCCCGACTCGGCTGGGAGGACTGCCG 
GGACTCCAGAGTCCGCGAGGCGCTGCGGGCGCTGCACGCCGCCAGGGAAAACAAAGAAGA 
AGAGTTAATCGACAAACTGGAGGTGGTCACAATGCCTTCCCCATCACCAAAAGGACTGCC 
AGTGAAGCAATATGCTGTGCAGTCTCAGCTTCCCGTATATGAGTGGCCGGATGTGGGATC

TGGAGAATATGATGTTGGAGTAGTGGCTTCGTTTGGCCGACTTTTGAATGAGGCTCTTAT 

TCTTAAATTTCCCTATGGCATATTGAATGTTTCATCCCAGTTGCCTCCCGAGATGGCGTG 

GCCCAGCCCCTGTAATCCATACAGNTGCTTCACGGAGACACAGTTACNTGGAGTAACAAT 

TTATGCAAATTAGACCTAA 

Sequence 1061 

GCCGGTTCTTAGGGAGGCAGGTGCTGGCCTGGCCTGGATCTTCCCCATGTTCCTGTTGCT 

GCCTTTTGATACGCCTGATTGTCAACCTTCTGGGCATCTCCCTGACTGTCCTCTTCACCC

TCCTTCTCGTTTTCATCATAGTGCCAGCCATTTTTGGAGTCTCCTTTGGTATCCGCAAAC 

TCTACATGAAAAGTCTGTTAAAAATCTTTGCGTGGGCTACCTTGAGAATGGAGCGAGGAG 

CCAAGGAGAAGAACCACCAGCTTTACAAGCCCTACACCAACGGAATCATTGCAAAGGATC 

CCACTTCACTAGAAGAAGAGATCAAAGAGATTCGTCGAAGTGGTAGTAGTAAGGCTCTGG 

ACAACACTCCAGAGTTCGAGCTCTCTGACATTTTCTACTTTTGCCGGAAAGGAATGGAGA 

CCATTATGGATGATGAGGTGACAAAGAGATTCTCAGCAGAAGAACTGGAGTCCTGGAACC 
TGCTGAGCAGAACCAATT 

Sequence 1062 

CCNCGCGTCCGCTTTGMTNCTTATCTTTGATTTMTTTACACGCCAGCATTTTGCCACG 

TTCTAAATAATATTTAGCTCAACTGATTCATACGTATTAATGACCATTCTAGCAAAGGCC 

TACAAGTGGTGTGGGAATCAGGGAAAGGCTGCCTCTTTGGTATCTCAACTGGTATTGATT 

ATTGCTATCAACTATTTGGGGAGAAAAAATCAAAATGAAGCCCTGTCAAATTTTAGAAGT 

ACTATCTTTGGTCCTTCAAACACTTTGTGATGACACCTTAAGAAAAATAAAGTTGAAGTT 

CAGGTCTTGCCATTGCCATTACAGACAAATTAGGAGACTTGGTTTACCTGGGAACAAATT 

TACTTGAATATTCAGTACCTGAAACTATGCCAAACCAAAGAGCAGCTGCAGTACATTCGT

TATTrTAMTGMCAAGGTTTACAAAGNTTATTTTCATCTATACCGTAAGGNTGGATTTT 
TTTTNAA 

Sequence 1063 

GTCACCACGCGTCCGCCNCGCGTCCGGCGTGCATGGAGGAACGCTGGGCACGGGCCCGGC 

GCGGGTCGGGGGGCGCCCGAGGGGCCCGGGCCGAGCGGCGGCGCGCAGGGCGGCAGCATC 

CACTCGGGCCGCATCGCCGCGGTGCACAACGTGCCGCTGAGCGTGCTCATCCGGCCGCTG 

CCGTCCGTGTTGGACCCCGCCAAGGTGCAGAGCCTCGTGGACACGATCCGGGAGGACCCA 

NACAGCGTGCCCCCCATCGATGTCCTCTGGATCAAAGGGGCCCAGGGAGGTGACTACTTC 

TACTCCTTTGGGGGCTTGCCACCGTTACGCGGNCTTACCANAACTGCAGGCGAGAAGACC 
ATTCCCCGCCAAA 

Sequence 1064 

GTGCCACGCGTCCGCCACGCGTCCGCCTGCCCCTCGCCGCCCGCCGCCTGCCTGGGCCGG 

GCCGAGGATGCGGCGCAGCAGCCTCGGCGGCCAGGCTTGCTCCCCTCCGGCACGCCTGCT 

AACTTCCCCCGCTACGTCCCCGTTCGCCCGCCGGGCCGCCCCGTCTCCCCGCGCCCTCCG 

GGTCGGGTCCTCCAGGAGCGCCAGGCGCTGCCGCCGTGTGCCCTCCGCCGCTCGCCCGCG 

CGCCCGCGCTCCCCGCCTGCGCCCAGCGCCCCGCGCCCGCGCCCAGTCCTCGGGCGGTCA 

TGCTGCCCCTCTGCCTCGTGGCCGCCCTGCTGCTGGCCGCCGGGCCCGGGCCGAGCCTGG 

GCGACNAAGCCATCCACTGCCCGCCCTGCTCCGAAGAGAAGCTGGCGCGCTGCCGCCCCC 

CCGTGGGCTGCAAGGAGCTGGTGCGAGAGCCGGGCTGCGGCTGTTGCGCCACTTGCGCCC 
TGGGCTTGGGGA 

Sequence 1065 

CGCGTCCGAACGGCATCATCACGCCCGCCACCATCCCCAGCCTGGGCCCCTGGGGAGTCC 

TGCACTCAAACCCTATGGACTACGCCTGGGGGGCCAACGGCCTGGATGCCATCATCACAC 

AGCTCCTCAATCAGTTTGAAAACACAGGCCCCCCACCGGCAGATAAAGAGAAAATCCAGG 

CCCTCCCCACCGTCCCCGTCACTGAGGAGCACGTAGGCTCCGGGCTCGAGTGCCCTGTGT 

GCAAGGACGACTACGCGCTGGGTGAGCGTGTGCGGCAGCTGCCCTGCAACCACCTGTTCC 

ACGACGGCTGCATCGTGCCCTGGCTGGAGCAGCACGACAGCTGCCCCGTCTGCCGAAAAA 

GCCTNACGGGACAAGAACACGGCCACGAACCCCCCTGGCCTCACTGGGGTGAGCTTCTTC 







TTCTTGTCGTCATCGTCCTTCTTCAAGCTTGGCCAGCAACGAGAACGCCACAAGGAAACT 

NGTGAGCCCACGTTNGGCCGTCGGGAAAACACGGGGN 

Sequence 1066 

CGCGTCCGGCCTCCAGCACATCCTGCCTGCAGAGGGTCTGGCTAGCTGCCTTTTCAGCTC 

TCGAGGGATAGAGATTCTACAACCTCCCTCTGTCATCAGTTCCAGAGCCACTCCCCTTTG 

CACTAGAAGTTCTTGCTTTCAAAGAATGAGGGTGTGAGGGAGGGAGGGGTCAAGAAACAG 

AGTGACAGGGGAAACAGGCAGAACAAGTCAGGGCAAAGGACCCCAGCATGAATAGTTGTG 

GAGGTGGAGGTGGGGAAGCAGCCTCACATCTCACACTTCCTTCCTTCTCTTAAATGTGAG 

CAGCTGACTCCAAGCCTTGTGGAAACTCTAGAAGGTAGAACCAGCCATCTGGGGAAGCTG 

GCCTTACAGATGCCCCGTCTGGCATAGTGGGAGGTTCCTGTGCTCTGAGAACCCCAGTGT 

GAATCTAGACATTCCACTGCAGCCTGGGMGAAGCCTGTGTTTTTCTTTAAAAAGTCT 

Sequence 1067 

GCGTCCGGTTCTTAGGGAGGCAGGTGCTGGCCTGGCCTGGATCTTCCACCATGTTCCTGT

TGCTGCCTTTTGATAGCCTGATTGTCAACCTTCTGGGCATCTCCCTGACTGTCCTCTTCA 

CCCTCCTTCTCGTTTTCATCATAGTGCCAGCCATTTTTGGAGTCTCCTTTGGTATCCGCA 

AACTCTACATGAAAAGTCTGTTAAAAATCTTTGCGTGGGCTACCTTGAGAATGGAGCGAG 

GAGCCAAGGAGAAGAACCACCAGCTTTACAAGCCCTACACCAACGGAATCATTGCAAAGG 

ATCCCACTTCACTAGAAGANGAGATCAAAGAGATTNGTCGAAGGGGNCNNAGTAAGGCTC 

TGGACMCACTCCAGAGTTCGAGCTCTCTGACATTTTCTACTTTTGCCGGAAAGGAATGG 
A 

Sequence 1068 

TCGACCCCGCGTCCGGCTGGTTTTCCGTCTGGTGAGGGGTTACTTCCGGGTCGGACGGCG 

CTAGCTGCAGCATCGGAGTGTGGCAGTGCTGGGCTGGCCGGCGGGCTGGGCTGCGGCCCG 

CGCGCGGCCGGCGATGCANGGGGGCAACTCCGGGGTCCGCAAGCGCGAAGAGGAGGGCGA 

CCGGGGCTGGGGCTGTGGCTGCGCCGCCGGCCATCGACTTTCCCGCCGAGGGCCCGGACC 

CCGAATATGACGAATCTGATGTTCCAGCAGAAATCCAGGTGTTAAAAGAACCCCTACAAC

AAGCCMCCTTCCCTTTTGCMGTTTGCAMCCMCTCTTGCTGGGTTTCTTTGCTGGAA 

GCACNTTGAGCCCACTGTGCATGAACCA 

Sequence 1069 

CCGTCCGGGAGGTTGAAGTTCAGGCTAAGACCGTTACTGAATTGGTTACTAAGAAGAAGC 

CAAAGGCTGAAGGCTATGCTGAGGGTGACCTCACTCTCTATCACCGTACCTCAGTCACTG 

ACTTCCTCCGAGCTGCCAACCCTGTTGACTTCCTCTCCAAGGCCAGCGAAATCATGGTAG 

ATGATGAAGAGTTGGCACAGCATCCAGCTACCACTGAGGACATACGGGTGTGCTGTCAGG 

ACATCAGAGTGTTGGGGCGCAAGGAGCTCAGGTCGCTACTAAACTGGAGAACAAAACTTC 

GGCGATATGTGGCCAAGAAGCTGAAAGAACAAGCAAAGGCACTGGACATCAGCCTCAGCT 

CTGGAGAGGAAGATGAAGGTGATGAGGAGGACTCAACAGCTGGAACCACAAAGCAGCCCT 

CTAAGGAGGAGGAGGGAAGAGGAGGAGGGAGGAACAACTGAACCAGACCTTGGCAGAAAT 

GAAGGCCCAGGAGGTGGCGGAATTGAAGAGGAAAGAAAAAGAAG 

Sequence 1070 

GCGTCCGGTGCTGGAGGAMATGTTTCTGGGGAAGATGACTCAGTCATTTTGTGGCGAGA 

CACCCTTTGGTAACTCCCACTGACCAGTCTTGGGAGCCTTCCTGGAATGATCGTGGGCTG 

AGCGGAGATGTTTTTTGCAAAATGAAACTGAAGCTGAAAGAAAGGAGAATTCGAGTGAAC 

CAAGAGAAATCCAAAGACCTGGGGAAGGAGGACTTAAGATGAAAGTGAAGCAAGAGAGGG 

AAGGGGAAATGAAGTGAAAATGGCGTGAGGGTGTGAGAGAGGTTTGGGTTAGGAAACATG 

TTTTTAGTGCTATTTNCAACCAGGG 

Sequence 1071 

CACGCGTCCGGGACTGATCTCNAGGACCAGCACTCTTCTCCCAGCCCTTAGGGTCCTGCT 

CGGCCAAGGCCTTCCCTGCCATGCGACCTGTCAGTGTCTGGCAGTGGAGCCCCTGGGGGC 

TGCTGCTGTGCCTGCTGTGCAGTTCGTGCTTGGGGTCTCCGTCCCCTTCCACGGGCCCTG 

AGAAGAAGGCCGGGAGCCAGGGGCTTCGGTTCCGGCTGGCTGGCTTCCCAGGAAGCCCTA 

CGAGGGCCGCGTGGAGATACAGCGAGCTGGTGAATGGGGCACCATCTGCGATGATGACTT 
CACGCTGCAGGCTGCCCACATCCTCTGCCGGGAGCTGGGCTTCACAGAGGCCACAGGCTG
GACCCACAGTGCCAAATATGGCCCTGGAACAGGCCGCATCTGGCTGGACAACTTGAGCTG 
CAGTGGGACCGAGCAGAGTGTGACTGAATGTGCCTCCCGGGGC 
Sequence 1072 

CCCGCGTCCGCGCGACGGCCGCGGCGCGGGACCTTAGGACCCGCGGGCTCCAGGGCTACT 

GTCCGTCCGCCACTGCGCGCCAGCAGGTCCTGGTCTCCGCTCTCCAACAGCTGAAAGGCC 

GGCGCAGTGAACACAGAAACGAAAACCAAGAAATGCCTTATTCCACAAACAAAGAGTTGA 

TACTTGGCATCATGGTGGGCACTGCTGGAATCAGCTTGCTGCTCTTGTGGTACCACAAGG 

TCCGTAAACCAGGGATAGCMTGAAGTTACCTGMTTTCTTTCTCTGGGTAATACATTTA 

ATTCAATAACTTTGCAAGATGAMTACATGATGACCAAGGMCAACAGTMTCTTTCAAG 

AAAGGCAACTTCAGATACTGGAGAAGTTAAACGAATTCTGACAAATATGGAAGAACTCAA 

AGAGGAAATCAGATTTCTTAAAGAAGCTATTCCAAAGCTGGAGGAATATTTACAAGGATG 

AACTTGGGAGG 

Sequence 1073 

CGCGTCCGCTGAGTTCNAGGATGGTTTTTTCTTGGGACCAGACATGAACAAAAGTTGACC 

TCATGAGCACTTCAACCTCTCCAGCTGCCATGCTCCTCCGGAGGCTGCGGCGACTCTCCT 

GGGGCAGCACTGCTGTCCAGCTCTTCATCCTAACAGTGGTGACAGTTTGGCCTGCTGGCC 

CCCCTGGCCTGTCACCGACTTCTACACTCTTACTTCTATCTGCGCCATTGGCATCTGAAC 

CAAATGAGCCAAGAGTTCCTGCAGCAAAGCTTGAAAGAGGGTGAGGCTGCCCTCCACTAT 

TTTGAGGAGCTTCCCTCTGCCAATGGCTCAGTGCCCATTGTCTGGCAGGCCACCCCCCGG 

CCCTGGCTGGTGATCACCATCATCACTGTGGACAGGCAGCCTG 

Sequence 1074 

CGTCCGTGAAAATCCAAAGATGTATCATTTTTATTTGAATCCATCATGCAGTGTACATTT 

CAGATMTTTCCTTCAGTCTCCAGATAGGAGTGTATCCAAACATCTMTTTTATGTGCAC 

TGTGTATCTTATATGAATGTTTTATTTTATATACCACATGCAAAAATGNCCATATGCACT 

ATTTAAATGTTTTAAATAATATATTC(JrTCTTTATAATGCTAAATCTATATGAGTACCAT 

ATTTTTATMGTCAGTGGTCTGACNGG^CATTTTTNMNTMCNNN 

GGTATTCAANGNGAAAAGGGTGGNTGTGAGGAGAANATGTGAAAGNGGNNTGGGNGNNCT 

CTTTTGCCTTTGGGCCAGGAATTNGGGGGGCNAAAATNNACCCANAACCTGGNNNAAAAN 

TAGGNCCANTTGGGGGNGANAGGTTTCACTTTGGGGCNCAAAAANAAAANCCCGGGGTTT 

TTTNTNTTTNCCAAAATANATTNTTTTTGGGA I I I 1 1 I I I I GTNCCCCCCCCGNATTAAA 

TGGGGANTTGGCTGGNGTCTTGGCNCCTNTCATTTGTGCCAGACC I I I I M I I ATTAAAA 

AAGAACCTTGGGAAAGGTCTTAAGTNCATTTGGGAAAAAAAAAAAAAAAAAA 

Sequence 1075

GAGCCGNCCCACGCGTCCNCGGNCGCGTGGGCTACCTTGGAAGCAGTCATCTCTCAGTCT 

TACATTTGGAGAATGTGGATGGCATGACATCAGMTTCCTTTATATAATTTAACTTCAGA 

ATAGTCTGAGATCATCGAAGCACGATGGTCMGGGAATTCCGTTTTTGTTTTAGAGCAAA 

TATGTTTGCTGTTTGTCTTTCATCACAMCATCAGTGGAGTTTCAGCACCTTACAGAGCT 

CAGTGAACCCCCTGGTCACCATCAAAGTTAGCACACAACAAAGCCAACCACGTGTCCCCC 

TCACAGATGACAATGGCTGAACTCTGAGTGAAACCACCTGTATGGCCGGGCACAGTGGCT 

CA 

Sequence 1076 

GCCCCGCGTCCGCl l I n GAGAATCTCTGCTCTGTTCCTAGGTTCAGTGCTGGGTCCTGG 

GAATACAGCAGGACAGACCTCAGCTTATCTCTTCATAGAAATTATACAAAGAGAATTGGG 

GAGACAGCTAAGMGAAMCMAGAMTAMGCAGTTACAAATTGTGATAMGTGCTTTT 

GAAGGAAAGAAGGGGTCTGAGACAACAACAGGGAAGGGGCCTCTCTTGAMCAGTAGTTG 

GGAAGGAGGCAGACATGCACCAGTGATGTGGTGACAGGTGCTCTGAAGGAGGTCACCAGG 

ACCTGACCTCTTTGAAGGATCAGAAAATACTTCCCTGAAGGACTGACATTTGAGCCTAGA 

CCTGAAGGGTGAGCCATCAAGCTAAGACAATTGGGGAAGAGCATTCCANGGAGAGGGAG 

Sequence 1077 

CGCGTCCGATCTTTGTCTGCTTTCCTATAACTCAGTACTGTAACTCAGTACTCTGAAATA 
GTTTCCTTTGTTAATAGAGTCACTTTTATAGTACTGNGCTTGAGGNNATATACAGAGTAT

TGTGTCCAAATTTATCATTGCACAAAGTGTTTTTGGAMTTCTTGGTTACTCCTTAGTAA 

ATTACCTGTMTTGGGTTAAATGCTGGTAGGGTTTAAAATCTGATTGCTAAAGTGAATTC 

TCTATAMGTGAGTTTTGATACATAGAMCTTTNCATATAATTCTTAAACTCATGTGTCA

TGTATTTTCATTTATAGTTTTTCATATTCATTM

CTCANMTTCTGCANATGCAGATTTTTGCAAACTTTGATGCATTTGGACAGTCTAGTGGT 

TCGAGTAATTTTGGAGGTTT 

Sequence 1078 

TNCGGGCGGCTGCGGCGGGCGGGCAGGCGGGCAGGCCGGCAGGCGGGTGCGCGGAGGGCT 

GGTACCCCTNAGCAGGTGGGCGGGGTGCGGTTGGNGGCGGCGGCTGGGCCGGGGGCTGCC 

CGGCTGCGCTCGGGCCGTGCGCGGNGGCCGTGCGGGCACGCCATGGACTTCAACATGAAG 

AAGCTGGCCGTCGGACNGCGGGCATCTNTNTTCACCCGGGCCGGTGCANTTCACGGAGGA 

GMATTTGGCCAGGCTGAGMGACTTGAGCTTGATGCCCACTTTGAAAACCCTTCTGGCC 

CGGGCAGACAGCACCAAGAACTGGACAAGAAGAAGATCTTGAGGCAAGACAAGAGGTGCC 

TGCTGCAGCCCCAACCCCAGTGCCCCGAGTGGAGGGAATTANCTGTATGAAGAAGCTTG 

Sequence 1079 

CACGCGTCCGTGCTGAGCGGTCCCGAGGGGAGGGGCCTGAGGCCGAGTTCCAGTCGCTGA 

CCCCTAGCCAGATCAAGTCCATGGAGAAGGGGGAAAAGGTCTTGCCTCCCTGCTACCGGC 

AGGAACCTGCCCCGAAGGACAGGGAGGCCAAGGTGGAAAGGCCCAGCACCCTCCGTCAGG 

AGCAGCGTCCTCTTCCCAACGTGAGCACCGAACGTGAGAGACCCCAGCCTGTCCAGGCCT 

TCAGCAGTGCACTGCACGAGGCTGCCCCCTCCCAGCTCGAGGGGAAGCTGCCATCTCCTG 

ATGTCAGGCAGGACGATGGGGAAGACACCCTGTTCTCGGAACCCAAGTTTGCACAGGTCA 

AGCTCAAGTAATGTCGTCTTGAAGACGGGATTTGATTTTCTGGACAATTGGTAAAATGTA 

tTAGAAAAATACAATGAAAGAACCCTAAAATGTTTTCCAAAATGGTGTGGTGGAGGAGGA 

TAAAAAAGGGCCACCTTTTCCTATGTATTTTACTGGTTTCTTGACACTCTTTTCTTAATC 

ATTTGGAAACTGGTCAATACTGNCAGATTTTTT

Sequence 1080 

GTAAGCCAGGTGCTCCCCCTTCACTTCCTGTGTGCGGAGCACGCTCGCCCTGGGAGTTTC 

ACTAGAAAGAAGGTTGCCATGGGCCAGTGGGACAGCTTGGATCTCAAGTGCACGCGGATG 

CCCCAGAATCCCAGGATCTCAGCTGAGCTGTTTGTGGATTATTAGATCTGACTTAAAAGA 

ATATTATCCAGCAATGCAAATGAACAAACTATAACTACACACAGCTGCATGGATAAATGT 

CAGAAACATGACGTTGAAGTGTGAGAAGCCAGATGCAAACCGAGGACTCACTGTGCAATT 

CTGTGCATGTACAGTGGCCAGGAGMGGGAGCACTGGCTTTTGCTTTCATCAGGCCAAAG 

ATGCCTTTCTTTGGGAATACGTTCAAGTCCCAAGAAAGACACCTCCTCGGAAGGTCGGCA 

TCTTTCTCCAACCTGCATTCTTTGGATCGATCAACCCGGGAGGTGGAGCTGGGCTTTGAA 

TACCGATCCCCCGACTATGAACCTGGCAGGGCAAAAGCCTGAAAGTTTGAAAAT 

Sequence 1081 

TGCTCTCTCACCTGTTAGCTGTGTAGCATTGGGCAGGTTACTTAACCTGTCTGTGCTTCA 

TCTGTGAAACAGGMTAACAGCATTATTAAGGATTGTTTTAGGATTGGATGGGTTAATAG 

ATGTAMGTCTTAGMCTATATTGAGCATCCCATCAAGGCATTGTATTATATTGAAACAA 

TGGGGTTTNTTTCTCTTTATNCTTTTTTMACTATATMTGMCACTTT^ 

ATTNCTAA 

Sequence 1082 

CCCGCGTCCGGTGAATGTTAGTATTGGGTGTGGGATGCATCAGGGACACAGGTTTGTAAA 

CCATGACAATTCAATTGTAAAACTAAAGCCNAGTGCCCCCTGTAGTCCCAACTGCTGGGG 

AGGGTCACTTGANCCCTAGGGGGGGAMATGCAGTGGACTGGATTTTGNGCCCTTTGCAC 

TNCAAACTGGGTGACATAGTGAGGCCCTTGTCTTCTACCANAAAAAAAAAANNANNANNN 

NNANGGTGCCGGGCCGCTTAGAACTAGGTCTTAGAAGAAAAAAACCCTCCCACAACCTTC 

CCCCTGAACCCTGGMMCATNAAAATGGAATGCCAATTNGTTTGGTTGGTTAACCTTTG

GTTTTATTTGCAAGCTTTATAAATNGGGTTTACCAAAATTAMGGCCMTTANGCCATTC

ACCAAAAATTTTCACAAAAATAAAAGGCCA I I I I I I I I I I CACCTGGCATTTCTTAGTTT 







NNTGGGNTTTTGGTCCCAAAACCTCATCNAATGGGTANTCTTAATCATGTCCTNGGGATC
CCCCCCGGGTTACCCGGAGCCTTCGGAMTTAMTTTCCTTCTTTCCGCTTTTCCTTCGC 
TTCACTTGACCTCCGCTTNGNCTCGGGGTCCGTTTCCGGGCTTG 
Sequence 1083 

TCGACCNCGCGTCCGTGGAGGGCCCATCTGCCAGAGCCTGGAGTCTGCGAAGGCCGGGAC 

CCGGTTCCCCGGCCCACAGTGGGGGTGTGCAAACCCGAGAGAACTGGGAAGTGCCGTCAG 

AAGCGATAACTGACGACGTCTAATGTCTATCTGACCGCAGTCGCTGAAACCTCTACAACT 

TAGTTGACCGTAACTGCCAGAGCCCTGCCCTGAATTCCTGTCCTTACTCCCTCTTTAAGA 

TTGCGTACCCACTGCAGAGTGCTGAAGACGGGGTAGCCACCGAGGTTGCAAATTCGTGAA 

GAATCAGCATCATGTTTGGCAGCTGAGTATTGGAGCCAGGAGCCTGCCATGAGGTTTTGA 

GMCAGAGTGCTGTTTTAGAGCTGGCAGCAGCATCTCAGCCCAAGAGAAGGTTATATTCC 

CAGAGGATGTCAGTCCCAAGGACCAAGTAGCTGCCATCAGTTTGGATTCTGAAAACTAAC 

TGGCATCAAACTGGGTGTAGAAACATG 

Sequence 1084 

CGCGTCCGGACTGTCGCTCTAAAAGAATGAAGGAAATAATAAAGTGATAGACAGGGAAGG 

ATAGAAAAGACTTMCAATATACATATGTTCCGTCTTTGCTGTTTTGGAGAATGATGGAT 

AAGTANGTGTTTCCTGATTCTGMGCATAGCTGMCAATTTMTTGTGGTTTACCATCTT 

TTTGGTTCCCTCTTCAGTAATTAACCTATCGAAAATCTGTCCTAAATGTTTGGACTGGGG 

CACAGTTCCCTCCATCGCTTTGGGAGAAAATCATTAATATGGCATACTGCAGATTGGAGG 

GCAGGACCACTGAGGGTGTCATAGACATTAGCTCTATGGAATTCTGCTAGCAATTTCCAA 

GTGACAGTGAGGAATTATGGATATATGTTTGAGGTCATTCAAGCTTCCTGAGTACCACAT 

TCCCCAGCTACTTAGACACCGGGTTAAAATATTAAAGATGTCCTAGTTCAACAGCTTGAA 

TTCCATTGATTGGAT 

Sequence 1085 

GACCCCGCGTCCGGCTTCCTGGGTTCGAMGMCCCAGTTCAGGAGTTTCTGTTTTAGTT 

TGAGATCTTATAGGCCTGTCTCATCAGGTTGGTGTCAGCCCAGCTAGGATTAGGCAGAAT 

TGGGTGGGGGCTGTAGTGCATCTTTGGCACAGCATGTACCTGTCTGACTAATTCTCTGTC

TTTTCTTTCCTGTTGCAATTCATGGGTCTTAGCATCTTCTGMTGGTGTTTAGTAGGTCA 

TCCTGTTGATTTCCTGCTAGGGAGTAGCATACTCTGGCTCTGTACCATTGGCCAAGGGAC 

TTMGGATAGGTGMGGGCTGCAGTTTTGTTAAATGGAACAATATGAAGAGATGGCATTG 

TAAAAAAACTTNTGNCAACTNAA 

Sequence 1086 

TGTCGCCCCGCGTCCGATCAMTCTTGATGMGGATTGTAGATTTTTGCTTTTTC

GTTTTTAAMCTTATTCCMTTGCTAMTTGGTAGTTTTTCAGTCTTTATAMT^ 

TTAAAMTATATATACAGTTATATGAMTGTTTATTTTCTATGTGTGTGCATATAGTTCA 

ATATTATGCAATAAATTTGGTGTTTTAACTTAAAACTATTTCTTATTGTACTTGCAGAAT

GGATAGCTTGCTTTTAGTAGAAGCATTAGGTCGTATACTCAGATAATCTAATAGAAGGTC 

AGATTTGATTCCTGCATAAGAAAGTAGAGCCCAAGTGCTGCAGAAATGGAGAAGAAAGCA 

GGGGCMGGGAGCAGATGGCATTAAGGAAGAATGAAGTTTTTGAAGGTTGGGGATGGACG 

AAAAGGGTGTTTCTCATGGAGAGGGGATGCTTTAGCAAAGGCTCAAACATTGGGGCATAT 

TTAGGCAAGAGCCAAGAAACAGTTTGAAGGGGAACATCAGAGGAAATAGGCCAAATTAAT 

AGTAAATN 

Sequence 1087 

GGNGTCGACCNCGCGTCCGGAAATACTCAAATAATGCAACATTACTTCCCAGAAATGAAA 

ATACATTGCATCTCTTATTGAAAGAGCCAACAGAGGCAATGAAAAATAAATGTACACATA 

TGTGCACCCTTAATTCTTTATAATGAACCTTGAGAAATCAAGGCTTTAGAAAAATGCTGA 

GAGACAAGAAGCTTCTGAGCAGCAACACTGGATGCAGGAAAAAAAATGGAGTAATMGTG 

TAAAGTTCTGAGGGGAAAAAAAATATCAGAACCTAGAATTCTAAATTTAACCAAACTGTC 

ATTAMGTAMTGAGAAAAGTAAATTCTTTTGAGAAACGCAGAGCCTAGAAAGGATTACT

AATCGGTAGGATCATCTTTTTTTGTTTGTTTGTTTGGNTTGGGGACAGACTCTCACCTCT 

GTCACCCAGGCCTAGGTTTGCAAGTGAGCCCAGGATTGTGCCACTGCCCTCCAGCCTTGG 
GGGTGACAGGAGCAAGGACTTCATCTCAAAAAAAAAAAA
Sequence 1088 

TCCCCCTGCCCCCCCAAGCTGAGGGAATAAACTCTGCCTGTCGGGTGCCCCACCAAGGGA 

AATAAACATGGCCTAGCTGCCAAGTCATGCCTGTAGGGTGCTCTCTACTGGCAGTTTCTG 

GGTGGTGCATACATCTAGTCTCCCTAGAAGAGCACAGTCCAGAATTGAGGAGGTACAGCA 

GAAACAGACTGCAGGCAGAGAGAGGTCTCATAGAGCTTGGACAGGGCTAGGCACAGAAAG 

AACAGGCAGCGTATCCAGAAGGGGGCAGGGAATGGGTGAAGAGGTTGTGCTCTAGGGCAG 

AGCTGAGCTCTGATCTAGAAAGGACAGCAAAGATACCTGGAAGGCCTCCCGATTCTTGCG 

TTGTTGGCGTCGCTCCCGAAGCCGGG

Sequence 1089 

CGNTCCTTGGGTAAAAGCGTCCCAGAGACGGGAAGAAATATGGTATAAGCGAGAAGGCCT 
CATAMTCTGGGCTGTTAAAMTCTMGTTAMMTATGTTTTAAGTCAGAAAAAAAAAA 
AAAAAAAAAAA 
Sequence 1090 

CGCGCCTGGTAAMTTATATMGCTTAAAAAAACAAAAACAAAAACACTTGCTTTGAAAA 

GAGTCTCTCAGCAGCAATTTTGTCCTTGCCCCTACTTCCACAGTTCCTTTTCTTACCATT 

TCACATCTGGATTACTACATTGGCCTCTTTGCTTAGACTCCCMTATTCATTTGCTTTCC 

TTCACCCCATTTTATGGAGGACTGTCAGATCMTCTTTTAMGATAMTTTTATAATGTT 

ACTACTGTTGCCTATTGGATTAGAGCCCTAGGGGTGCTTTTTGTAGTCTCACTGACAGCT

GACATTAGTGATTTTTCACCCTCTTCTTATTGCTACCCTGTGTTGATGGCCAGTTTCCAG 

GTGGGCACCTGCTCCACTTGCTTTCAT 

Sequence 1091

GGGGTATGTGTGGTTCTTCCAGGAAAGTGCTGAAAATATCACCCAGGCCTCTGCGCCACG 

CCCTGGGAGAGTACACTCCTGGGCTCACGCCTCTGCATTCCAAGGCTGACAGCTAGAAAT 

ATACTTTGTAAAATACCAACAACTTATTCACAAATATTCCAACTATCTACCAGCTCCAAT 

GAGCTTGCTGAGGATGGGTATGACCCCAGTCTAAGGGGAAAGAATCTAAAACACAAGTAA 

ACCTGTTTAAAGGCCAGATCTCCAGATGGAGATCCAAGCAGATGGCGCCTAAGGTTTGCC 

CTTGAAAACTACCAAGGAAGCCACAGAGAGGGATCTTTGGACCTTCTGGAAAATGGTAAG 

GCCCCAGGTAGATTATGGCTCCTCTGCCCTGGAGGCTGAGCCGCCCTCTGGTTACCTCAC 

ATCTTCTGGTTTCTTCTGAGTGGGACTTGATCTCATTTCTGCATTCACAGCAAGGNGGAA 

CTGTCTGGCAAGAGCTTAAAATTAGGACCTGNTGGTGGGGACCTTTAATAGCAGGTGGAG 

GGTTTGAGATCCCNTGAGATGCCCNAGATTAATTCAATAGGGGGANGAAAGATTTGGCCC 

AATTCAAAAGNGCTTAAAAAAAAATTTT

Sequence 1092 

CGCGTCCGGTTCTTTGGTTGAGCTTCTTTGTATCAGTAACAAAAGGAAGCATCATTCAGT 

CTTTCTTTGTAACTTAATGTMGTCTCTTTTGTACATCCTATTACTTCAMTCATTGAAG

TGAACTCCATTTTCACATCTGTTGGGAACAATCATCTAGCTTCTTAAATGACTCATCTTA

AAATATGAATTTTAGACTGCCTAAACATTCTGAGGGAGTACAGTGTGATATAGCAGAAAC 

AACCGGGGCTTAAGAAGGATCAAAATGAAAGGTTTTGTGAAGGATGTGCCAGAGACTGCT 

CTCTGTTCACCAAMCCACATTCTTCTCTTCTTCCTGGCATAGAGCCAGACTAGATTTCC 

AAGCTTTCCTTGCAATTGAGCTCTCAGAGTTCTCGTCATTAGAATGTGAATGATAGGCCG 

GGCGCAGTGGCTCATGCCTGAGATCCCAACACTTTNAGAGGCCAAGGTGGGTGGATCACT 

TGAGGTCCAGAGTTCNAGACCAGTCTGGCCAACGTGGTGAA 

Sequence 1093 

CGCCCCGCGTCCGATAAAACTGGATTTGATTTCTTTTTATGAMNGTTTCATATGAATGT

MCTTGATTTTTTACTATTATAATCTAGATAATATGATATAAGAGGGCTAAGAATTTTTA 

AATTGAATCATATATATGATATAATTTGATCCTTCTTGTATCTTGMGTTTTGTACTTGG 

GATTTCTGGACTGATAAATGAATCATCACATTCTTCTGGTAAATATTTTCTTGGAGCTCT 

GTGTCMCTTTGATCCTTTGTCTCCCAGGAAGGTGTGACCTCTCCTTTGCCTGCATACCT 

CAAGGCCAGGGGAATATGCCTCAGTGATGCATTTATCTTTGTATATCAGGCCGCATGATT 

CCCAACTTTCTGCCACACTTAMTTACGTTCCTCCATTTCAGTTTTGTCTTTTCTGTCTA 
AAGTTCAGTCAAAGAGTATCAAAAAATTATGTTTCAGCTAGACTGGTGTAATGTATAAGT

TTTTGTATCTTGTATTAGAGGATTTCGTAGCTTTTATTAGAGG 

Sequence 1094 

GCCCCGCGTCCGATCCCTAGATGACATAACAGCCTTACAAAAGGACAGGGAGGAGTGTCT 

GTTCCTACTCTCACATAGCGGAGGAAAGTTAGAGCCTCTCAGTCTCTGTTTATGAGGACT 

CATTMTCTCAAATMTTGATGCATTTTTCATACATTAGGGTCTCTGTCCATGTGTCTTC 

CTGATATTGTTATAGAAATGGCTTCAGGCTGCTGGTAACAGATGCTGCGGAAAAAGAATG 

CCTTAAACAAAGCCAGGCGCGGTGACTCACGCCTGTGATCCCAGCACTTTGGGAGGCTGG 

GGTGGGAAGGATCACTTGAGCCTAGGAGTTAGACACCTACCCAGCCTGGGCAACACGGTG 

AGACCTCGTCTCTACAAGAAACAAATAATTGGCTAGATGTCGTGGCGCACAAGCTTGTGG 

TCTCGGCTACTTAGGGGGCTGAGGCGGGAGGATTATTTGAGCCTGGGGAGGTCAAAACTG 

CGGTGGGCTGGGATTGCGCTACTGCACTCCGGGCTGGGAGACCGAGTANGACCCTGCCTT 

AAAAAAAAAAAAAAAAAAAA 

Sequence 1095 

AGTCGCCCCGCGTCCGCTCATACCCAGTGAATCTTCAACAGAATCTCTTAAAGATCTCCA 

GGMGTATMGCTCTCATTMTGTTTGAGTTAGAAGAACTTATTCTGGGCCTTTAATTTG 

TTGCATGTGCTGTACTTAAAGCATCCCAGATMTTTTAGCTTATATTTTCATAGTGTTTA 

TACAGAGCTTGMTTGGAATGGTCCTTTCCTTCCTTGCCTCAGTACTTCCTTCCATAATC

TTTCCTGCCATAACCATTTATTTTGCCACCATTTCTTAAACACTTATGTGGCAGGCATTA 

TGCTAGACTGTAATATG 1 1 1 I 1 1 I I AATCCCAGTTGAAGTGGATGTGGGAAGGTATTAGA 

AAGTAGAAGAAAGTATAGTCTAAAATAGAGAGGAAAGAAAGGAAGAGAAAAGTGGGATAT 

TTCAAAACCATTTGCGCAGAGGTAGAATGAAATTCGCCAGAATGGGAATCTCCGTATTTC 
TTTACAAT 

Sequence 1096 

GTNGCCCCGCGTCCGAGTNAACAGTGGTAGTNAAATTCAGGGTTGGTAAGTTTTTCCATA 

GAAGGCCAGATGGTAAATATTGTAGGCTTGCAGACCATGTGGGCTCCACGACTCAACTCT 

GCCACAGTAGTTTGAAAGCAGCCACAAACAGCCTTGGTGTGACTTTGTTCCAGTAAACTT 

TCTTTATAGAATGGGAGAAAATATTTGCAAACAATGCATTCAACAATGGCCTGATGTCCA 

GAATTCATGAGGAACTTAAAAAACTCAACAACAAAAATCACCAAATAACATTTAAAAGTG

GGCAAAAGATATGAATAGTCATTTTTCAAAAGAAGACATACCGAATGGCCAACAAGCATA 

TGAAAAAATACTCAACATCTCTAGGCTTTCAGAGGCATGCNAANTAAAACCNCCATTNGA 

TATTATNTTACNNGANCCCNAAATGGG I 1 1 1 I I I I A AAAGGCCAAA 

Sequence 1097 

ccccncgcgtccgttnaagtgtgctttggaaaagggaaaaagtcctaagtagatataaaa 

ccctaactaaggaagaaagcaggtagcagtggtggtccaagagaccgtgtagtggatgca

aggaccgctcgtattttacacgctatatttcagcaaagggtggcccatctggcaggaaga 

tggggacatatgtcacatatagagcagttaaggaactagggaaagtggaagactcagaag 

acctgtctttgacctggtatgttctatctctacagaacctaatatggcttatacatactg 

ccacagaaaggactgaggtagacagtggcaaagacttcctaggagttgaacccctgaaat 

tacataaggagtaggaccccaccagaattctgtctttgtaggctgctgactgcagaagaa 

acggtgtagcggaggcagggggaagaggagtcagnanagtacactgggaaggaagaaacg
ggttcttttctctt 

Sequence 1098 

TCGCCCCTGGGCCCTCCTAACCAACCAGGGGAGGGGAGAAGGACCCAATTCTTTTCCTTT 

GGTGCACGTAGCCTGGACCCGTTATGGACAGAGGCCAAAGGAAGATAACAGTGTGGTGTC 

CAGAGATGAAACCAAGTGGTTGATGGGCAGTTCTTTGAGCAACCTTGTTTATGAGCCTAT 

TGATATGCAGATATAGAGGCATCCAATACTATTGACTAATTTAAAATCTTATTCAGTGAG

TCMCACTCTMATMGCAATGGAGATGGTTCCATTCATTTTTTTGCAAGTATCATTTTT 

ATMACATAMTTTCCTGAGATTTTTGTTTTCATCTTAGCCTCTGTGGAGCTGCTTCGTG 

GTTATGATAAGTGCTGTGTGATGCTCACCTTGGGAGGTCTGCGACATATATTGAAGTCAT 

CTCTAACCTGAAGTACTGACAGACTTTCTGGAAGAAAAGGCTTGTAGGAGGAAACTTCAG 
AATTCTATTAAATGGTGTAAATGATGAAATTATAGTTGATATATGCTAGAGCATCAGTGC

TGGGTATTTTAGAAGGGATGGA 

Sequence 1099 

CGGGCCTGTTCCTAGAGCCTCATTGGAGACATTGACAATGCCATGAGGACCTTCCTCAAC 

TACTACACTGTATGGMGCAGTTTGGGGGGCTCCCGGAATTCTACAACATTCCTCAGGGA 

TACACAGTGGAGAAGCGAGAGGGCTACCCACTTCGGCCAGAACTTATTGAAAGCGCAATG 

TACCTCTACCGTGCCACGGGGGATCCCACCCTCCTAGAACTCGGAAGAGATGCTGTGGAA 

TCCATTGMAMATCAGCMGGTGGAGTGCGGATTTGCAACAAAAGATCGCTTTGGCTGC 

TTTGTGAAGAATAGATTGAAAGGGTCAAAGGTGAGAGCCATCTCACATCCATGCAGGAAC 

CAAGCAGGCAAGATATAAATATGAAAGTAGAAGAAAATAGTCTGGAAGAAAATCCC 

Sequence 1100 

CGCGTCCGGGAGTGCACCCCAAGATCTAACAGCTGTTTCAGAGCTGCTCATTTTAGAGTG 

ATTGGTAGGGAGTTGGTGGCTCAGAGGTCCTAATCAGAATGTGTCCTGGGTTCTGAATGA 

CTAGCAGACTATCATTAACCAMTAAATTATGGGATTTTGTCTTAATTATATACATATAC 

ATATACACACACATACACATACACATACATGTGTATATATTCCCTAAAACTTAATAAAGC 

TCMATAATAAAATCAGATTTCTTMGTATTCCAATrCCCTTTAAAATGTAAATCAGATT 

TTATMTTCTTTTGTTCAAMCTGTCCATTGGCTCCCATTTCACTTAAATCAAAAGCTAG

TTTTTACMTMGCTAAGATAGCAAACATTATTATCTATTTACTTATGAGTTACTTATGT 

AACTCAAGCATCCAATAACACTGTAGGGTGCTCAATAAAATAGTTGCTGAATGGATAACT 

TTC 

Sequence 1101 

TGTTTTACGACAGAGCTTAGTGCAGCCNGTTCTTGATGGCTGTGCAATGCTTTCCTTTTA 

AGAGTGGAGTTAGCCTCGTCATMAGCGTGTTTTTGAGTCTGTTCGAACGGGTCAACAAC 

GMGGGAAGTTTCAGGCAGATCTTGTATGCCTGGCCCTGGTGGCTGCTTTCATTTCCTTC 

CAGTATCAGTGCTAAACAGGAATGAACATGTTCAAGCCCCGTCTCACCCACCTCTGGCAT 

CTTCGCCCTAACTCTGCCCTAGAAGACCTTTCCTTCCGTATCGTCAAGAAAACTGAAGTT 

GCTGTTTCACTCCTTCTCCCACCCAGAAACTTCGCTGCATCTTCCTGGATCCCTAGCTCC 

TTGCACCCATGATCCTGTCTCCTTCCTCAGCCCGGCTTCTGGCTGAGCAGCCTGCACTTG 

CTGTCTTCACTCCTACACGCTGCCCCCACTCCTACACGCTGCCCCTGCGTGCTTNTCACT 

TCTCTACCCTTCC 

Sequence 1102

GTCCGTATCCTATCTTCAMTTTTTTAAATATGTTCAAAATATCTGGAGGGTGAGAAGTT

ACCMGTTTGTATGTTTTGTTTGACTCACCATCTTTATTTTCTGTATATGTAGTAGCTGG 

CMTTGCATATATTTTCTTGATTAACATATTAGAGACTGCTTCCATCATCTTATGTAAAC 

CTGGAAACAAGCTGAAACTAGTCTTTTCTGAAGAACCGTGTATCAGTGTTAGATGTGCAT

CCCGTTTTGTCATTCCCTCAGACTTTGAATACAGTCATTACTCTCTGGAAGAGAAATGTA

AGTATATTTTTTGTTATCTGCAGTATGGTTTAACATGTATTAATAATACACATATGCAGA 

CTCACTAAAGTATCCCCAGTAATTAGTAAATTCCAAAT 

Sequence 1103 

ANTTGGGTACCCCCGGCCNGGCCAGNTGCGCGCGGCGGGGCATGCTGCTCGTCCCCCCGC 

GCCCCCGGCCCGGACACTTGGCGGGTGCCACGAGGACCCGCAGCAGCACGTGCGGTCCCC 

CGGCGTTCCTGGGCGTGTTCGGCCGCCGTGCGCGGACCTNNGCGGGAGTTGGGGCNTGGG 

GGGCGGCNGCCGTTGGTNCGGACAGNCNGGGTGCGCACTTGGGCCCCCNTGNCCATGGCN 

GCAAAGGTGGACCTGAGCACCTCCACCGACTGGAAGGAGGCGAAATCCTTTCTGAAGGGC 

CTGAGTGACAAGCAGCGGGAGGAACATJACTTNTGCAAGGACTTTNTCAGGCTGAANAAG 

ATCCCNACATGGAAGGANATGGCGAAAGGGGTGGCTGT 

Sequence 1104 

TGCCNCGCGTCCGAGCATCTCAGGTMCMTTTGAGCATAACTTTAACCATAACTTATGA 
TAGCATAATAACATTCATTAGTAATTCAGTAGCCGTATGTGCCAGGCTGTGTTAGGTGCT
TTATATATTGTTTAATTTTTAAAMCTTGTGGAGTGTACAGATTGGTAAGGTGACATTGT 
ATCACAMGCTAGTCTTTGAGTCCAAAGTTTTGTGGTTTTATGTTATGATATACTTTTAT 
CATGGMTTGTCTTATTAMTGTTTTGCCAGTGGTTCTTAMGTGTGTTTCTGACACCAG

TAGCATTGACTTCACTTAGAMCCTGTTAGMATACAAATTATTTGGCCCCACCCAACAC 

TTGAGTCACAMCTTTGCAGATGGGGCTCMTCTGTTTTMCAAGCGCTTCATGTAATTT 

TGATGCAGGCCTMGTTTTTGAGCCCTGCAGTATGCATTTCTATTTTTAAGCAAAGATCT 

TGGTCTTTCTTTTTGGACATTGTAGAMTAACATGMCCTGGTTTTTGGm 

NTTTGGTTTGGT 

Sequence 1105 

ACGCGTCCGCTCTGGTCAAGCAGGCGGTACTTCTCCTTGGATGTCTCAGCCACAGTGCCT 

ATCAGGGTACTGAGGGAGAGCACACATGGCCCGAGGCCCTNGGAGCCCTCGGAGGCTGAG 

TCAAMGAGTCTCCCTCGAATTGGTGGGCCTTTAGAAGACTTGGCTTCTTCACTGGAGAG 

CTATAMGTAAACACCACACTGAGGGCCCTCGTCCCAGGMGGCCTTCAGAGCATTTTCA 

TTTCTGAACACGTCCCCTCATCTTTCAAGATTTTCTGGTCCTCTAAAGCTGAGAAACTAC 

AAGCACTGAMTGAGATGAGTTTTGATAAGGATGGTAATGAAGCACAAAAGCGTTATTCA 

CATTACTCACTGACTTTMTATAATTTTGAAATATTTCATACTTTTGAAAAACAAAATAG 

CCTGGGCGACAAGANTGAGACTCCATCTCAAAGGTAAAANAAATTTAANCTGGGTGCCNG 

CCGCTTGACTATGTCTAGAGAAAAAAACTTCCACA 

Sequence 1106 

GACCCANAGAAMGGNGCCAMGGGCATGTCMGCMTTGAAGTTAAGCTCATGTTTTTA 

AAGATCCGTTTATTGAGATGATTTTGAMTGCTCCTTTACCATTCATAATTTAAAAATAA 

AGTTTMCAATGGTTTAAATTCANAATGGATTAAAATGGAGTTGGGGGTGGAAAGTAGAG

CCATTCTTAGTAAATATAAATAACTGAAAAGTTCTTCTGAGGAGACTATGTACCGAAGTT 

ATCATTGCATCTTTCAGTATAGGCAGATCTCTCCCTCATATAACCGGATGTTTCTTGGCG 

CTTGGMTATCAGATAMGGTAMGTTTAAGAAACTTCTCTAGCGGGGGATTTAGGGAAC 

TTCTTAAAACCTAGAGTTAAMGCTGTTGCGTGT1TGTTGTGTTATTTTAGACCAATCAA 

CTTCATAGGCTAGACTAGTCTAGA 

Sequence 1107 

ACGCGTCCGAAAATTCACAGGGTGTGTTGGCACACGCCTGTAATGCCAGCTACTCAGGTG 

GCTGAGGCATAAGAATTGCTTGAGCCTGGGAGGCAGAGATTGCACTGAGCCGAGATCGCG 

CCACTATACTCCAGCCTGGGCAACANACATCCTGTCTCAAATAAATTAAATTACATTAAA 

TGTTTAAGAAGAAGTCTAAATAAGATTCATATGCTGCCCTCCCTCAGATAATGAGGGAAC 

CTGGGGTACTTCTGGGCTACTCTGGGGGACAAAGTATAACTATTCAAATGGCAAGTTGAA 

TTAGTACAGTCTAGGAGCCTTGGAGATGGCTTCTTGAAAGAGGTAGAACCTGAAATTCTC 

CTTCCTTGAGGGACGGNCAGGATTTGGCCAGATGGAAAGGCAAGTGGAAGGCTTTGCAGG 

GACAAGCAATGTAANCAGANCCTAGAAATGG 

Sequence 1108 

TCGACCCCGCGTCCGGNGTMTTCTAGGGGAMTNATATTTCTGAACAACAATGTTGGTT 

TGTGCAGGAAAATCACCAAAGAACATGACTAGAAAGTGTATAGCTACAGTTTCCCTCTTT 

TAAATGGGAATAGCAAAACATATAAAGAATATTGATAGGCCGGGTGCGGTGGCTCACGCC 

TGTAATCCCAGCACTTTGGGAGGCCGAGGCGGGCAGATCAAGAGGTCAGGAGATCGAGAC 

CATCATGGCTAACACGGTGAAACCCCGTCTCTACTAAAAAATACAAAAAATTAGCCGGGC 

GTGGTGGTGGGCGCCTGTAGTCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATGGCGTGA 

ATCCGGGAGGTGGAGCTTGCAGTGAGCCGAGATCGCGCCACTGCACTCCAGCCTGGGTGA 

CAGAGCGAGAGACTCTGCCTCAAACAAAAAAAAAAGNNAAAANAAAAAA 

Sequence 1109 

CCGTCTCTGGCTTGGCCAGGTTTAAATTAATAAAAATGAAGATGAAAATAAGTTGTCAGA 

TTTAGGATGTATTTTAGAAACCCMCTGATAATTTGCCAACTAATTGGATGCAGAGAGTA 

AGAGGGAGACTCAAGAACACCTCTAAGATTTTTACCCTGATCAATGGGATAGGTGAAAGT 

ACATTAATGGAGATTGAGAATCCTGGTGGAGGTACAAGTTTAGGGGTACTGAAGAGTGCT 

TTTGGACATGTGAATTCTTAGAAGCCTACTAGATTCTCCAAATGGAGACATAAAACATAA 

TTGAATACAAAAGTCAGGAGTTCAGGAGAGGGCTGAGCTAAAGATACAAATTTGATAGAC 

ATGAGCATTTAAAAAAAACTGCATGAAAATACTAAAGATAGGCTGTCCTGCCTATGGAAT 
AGCCATTCTTrGATCCCTTTACTTTCTTMTMACTTGGTTTCACCTTACTCTATGGACT

TCCCCCAMTTCTTTCTTGTGTGAGGTCCAAAAACTCTCTGTTGGGGTCTAGATCAGACC

CTTTTCAAGTACATCTTNCTGATGAACCACAAANGGATTATACTAAAGAGACCCCCCACC 

Sequence 1110 

CCGGNAATTTGCTTATTCTAATTGAGACACANTGGTGGGGAGTGGGGGTCGGGGACTACA 

CAGGTGCATTTTCTGAACATTTATAAAATGAAAAAGATGGAGGCTTGGCTAGAATGGTTA 

ATCCCCTTTTCATTCCTCTMTTCTATGACMTTTTTTTAAAAMCCAAACACAACCAAA 

ATMGAGTGGACAGTTGAGMTTACCTTTAGGTTCCCATGACCCTGAAGACTGTATTTGG 

CCTTGGATCCATTAAAAAAAAAAAAAAAAAAAAAAA 

Sequence 1111 

CCCACGCGTCCGCGGCCATTTCTGTATCCCCCTGCCTGGGTTTGCTGCCCTTTATGCTCC 

TACCTCACCAGGTACAAGGAACATGAAGATGGCTATATGCGGCTGCAGCTGGTTCGCTAC 

GAGAGTGTAGAGCTGACACAGCAACTGCTGCGGCAACCACAAGAGGGATCGGGCCTGGGA 

ACGTCGCTGAACGAGAGCAGCCTGCAGGGCATTATTCTAGAAACAGTGCCAGGGGAGCCA 

GGACGTAAGGAAGAGGAAGAGGAGGGCAAGGGTAGCGAAGGGACAGCCCTCTCAGCCTCT 

CAGGACAACCCCAGTTCTGTCATCCACGTGGTGAATCAGACCAATGCCCAAGGCCAGCAA 

GAGATTGTCTACTATGTGCTGTCTGAAGCCCCAGGGGAGCCTCCCCCAGCCCCTGAGCCA 

CCTTCAGGGGGCATCATGGAAAAGCTTCAAGGAATAGCTGAGGAGCCAGAGATCCAGATG 

GTTTGAAGGCCGCAGAGCCAGACCATTTCTTCCCAGGTCTGAAAGTTTGAGCCAGGCAAG 

TGGCAGTGCCCCTAGTGGGCAGCCGTTGCCAATGGATGCC 

Sequence 1112 

CCCCCGCGTCCGTMTTTTAAAGAACCTTGTTATTAGAAAATCTCAGCCTAATACAATCT

GMGTTMGAGTTTTAGCAGCATTGTTTTTCTAAGTAGATTTAGCTATAGATT^ 

GGCCAAACMGGMGAGTATATGCCCTTGTAAATGAGTCTTGTTTTGTTATTTAAATAGT 

CAGTCAAAACGTAGAAATCAGTATACGTAAMTAAAATGCATGAGACTATTAMTCTTTT 

CATATACTCTACAAATAAAATGAAATCTGTGTGTGGTCCTGGTTGACTGGGCATCTAAAG 

GGAATCAGAAAAGAGATTGTGAAMGTTATATATATATCCTCTTCCTTATTTTAGTTTTG

CM I I ICCTATTTTCCATAATTMGTGCCGTTTACAAAGTGGCATCAAAAAATTGAAGCA 

GGCCAGGCATGGTGGCTCATGCCTGTGGTCCCAGCAGTTTGGGAGGCTGAGGGCAGGTGG 

ATCACTTGAGATCGGGGGTTCGTGACCAGCCTGGCCAACATGGTGAAAGCCCATCTCTAC 

TGGGAATATAAAAATTAGCCCGGCGTGGTGGCATGTGCCTGTGGTCGCAGCTACTTGGGA 

GGCTAAGACAGGAGAATTGCTTGGGCCCTGGGAGGGGGGAGTTCANNNGNCCTGGANCGN 

CCCCTGNNCTCNANCCCTGGCAACCANTGNNGACACCNNCTTAAAAAAAAAA 

Sequence 1113 

TCGACCCCGCGTCCGGTTTTTTGTCCCAGCAGTGGCATTTAMTTACTGTACTTTAAGAC 

ATGGAATTGCTGGAGGCTTGGAAACTTGAGTGCAATTTCCCTAGTACGACCTCCAAGGAG 

MTAGAGCAAMCAGTGGTAGGAAMACTCTCAMATTTTACCCMTTGTATGTTTTCTA 

CATTGTCAGTATCTAGTTTTATATAGTTMTATGTACTTCTAAMTTTCTGACAGTGNTT 

GGTGTATAAAACAGACCAAGCTCAAGATGTAAAGAAGATTGAGAAATTCCACANTCAACT 

AATGCGACTTATGGTAGCCAAGGAAGCCCGCAATGTTACCATGGAAACTGAGTGAATGGT 

TTGAAATGAAGACTTTGTCGTGTACTTAGGAAGTAMTATCTTTTGAATTAGAGAAAGTG 

TTGGGACAGAAAGTACTTTATGTAACTAAGTGGGCTGTTCAGAAGCTTAGAGGTCATTTT 

TTGTAATTTTNTTTTTAATTACTTTAGAAGAGCTAGGGATGCAMTGTTTTCMTTT 

AAG CCTTTATTT AC I I I I i GGGAAA 

Sequence 1114 

TCGACCCCGCGTCCGATTCCTTCTTCATATATTATGTCAGAAGAGTTTGAGAAGAAATGG 

TATTAATTCTTCTTTAAATGTTAGGTTGACTCACCAGTTMTGCAGCTATTTGGTCATAA 

ATGTTTCTTTGTTMTCACTTTCGATTACTAATTCAATCTGCTAGGTTATAGGTCTATTC

AGATTTTCTCTTTCTTCTTGAGCCACTTTGGTAGTTTGTGTCTTTCTAGTGATTC 

TTTCATCCAGGACAGCTAATTTGTTGTTAGACAGTTGTTCACAGTATACTCCTGTAATCC 
TTTTGTATTTCTGTAAAGTTGGTAGTAATGGCTCTGCTTTCATTTATTATTTTAATAATT
AGTCTTCCATCTTTTGCTCAGTCMTATAGTGAAAGGCTTGATCTTTCAAATMTCTATG 
TTTATTCATTCTACTGCTCTCCAAACTTCTATTTTATTGATTTATGCTCTAATTATGCTC 
TCTATTATTTCTTTCATACTGCTAGCTTTGGATTAGGCTTATTTTGNCTTCTTC 
Sequence 1115 

GCCCCGCGTCCGGGATGACTAATGAAAGCAATNAGCTTGAACATTTAGAAAAAATTCATA 

TATGATCTAAATTTTTATATTATCATTTCTGTGCCTTCTAATTCCTGCATCCTGTTCAAA 

ACATCTTTCCAGACATTAACTTACACATTGTATAAAACCGACCAAAATGATTTCCTAAAG 

TTCATGCAAAAAAAAAAAAAAACAACCTAATTTTCTGTTAATATAAAAGAAACTTCAGTT 

TACTGACCGTGAAACAGACTATGTACTGACATCCAGGGTAAAGTAAAAGACTTTTAAATA 

TTGGTCATTAAAGGACAGGAGCTAAGCTAGCAAAGCAAMCATCTTTAGGACTTTGCAGA 

TCTCMGCAGTTAACCAGGCTCTGATTCCCTTCCACTGTTTTATGMTTAATTCCAGTTC

TTTTCATGTATCTTTGAACCTMGATTATGMGTMTTTCCCTATTAGGGACTAGAATGA

CTTCAGTTTTTTCATITGATAAAAATCAGAACTGCTACCTTTCCCTTTT^ 

AAATGTAGATGAGTGCATTAAGGGTTGTAAGGATCTTTATCATTTTATGNCATTCATTGA 
AAATTGAAATGTTCATTC I I I I I AATGGTT 
Sequence 1116

CNGCTTTCGTCTCTTCCTTTNAAGTTGATACCCTTCTTTTTCTTGTCATTT^ 

TGGGACCTCCAGAATMTGTTTCATGMGTAGCATGTATCCATATCTGGTTCTTGACTTT

TTCATCATTATAATTGTTTTCTATGGGTTACTTATCAGTTTMGAATGCTTAATTCCTAG 

ATGAACTAAGAGTGTTTATTACATGTTGAGATTTATGGTATGCTTTTTCTTCCTCAAGAT

MTGCATTTTTTGTATTATCTGTTAATGTGATAGGTTATCCATTTGTGTATTTTCAATCA

TTGMCAACCCTTGATTTTTTTGGATAAACTCTATTTGGTCATTATGCATCATTCTATAA

ACCCTGCTGMTTTTTCATTTGCCAACATCTTATTTCAGATTCTTTT

CMTGAGATTTGTTTTCTTTTGCAATTTGTTTTGAATTTTTGGTAT^ 

CCTTATMTGGAAMTACATATTTCTCAMCTTTTACACTGATATATTCATAGTATTTTT 

TTATMTTTGAAAMTCTTGTCAGTATCTGTATTMGGCCTNCATTTCAGTTCTGCTATT 

TCATATTGCCTTAAGGTGGCTATTTGGCTCTTTAAGGACCCCGATTTTGATTTTGTCATT 

TTTAAAATAAACCCCATTTATGCTATAAAAAAAAA 

Sequence 1117 

GCCTTTTATGGTGATGGAATATGTCTCAGGAGGAGAGCTATTTGATTATATCTGTAAGAA 

TGGAAGGGTMGCTGTTCTGCTTTMTTCTGTATGTATTTTGTNNCTNGNCCTTTATCCT 

TTACTAGCATCMAATGTCAGCMCCAATTTTAAGAGGTCTATTTAATAACCAGTTCCCT 

TAGTCATATATTTGTTTAGAATCATAAACTATGTAGAACTAGAAAGGATCTTAAAGATTA 

TCTCCTTAGCCTGTTTATACAGATGTGGATACTGAGCCTCGCGGCTTATATGATTGCTCA 

CAGTAACGTGATTTATTAATGACGGAATTGGCTTGAGCCCCCAGAACTCATAATCCTCAG 

ACTTATGCTTCCAGGGTATACAAATACTTTGAATATGTATCTTAATGTAATTAATCGTAC 

CAAATATATTATTATTACT 

Sequence 1118 

GCGTCCGTTGTCATCTATTTACTTTACATATGTCATAMCCTMCACTACATGGTCATTT 

TTGTTTMACAGTCMTTACCTTTTAAAGGGATTTGAATAATAAGTCAAAATCTAATACA 

TTAACTGTGTAGTTAGCATTTCTGGTGCTCTTCTTTCTTTTCTGTAGATCCATACTTCCA

TCTGGCATTATTTTCCTACTGCCAGAAGGACTTCCTTTAACATTTCTTGTAGTGTAGATC

TGCTGGTGATGAATTCTTTCAGCTTTTGTMTTCT^ 

GAGTATAGGTTAATAGCTTTTTCCTTTCAGTACTCTAAAGATGTTGCTCCAGGCCAGGCG 

CGGTGGCTCACTCCTGTAATCCCAGCACTTTGGGAGTCTTGAGGTGGGCAGAACACTTGA 

GGTCAGGAGTTTGAGACCAGCCTGGCCAACATGGTGAAACCCCCGTGCTTCTAAACATAT 

TAAAGAAAAAAAAGA 

Sequence 1119 

NCGTGACATGCTGGCTGCTAGTNAGCTCCCCCATGATTGTCAGCTTCCGAGCCCTCACTA 
GAAGCAGATACCACCCCACCACCATGTTTCCTTTAAAGCCTGCAGAACTGNGAACCAATT 
AAMCTCTTTTCTTTATAMTTATCCAGCCTCAAGTATTTATAGCAACACAATAATGGCC

TAACACAACCTACMCTCTCTATATGTATTTGTGTGTATTTAAAACATGCAGGAAATAAC 

ACAGAATCCAAGGCACCCAAAACTCATTAAAAAATGGAATCAAGAATTCATATGCCATTA 

TGAMTTAGCCAGTCCTAAAATCTGACCTCTCTGCATTTTCACATTATTCTCCTCTCTCT 

ATCCCTGCCTTCCTCCCTCCCTTCCTCCAACTGTCAGAATTGTCCTGTAATCAAACATGT 

TCACATCACAGCTTTTCATTTTCTATTTCCAATCAATTGACCAGTCTAGCCAAGTAGCAT 

CCTGGATCCCGTATTACATATTCCTAGGACAGGAAGCCAGATTTTT 

Sequence 1120 

AGCTCTTTAGCAGGAGACAATTCTTAAACTTAAATTAAACTGAAAAAGCCACAGAAAAAA 

GGGTTTGACACCCTAAAGCCAGTGTCCAAATGAACGCTACGGTTGNCCTCATAGGTGTGT 

TATGAATGTTTACCCTGACCTCCTAGAAGAAAAGGAAAAAGAAAGGAAAGAGAGAGGGAG 

AAGGTGAGGGAAGGGGGGAAGAAAAGAAAGGAGAGAAAAGAAGACAGAGAAAAAACAGGG 

AGGGAGGMGCTGGGGAAGGAAAMGACCATTTGCTGACTCCGTTGTTTTATTTCCCAGA 

ATGATTCAATACCTCAAGMGATTTCACTCCAGAAGTGTACAGAGTTTTCCTCAACAACC 

TTTGCCCTCGACCTGAAATTGATMCATCTTTTCAGAATTGTAAGAGTACACATTTTAAG

CCATATCTTTTTCAGCTTGCATTGATTCTCAGGTGGCTAGAGCAGGACTTGGAGTGGTAA 

TTGGAGATGGAAGAACATCATACACTGTGTCTAAA 

Sequence 1121 

CGGCCGAGGTACTATAATGGTCCCCATCTTMTTTGAAAGCGTTTGAGMTCTTTTAGGA 

CANGCACTGACGAAGGCACTNGAAGACTCCAGCTTCCTGAAAAGAAGTGGCAGGGACAGT 

GGCTACGGTGACATCTGGTGTCCTGAACGTGGAGAATTTCTTGCTCCTCCAAGGCACCAT 

AAGAGAGAAGATTCCTTTGAAAGCTTGGACTCTTTGGGCTCGAGGTCATTGACAAGCTGC 

TCCTCTGATATCACGTTGAGAGGGGGGCGTGMGGTTTTGAAAGTGACACAGATTCGGAA 

TTTACATTCAAGATGCAGGATTATAATAAAGATGATATGTCCGTATCGAAGGATTTCGGC 

TGTTGAGCCAAAGACTGCGTTACCCTTCAATCGTTTTTTACCCAACAAAAGTAGACAGCC 

ATCCTATGTACCTGCCCG 

Sequence 1122 

CCCTTTCGAGCGGCCNTNCGGGCTTNTACGCGGGGGCAGCGGGAAGCTCGCAGCAGCTGG

GGAGGAGCCAAAGCCTCGGCGCTCACCTAAGCCGCAGGGAGATACACCCAACTGGGAGAT 

GAGGAAACAGCAACCCAGAGAGGAGAACTAACCCACACAGGATCATTTCGCGAAGGAGCA 

AGGCTGAAGMCCAGACCTGGACTTTCTTAGGACAAACTTACTGCAGCTTGAAGGAGCCA 

ACCATGGATTTGAGGCGTGTGAAGGAATATTTCTCCTGGCTCTACTATCAATACCAAATC 

ATTAGCTGCTGTGCTGTTTTAGAGCCCTGGGAGCGATCTATGTTTAACACCATCTTACTA 

ACCATTATTGCTATGGTGGTATACACTGCCTATGTCTTTATTCCAATCCACATTCGCCTG 

GCTTGGGAATTTTTCTCAAAAATAT

Sequence 1123 

CCCTTTCGAGCGGCCGTNCGGGCAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACC 

ACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACA 

ACTGCMAAMTTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATTCTGAGGCTTTGCA 

TGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGA 

GAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAG 

AATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAG 

TTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATTGGACTAAGACACTGGCCAT 

ACCACTGGACAGGGTTATGTTAACACCTGAATTGCTGGGTCTTGAGAGAGCCCAAGGAGT 

TCTGGGAGAGGGACCAGATTGGGG 

Sequence 1124 

CCCTTATTTTNGGCNTTNGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTAT 

AGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCA 

MMATTGCCAAAATGCNACTTTCTMAAATGGAGCAGATTCTGAGGCTTTGCATGTCTT 

GGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAGAGGGG 

TTTGGAGTGTGGAAGCCTNATCCCTTCANCATCAAGCTGGAATGGGGAATGAAGAATGGA 
NATGTGGTGCCCACTAGGCTACTGNTGAAAGGGAGCTGAAATTCCTCCACCAAGTTGGTA

TTCAAAATATGTAATGACTGGTATGGCAAAAGATTGGACTA 

Sequence 1125 

CCCTTANCGTGNTCNCGGCCGAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCAC 

TNTATAGAGGGTGGGAAAATAAACCANAAATCAAGGGAGAAAGAAAAGATGAAAGACAAA 

CTGCAAAAAATTGCCAAAATGCGACTTTCTAAMATGGAGCAGATTCTGAGGCTTTGCAT 

GTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAG

AGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCGTCAAGCTGGAATGGGGAATGAAGA 

ATAGAGATGTGGTGCCCACTANGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGT 

TGGTATTCAAAATATGTAATGGACTGGTATGGCAAAAGATTGGACTAAGACACTGGCCAT 

ACCACTGGACAGGGTTATTGTTAACACCTGAATTG

Sequence 1126 

CCCTTTCGAG CGGCC GCCCGGGCAGGTAC I I I 1 1 I I I I I I I N I I I I I I I NNGGACTTGCG 

ACAGTTCCACCTTTTACTCTCATTGGTAAAATCTCCTTTTAATTATTAAAAATATTGATA 

AATTTATTAAATTMGTCTTTNATTCTTTTGTMTCAGMGAGGACATTAATGTTGCGTG

TCTTGACTGTCTTTTTTGCTTTGTAGATTTATTTTGTC

TTTGTGNTTGATTTTTCCAGMGCAGTTACm 

TTTTGCTTTAGGCATTAGTTTCTTCTTCCTTTATAGNTTTTTCAAAATCAATTAATTCCT

TTATTTGTTTTGAAAGMGTAAATTTGGGGTMTTTTTTCTTATC 

AGTTAAAAAATTACCAACTGATTGCATTTCTTTACTTMTTTT^ 

CATCAAAAAAAAGAATTTMNAATTAATTTACCTTGTTCAGGTCTTGGAATTTNCACGCC 

CTCTAAGACGAAGAGCCACTTTTACTCTGCGTATCTMTAAAMTTCTTTGGCTTTTTTT 
GCTT 

Sequence 1127 

CCCTTAGCGTGGTCGCGGCCGAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCAC 

TTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAA 

CTGCAAMAATTGCCAAMTGCGACTTTCTAMMTGGAGCAGATTCTGAGGCTTTGCAT

GTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAG 

AGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGA 

ATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGT 

TGGTATTCAAAATATGTAATGACTGGTATGGCAA 

Sequence 1128 

CCCTTAGCGTGGTCGCGGCCGAGGTACATGCTANAAAACATTANCACAGATACGACAGAG

TGTGGI I I I I I I I i AG AAAT GGGTAATTTCTCTCTCCAGTATCCTTTCACTTGTATGAGA 

TATTTCTCCTCTCCTGTTTTCACAAACCAAGAAATCCCCAGGTAGGCCAATCCCAGAGGT 

GCCATTTAGCAGTATGCAGCAGCCCAGTTTCAGCATAACAAAACATGCCTTGGTAGTGGC 

TCTCTCATGCAAATAAAAGAAAGCTTAAGAAATTCTTGTTGTAGGTGGATTAGGCAAGGC 

TGCCATTCAGCTGGTATAAGCTAAAAGTAAAAMTCAAAACGCTCAAGAAAACGGACACA

ATTTTGGAATGATTAMGATGTCTTTATAMGTTTTTTTCAAGACTTCATTCTA 

CAGAATAAAAAATGGGTGTCAGCTCACTTGTAAGACACCMCCAGATTTTCCTTATACTG 

TCTCAAAATTTAAAGATCAATTTCCCCAGAAGGTGTNCAATGCATCATAAAATGGCCCTT

TTTTGAGGATGGGAGAGGAAGGGTTGGGCAGGATGGAATATTAAATTGTACATGGATAAA 

CATGCCMGACTGTTATCCMTCTAGATMTTTATATACATTTTGATGACTTAAGGAAAA 

CMAGCAATCATTTGGTGACAGCCTAAAAAGCNTGACCNTATTTAACATACTTAGGAACT 

TTTTTNGG 

Sequence 1129 

CGTTCAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTATAGAGGGTGTAA 

AAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCA 

AAATGCGACTTTCTAAMATGGAGCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTC 

AGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTG 

GAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGAATAGAGATGTGGTGCC 
CACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGTTGGTATTCAAAATATG

TAATGACTGGTATGGCAAAAGATTGGACTAAGACACTGGCCATACCACTGGACAGGGTTA 

TGTTAACACCTGAATTGCTGGGTCTTGAAGAGAGCCCAAGGAGTTCTGGGAAGAGGGACC 

AGATTGGGGGGTAGGGTCACGGGCTTGGGTGATA6AATTATTTCTCGAATGACTTTCTTG 

AGTGCCMTTTGAACTGTAACATTTGCTTANTCACCT7TAGNGGAGTAATCTCCTGGGCT 

TGGTTCTATATTTATATAAAAG 

Sequence 1130 

CGGGCAGGTACAGCCTCTCGGCCCGGCTAAACATCATCGTCTTGGTAGGCCATTACCCTA 

CCAACTAACTMTGTTCCGCACCCCCATTTTTMGTGAAGCTGTGMGCTCCTTTCTATT 

ACTCATCATGCGATAAATAACTATATCCGGTATTAGCTATTGGTTCCAATAAGTTATCCC 

CAGTCTTAAANGTAGGTTAAGTACCTCNGGCCGGCCACCGGGNTGGAGCTCCAAATTNGC 

CCTATAAGTGAGGTCGGATTTACGCCCCGCCTCACTTGGCCCGNNGTTTTACAACCGTCC 

GTNGACTNGGGAAMCCCCTGGGCGTTTACCCCMNCTTTAATCCGCCTTGGCAGCACAA 

TCCCCCTTTTCGNCCAGGTTGGCGTNAATAANCGAAAAAGGCCCGGAACCGAATCGGCCC 

NTTTCCNAACANGTTNGCGCCAGTNCTGGAATNGGCNAAATGGGGACCCCNCCCCTTGTT 

MCCGGGGNGCATTTAAACCNCCGGCCGGNTGNTGGTTGGNTTACCCCCCAANNGGTGAC 

CCGNNTTCMCTTTGGCMGGGCCCCTAANGGCCGNTTTCTTTTGNTTTTTTTC 

Sequence 1131 

CGANGTACGCGGGGCTTTCCAAACAGCATAAGAAGTGTTTGAGCCNTNAGTATACTGAAG 

GAAGGGCTCCCTCGAGTTCTGGTGTGAAGAGATAAATCACCAGAAAAATGATGGCATCAA 

AACAAAAAAAGAACTCATTGTGAATAAGAAAAAACATCTAGGCCCAGTCGAAGAATATCA 

GCTGCTGCTTCAGGTGACCTATAGAGATTCCMGGAGAAMGAGATTTGAGAAATTTTCT 

GAAGCTCTTGAAGCCTCCATTATTATGGTCACATGGGCTAATTAGAATTATCAAGAGCAA 

AGGCCACCACAGACTGCAACANGCCTGAATGGAGTCCTGCAGTGTACCTGCCCGGGCGGC 

CGCTCTTAGAACTAGTGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTGAT 

ACCCGGCNGACCTNGAGGGGGGGGCCCCGGTACCCAACCTTTTGT 

Sequence 1132 

AGCTCACTCAAAGGGCGGTAATACGGGTTATCCACAAGAATCAAGGGGATAACCGCAGGA 

AAGAAACATGTGAGCAAAAGGGCCAAGCAAAAGGGCCAGGAACCGTAAAAAGGGCCCGCG 

TTGCTGGCNGTTTTTCCATAGGGCTCCCGCCCCCCTGGACCGAAGCATCACAAAAAATCG 

ACGCTCAAAGTTCAGAGGTGGGCGAAAACCCCGGACAAGG 

Sequence 1133 

CCCGGGAACAAAAGCNGCAACCGNGCCCCCCCTCCAGGTCNACGGNNTCGANAAGCNCGA

AAACCGMTTTTTGNAGNTTTNGGGGACCCACTANTTNGNGAGCGGGGGCCGANNNAGNG 

CGGC 

Sequence 1134 

CCCCCCCNCGAGGNCGACGGNANCGANMGCAGATNNTNAATTTTTGCAGCCCGGGGGAT 

CCNGGAGNGNGAGAGCCCACCGNGGGGGNGCGCCAACACCCCCGANAGAGAGNCGAANNA 

CGCGCGCNCACNGGCCGNCGANAAACAACGGCGAGACNGNGAAAACCCGGGCGANACCCA 

ACANAAGCGCCAGGCAGCACANACCCCAANCGCCAGCNGGCGAAACANNnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnGANACCGCGCAGCGNGACCGNAACACNAGC 

CAGCCGCCCNAGCGCCCGCNCCAAGCGCNNANCNNCCCANACCAACANCGCCACGNACGG 

CCGGGCAANCCCCGGCAAGCACAAAAAGCGGGGGGGCNCCCTGGAAGGGGNNCNCGAANA 

AAAGGCAAGACGGGNACCNCNGAANCCCCAAAAAACC 

Sequence 1135 

GGGAGTNNGGGAGCGGCCGAGGGCCAAANGANCCGCGAAGACAAGGCCATCCACCACTNN 

ANAGAGGGGGNAAAAACAAACCAGAAACCAAGGGAGAAAGAAAAGANGAAAGACAAACNG 

CAAAAAANNGCCAAAACGCGACNANCAAAAAANGGAGCAGANGCNGAGGCANNGCAAGNC 

ACGGCANACCGNCAGGAGCCGAANGAAAAAAAGCAACAAGCAAGANGAAGACNCTGAGAG 

GGGCNCGGAGGCCNGGAAACCNCATCCCTNCAACAACAAAGCNGGAANGGGGGAAANGAA 
GMNCAGAAGAAGGGGGCGCCCACNAGGCTACNGCNGAAAGGGAACCGAAAAACCCCNCA

CCAANNAGGNATNCAAAAAANNGGAAACGGACCGGGNCANGGCAAAAAAAANGGAACCAA 

GACACCGGGCCCANACCACNGGACCANGGGGNAANGGAAACACCCCGGAAATGGCAGGGN 

CCCCGAANAGAACCCCAAGGGAGCCCAGGGAAGAGGGACCCAAGATGGGGGGGGAAAGGC 

CCACCGGGCCNGGGGNGAAANAACAAAGCCACCGAGGGCCAACCGNAAGNGGCCAANNGA 

ACCCGGAACNANAAGGCNNAANCCACCCTGAAGNGGGAGGAAAACCCCCAGGGCNGGGGG 

CCAAANANNAAAAAAAAGCNGCCCAAAACCCCCAA 

Sequence 1136 

NTTTAAI I I I I IGCAGCCCGGGGGANCCAGGGGNAGGGNGAGCCCACCGCGGGGGAGCGC 
CAANCGCCCNACAGCGAGNCGNAANACGCGCNGCCCACNGGCCCGCCGNANAAACAACGN 
CGAGACGGGGAAAACCCCGGCGNCACCCAACNGAAACNGCCANGCAGCACAANCCNCAAN 

CGCCAGCGGGCGGAANAGCGAAGAGGCCNCGCACCGAACGCCCANCCCAACAGNGGCGCA 

ACAGAAAGGGCGAAA 

Sequence 1137 

TTTGCANTTTNGGGGACCCACNAANTGGNGAGCGGGCGANGAACCACACNNACGGCCCGA 

GNCCNNGNCNACNGGNCNAGAGNGANAANCACAGGGACACCGANCANCCCTNNCGGCANG 

GACCCACAGCAGCAGGNGGANNGCCATACCGAGANGAAGGAAGACNCAGACANCGCCAAC 

CATNCCCGAGNGGACCNGCCCGGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnACGNNAAACAACCGGCGAGACNGNGAAAA 

CCCNGGCGNNACCCAACNNAANCGCCGNGCAGCACAACCCCCNACGCCAGCNGGCGNAAA 

AAGCGAAGAGGCCCCGCACCGAGCGCCCNTCCCAACAGGGGCGCAGCCCGAANGGCGAAA 

NGGGGACGCGCCCTGGAAGCGGCGCAANAAAGCGCCGGCGGGAGGGGGGGGANACNCCCA 

NNCGGGACCGCNACAACNANGCCAAGCGCCCNAANCGCCCGGCACCCANACGCTANCNAC 

CCNNCCNANCCACGCCCACGGGANCGCCGGCNANCCCCGGCAAGCCNNAAAAAACGGGGG 

GCNCCCCNNAAGGGGNCCCGACANANGAGGCCNNACCGGCACCCCGACCCCCANAAAACC 

AGGAAAAAGGGNGGAAGGGNCNACGGAAGGGGGCCC 

Sequence 1138 

CACGCGTCCGGAAAGGAGGGGAGGGGTGGGGCGGGCCGGATCTGCTCGCAGGCCGCACCC 

GCCTCCGGCTGGATGCTCAGAGTCTCGCCCTGATGGCCAGACTGGAGTGCAGNGGCGTGA 

TCTCGGCTCACTGCATCCTCCACCTCCTAGGTTCAAGTGATTCTTCCACCTCAGCCTCCC 

CTCCCAAGGAGCTGGCATCACAGGACAGGCACGGAGACTCACGCCTGAAATCCTAATACT 

TTGGGAGGCCGCGGCAGGAGGATCACCTGAACCCAGGAGTTTGAGACCAGCCTGCCAACA 

TGATGTTGTTATTCATGAGGACCAATGGGTTGGCGAGACAGTACTACAATCAACATTTAG 

CAGTCAGTTATTAAATCTTGGGAGTTATTCATCTATTCAGCCTGAAGAATATTCCAGTGT 

AGTTAGTGATGTTGTACTTCAAGACTTACTGGCATATGTGTNCTCAAAACATTCC 

Sequence 1139 

TTACCCCGCGTCCGGGTAAACAAAACAAAGATCGTTTGTTCTGGAAACAGGTAAAATGGT 

MTCAMTAGATTGTGTTCCAGGAGTGCAAAGGTGGCTTAATATTCACAAATCAGTTGCT 

ATTGTACACCACCTGTAGAAAAGTAATCTGGCATGCAGAACATTCTTATGGTAAAGTTAA 

TGTTCATTTATGATCTTAGCAMTGATGGATTGAMGGGACTTCCTTAATTGCATAAACA 

GACTTCAACAAACAGTATGATGAAATAGTGAACATTTCTCCTAAGATTATAAAAATAAGA 

CAAGGATATCTGCTGTCAATGATTTTATTCAGCATTGTTCAGAAGGACCTAACCAGAAAA 

CTMTGCAAGAMCAGAMCAAAAGGCATMAGATTANMAAGMGTMMTTTTAAAAA 

AGAAAAAGAATATMATCTCTATTTGCATATGCCATGAGTAAATTTGGTAAGTTCCCTGC 

ATAAMGTTATGCAAAAAGCATTTTTTTGATATACCAGCCAAAAATCAAGGGAAATGGAA 

AA 

Sequence 1140 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTG 
AAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGA 
AAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAG 
CAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAAC 
AAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAA

GCTGGAATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAG 

CTGAAATTCCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATT 

GGACTAAGACACTGGCCATACCACTGGACAGGGTTATGTTAACACCTGAATTGCTGGGTC 

TTGAGAGAGCCCAAGGAGTTCTGGGGAGAGGGACCAGATTGGGGG 

Sequence 1141 

ATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGAAGACAAG 

GCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAG 

ATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATTCT 

GAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGAT 

GAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAAT 

GGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATT 

CCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAA 

Sequence 1142 

ATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGAAGACAAG 

GCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAG 

ATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATTCT 

GAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGAT 

GAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAAT 

GGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATT 

CCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAG 

Sequence 1143 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCACTCTATCCATCGTGGATA 

GAGAAACTGAAGCTCTCTAAAGACCCTGCAGCTGGGAGGTGGCAGAGTCAATGGCAGCCC 

TCAGCCCTATCTGCCCTGACATGGCATTCCTCCCATTTCTCACCACCGAACCCTCTAAAA 

TAACMTGTGTGGGGTCCTTGGCTGAGAGACTTNCCTTTTGGGAATCAATCTGAATGTAT

GATGACAAAGAMACAACTTTTGCTTTATACAACCTTCTGGTTAGATTCAGGCACCAAGC 

AGGACACTTCTTTGTGGCGCTCCAAGAATCTTTCAAATTCTTCATCACCAATAACAAATC 

TTTCTGCTTCTCTTAGAGCATCTTCTCCACAATTCTCACCCTCAATTAAGAGGCACTGGA 

ACACTTTCCAGCGGACAGGGTTTAGTGCTTTGATCTGTTCCGTCATGTCCTCTTCCACGT

TG AAACGATTAATGACAG AA I I I I I I! I GGAGGCGACTCTATTAATCCCTACACCACCTN 

CTCAGCTTTTGAAGGGTTTNCACATGGGTTCTTTT 

Sequence 1144 

GNAGCTCCCCGCGGTGGCGGCCGAGGTACGCCACCATGCCTGGCTAATTTTTGTATTTTT 

AGTGGAGACGGGGTTTCACCATGTTGGCCAGGCTGTTCTCGAACTCCTGACCTCGTGATC 

CACCCACATTGTCCTCCCAAGTGCTGGGATTACAGGCGTGAGCCACTGTGCCATGAGGAT 

TAGTAMGTGCACTCATGGTMGTAAAAAMTTTGTTTTATGTTNATGCTGATTATATGA 

AGGTCATCATAGCTTAGACACAATCMAACCCATGGGGMCATCTTTAGMTTCATTTTT

CTCTTTTCTTACAAAAAAAGTAAATAGGTAAAATGGAAAATAGAAGACAACCTATCCTAT 

CCTGGATGAGACACACACATATTTAAATTGAATTATAGACTTAAAATTTAAGTAGGGANT

I I I I l I I I NTGNGNMCAAAAGTTTNCAAMMCCCAAAACTTTTNAGAATCACCCAGTT 

NTTGGAAAATATGATTATGAMGCAGACTTTTTGGATGGGNGCTTMTGACATTTAGGCG 

ACATTTAAAATGCCCCTAGGNGNGGGAACACTTGAAATTGCCANCTAAATTAAAATGACC

CTTTTMTTTGCCTGGACMCAMMANTTTCCATGATTTTGGCTTTTT^ 

GANNAAGAAAAI I I I I I I I I I AG 

Sequence 1145 

CNATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGAAGACA 

AGGCCATCCACCACTTTATAGAGGGTGTAAAAATAMCCAGAAATCAAGGGAGAAAGAAA 

AGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATT 

CTGAGGCTTTGCATGTCTTGGCATTCCTTCANGAGCTGAATGAAAAAATGCAACAAGCAG 

ATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGA 
Sequence 1146

TTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCACGGTACTGTGTCAACAT 

GCAAGGCCAGGTGGCCGTGAGGAATGTCCCAATGACAGGCTCTATCAGTCATGTCCTGGT 

GCAGCTCCCACATGTCTTCCAGAGGATGGAAGCTGAAAACCTAGCTTCAGTGATTGATGC

CAGGTTTMCTTTTTTGTGMCAAGATTTTGCCACAGTATCGTGATGCAGTCATGTCTCA 

CACGCTCATCTATATCCCCTCCTACTTTGACTTCGTGCGTCTTCGAAATTACTTCAAGAA 

GGAGGAATTGAATTTTACCCACATCTGCGAGTACCT 

Sequence 1147 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCA 

CTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAA 

ACTGCAMAAATTGCCAMATGCGACTTTCTAAAMTGGAGCAGATTCTGAGGCTTTGCA 

TGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGA 

GAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAG 

AATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAG 

TTGGTNTTCAAAATATGTAATGACTGGTATGGCAAAAGATTGGACTAAGACACTGGCCAT 

ACCACTGGACAGGGTTATGTTAACACCTGAATTGCTGGGTCTTGAGGGAGCCCAAGGAGT 

TCTGGGAGAGGGACCAGATTGGGGGGTAGGTCC 

Sequence 1148 

TTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGAGTTCT 

GCGCAGCTTCCCGAGGCTCCGCACCAGCCGCGCTTCTGTCCGCCTGCAGGGCATTCCAGA 

MGATGAGGATATTTGCTGTCTTTATATTCATGACCTACTGGCATTTGCTGAACGCATTT 

ACTGTCACGGTTCCCAAGGACCTATATGTGGTAGAGTATGGTAGCAATATGACAATTGAA 

TGCAAATTCCCAGTAGAAAAACAATTAGACCTGGCTGCACTAATTGTCTATTGGGAAATG 

GAGGATAAGAACATTATTCAATTTGTGCATGGAGAGGAAGACCTGAAGGTTCAGCATAGT 

AGCTACAGACAGAGGGCCCGGCTGTTGAAGGACCAGCTCTCCCTGGGAAATGCTGCACTT 

CAGATCACAAGATGTGAAATTGCAGGATGCAGGGGGTGTACCTTGGCCCGCTCTAGAACT 

AGTG 

Sequence 1149 

TGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGA 

AGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAA 

AGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGC 

AGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACA 

AGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAG 

CTGGAATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGC 

TGAAATTCCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATTG 

GACTAAGACACTGGCCATACCACTGGACAGGGTTATGTTAACACCTGAATTGCTGGGTCT 

TGAGGGGAGCCCAAGGAGTTCTGGGAGAGGGACCAGATTGGGGG 

Sequence 1150 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTG 
TGAAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGA 
NAAAGAAAAGATGAANGACAAACTGCAAAAAATTGCCAAAATGCNACTTTCTAAAAATGG 
AGCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATTGAAAAAA 
Sequence 1151 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTG 

TGAGGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGA 

GAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCTAAAATGCGACTTTCTAAAAATGG 

AGCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCA 

ACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATC 

AAGCTGGAATGGGGAACGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGG 

AGCTGAAATTCCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGA 
Sequence 1152

CCGCGGTGGCGGCCGCAGTTACCTGACGTATGACAACCCAGATATCTTGAAGAGGGTGTC 

AGGATCAAGATCAAATGTGTTCATGAGGTTTAAMCAACTGCCMGGATGGCCTTTTGCT 

GTGGAGGGGAGACAGCCCCATGAGACCCAACAGCGACTTCATTTCCTTGGGCCTTCGGGA 

TGGAGCCCTCGTGTTCAGCTATAACCTGGGCAGTGGTGTGGCATCCATCATGGTGAATGG 

CTCCTTCAACGATGGTCGGTGGCACCGAGTTAAGGCCGTTAGGGATGGCCAGTCAGGAAA 

GATAACCGTGGATGACTACGGAGCCAGAACAGGCAAATCCCCAGGCATGATGCGGCAGCT 

TAACATCAATGGAGCTCTGTATGTGGGTGGAAT 

Sequence 1153 

GCNMTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACGMCTAAATTTTTTAAACTTTA 

TTTGCTGTTAAATTCTGTGMGTTTCAGTTATCTAAAATAAATATACACAAATATGAAAT 

ATMTGTTTCAGATTGCAAGGTAATATGTAATAGTAGTGTTTGTAAGATACTCTTGTCTA 

ATATTAACTAGTAGTATTTTGATTTGTACAATGTCACCCTCCCAGCAACAAGAAGAACAA 

GCTACTGAATCAGTGTCCCTTTATTACTATGGCATCAAAGATTTGGCTACTGTTTTCTTC 

TACATGCTAGTGGCGATAATTATTCATGCCGTAATTCAAGAGTATATGTTGGATAAAATT 

MCAGGCGMTGCACTTCTCCAAAACAMACACAGCAAGTTTAATGAATCTGGTCAGCTT 

AGTGCGTTCTACCTTTTTGCCTGTGTTTGGGGCACA 

Sequence 1154 

GAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCA 

CTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAA 

ACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATTCTGAGGCTTTGCA 

TGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGA 

GAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAG 

AATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAG 

TTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATTGGACTAAGACACTGGCCAT 

ACCACTGGACAGGGTTATGTTAACACCTGAATTGCTGGGTCTTGAGAGAGCCCAAGGAGT 

TCTGGGAGAGGGACCAGATTGGGGGGT 

Sequence 1155 

TGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAATGTGTGAAGACAAGGCCATCCA 

CCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGA 

CAMCTGCAAAAMTTGCCAMATGCGACTTTCTAAAMTGGAGCAGATTCTGAGGCTTT 

GCATGTCTTGGCATTTCCTTCAGGAGCTGAATGAAAAAATGCANCAAGCANATGAAGACT 

CTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAAT 

GAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCAC 

CAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATTGGACTAAGACACTGG 

CCATACCACTGGACAGGGTTATGTTAACACCTGAATTGCTGGGG 

Sequence 1156 

CCGCGGTGGCGGCCGCCCGGGCAGGTACATTGGCACGTCACGATGTCTTGAGTTTCA7TC 

ACTAGGTGGCAGCCTGCATCGTTCCACTGCAAATGACTGAAATCCCAAAACACACAATGA 

GGCTGGCTCAGGTTTGACTCTATCTTGGAAAAAMTAGGAAMCTTCATTTATGGAATAG

TTTTGAATAACCGTGGATATCACAGGTCCATTGACCTGAGCATTTCCATTTTTGGAAACG 

GGTAGAATGTTCCCCAGAGTCAACGAGGCCATGCTGATAATAGTTTCTGGAAGGGATCTC 

TGGAATTGGTCTGACCCAATTAACACACGGCCTCTGATGGGAATAGATGTATTTTGGGGA

CACATTTTAATCTGATAGCTGTMCCCCTTTTGAGTTGGCTTTTGTTCACTGGAATCCCT 

TTCCAGTCAATGAATTTCCGAGAAAAATTCAGAGGAAGAGCTGTCGGAGGCACCAGAGTG 

CTGATGTTTTCT 

Sequence 1157 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGCAGTGG 

GAAGCTCGCAGCAGCTGGGGAGGAGCCAAAGCCTCGGCGCTCACCTAAGCCGCAGGGAGA 

TACACCCAACTGGGAGATGAGGAAACAGCAACCCAGAGAGGAGAACTAACCCACACAGGA 
TCATTTCGTGAAGGAGCAAGGCTGAAGAACCAGACCTGGACTTTCTTAGGACAAACTTAC

TGCAGCTTGAAGGAGCCAACCATGGATTTGAGGCGTGTGAAGGAATATTTCTCCTGGCTC 

TACTATCAATACCAMTCATTAGCTGCTGTGCTGTTTTAGAGCCCTGGGAGCGATCTATG 

TTTMCACCATCTTACTAACCATTATTGCTATGGGTGGGTATACACTGCCTATGTCTTTA 

TTCCMTCCACATTCGCCTGGGCTTGGGMTTTTTCTTCAAMATATGTGGATATCACAG 

GNCCTCGGCCCGCTCTAGAACTAGTGGGATCCCCCCGGGCTTGCAGGGNAT 

Sequence 1158 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGCAGTGGGA 

AGCTCGCAGCAGCTGGGGAGGAGCCAAAGCCTCGGCGCTCACCTAAGCCGCAGGGAGATA 

CACCCAACTGGGAGATGAGGAAACAGCAACCCAGAGAGGAGAACTAACCCACACAGGATC 

ATTTCGTGAAGGAGCAAGGCTGAAGAACCAGACCTGGACTTTCTTAGGACAAACTTACTG 

CAGCTTGAAGGAGCCAACCATGGATTTGAGGCGTGTGAAGGAATATTTCTCCTGGCTCTA 

CTATCAATACCAAATCATTAGCTGCTGTGCTGTTTTAGAGCCCTGGGAGCCGATCTATGT 

TTAACACCATCTTACTAACCATTATTGCT 

Sequence 1159 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGCAGTGGGAA 

GCTCGCAGCAGCTGGGGAGGAGCCAAAGCCTCGGCGCTCACCTAAGCCGCAGGGAGATAC 

ACCCAACTGGGAGATGAGGAAACAGCAACCCAGAGAGGAGAACTAACCCACACAGGATCA 

TTTCGTGMGGAGCAAGGCTGAAGMCCAGACCTGGACTTTCTTAGGACAAACTTACTGC 

AGCTTGAAGGAGCCAACCATGGATTTGAGGCGTGTGAAGGAATATTTCTCCTGGCTCTAC 

TATCMTACCAAATCATTAGCTGCTGTGCTGTTTTAGAGCCCTGGGAGCGATCTATGTTT 

AACACCATCTTACTAACCATTATTGCTATGGGTGGTATACACTGCCTATGTCTTTATTCC 

AATCCACATTCGCCTGGCTTGGGAATTTTTCTCA 

Sequence 1160 

CCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTATA 

GAGGGTGTAAAAATAAACCACAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAA 

AAMTTGCCAAAATGCGACTTTCTAAAAATGGAGCANATTCTGAGGCTTTGCATGTCTTG 

GCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAGAGGGGT 

TTGGAGTCTGGAAGCCTCATCCCTTNAGCATCAAGCTGGAATGGGGAATGAAGAAtAGAN 

ATGTGGTGCCCA 

Sequence 1161

ACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCGGAAGGTGGGTGACGTGCGGA 

TCTTCTTCTTTTTGTGGCTGTGGACACCTTTCAACACTGCCTTCTTGGCCTTTAAAGCCT 

TCGCTTTGGCTTCAGCTTTAGGAGGGGCAGGAGCCCATCGCAAAACCACGCTGCGGAGAG 

AGGGGCGGGTAATGTAGCCCGGTTGAACATGAACCAGAAGGAAMTGGTTAAAGCTGAGG 

GCACTAATTCCTTACAGGCCCGGGGACATGGAGCTCCAACCAGTGGATGCATGTAGCTTC 

CCAGAACCGAATGTCTGCCCCGCGTACCT 

Sequence 1162 

CCGCGGTGGCGGCCGAGGTACCACTCTATCCATCGNGGATAGAGAAACTGAAGCTCTCTA 

AAGACCCTGCANCTGGGAGGTGGCAGAGTCAATGGCAGCCCTCAGCCCTATCTGCCCTGA 

CATGGCATTCCTCCCATTTCTCACCACCGAACCCTCTAAAATAACAATGTGTGGGGTCCT 

TGGCTGAGAGACTTCCCTTTTGGGAATCMTCTGAATGTATGATGACAAAGAAAACAACT 

TTTGCTTTATACMCCTTNTGGTTAGATTCAGGCACCAAGCAGGACACTTCTTTGTGGCG 

CTCCAAGAATCTTTCMATTCTTCATCACCAATMCAMTCTTTCTGCTTCTCTTAGAGC 

ATCTTCTCCACAATTCTCACCCTCAATTAAGAGGCACTGGAACACTTTCCAGCGGACAGG 

GTTTAGT 

Sequence 1163 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGA 
CAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGA 
AAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGA 
TTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGC 
AGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTG

GAATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGA 

AATTCCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATTGGAC 

TAAGACACTGGCCATACCACTGGACAGGGTTATGTTAACACCT 

Sequence 1164 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCNGGCACGGTNCCAAATGAAC 

GTGTGAAGACAAGGCCATNCACCACTTTATAGAGGGTGTAAAAATAAACCAGGAAATCAA 

GGGAGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAA 

ATGGAGCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAA 

TGCAACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAG 

CATCAANCTGGAATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAA 

AGGGAGCTGAAATTCCTCCACCAAGTT 

Sequence 1165 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATCATTTTCTTTNTCCTGT 

CCATAAATCTTCTTCCACCACGTGGCTGTGTNCAAGACTCTCTGAACCTNCTCTGGCTCA 

GGAGGCTTNTAGATNTGTGAATTGTCTGCTCAGTNNAACTCCATTAAATTNAATNTGGCC 

MGMNTTTCTTCTTAACAGNGGTATTGATGACCATTAATCCTTCAACCTAAACCTGCTC 

ATTAA 

Sequence 1166 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGAC 

AAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAA 

AAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGAT 

TCTGAGGCTTTGCATGTCTT 

Sequence 1167 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAACAAGCGTTTGT 

MTGTTTCCCAMTATTAGCTTTGAAATCCAAATGTCAAGCAGGTTAAAGTTCANAAAAC 

TATAMTGTAATTGTTAATAAMCMTGGATGAAAMAGTCATTGAAATTTTTTTCTACT 

TGGATTAAGAACATAAATTAAAAGTGCAACTGCAAAAATAATATTAGTTTGATAAGTAAA

ATAAAACAAACTAGATATATTTTGAMATAMAMCAAATGAAACAAAATAAAATTTAGG 

TAAAGAMATTCAACGTAATTTGTTGTAGCTATATTTTTTGTMTMTTACAAAAGTAGA 

AATAATAGCTCATAAAGCAAMACAAAATTTATTCTATGTTCTTTTTTTCAGTCMTTCA 

GAATTTT AGCTT CATATTTGMGCATTTTTT^ 

TTGCGGATTTTGATTTGCCATAAAMTATTATATTNTATTTMTTATATTMTCTTCGTC 

AGCTTTAATTGCTCTTCTTTAAAATTTGATCTGAAAT 

Sequence 1168 

CCGCGGTGGCGGCCGAGGTACCCTTGTCCTCTTCTTCAGTGACTTAAAACAATTCCAGGA 
TCAGAAGAGAAGCCAACGTGACATCCTCGATAAAACTGGGGATAAGCTGAAGTTCTGTCT 
GTTCACGAAGTGGTTGAAAAACAATTTCGAGATCCAGAAGTCCCTTGATGGGTTCACCAT 
CCAGGTGTTCACAAAAAATCAGAGAATCTTTTTNAGGGGGTGGNCGCNTTAACCCTTTTN 
NGGTTNANTGAAAATCCCCCCCCTGN 1 1 1 1 1 1 I I GAGAGGTTNAAAAA I 1 1 I II II T N AA 
AAAAAAA ACCCCCCCCCCNGGGNATTNTNAN I I I I I I I I I TNNNNNCCCCNNCAAAAATT 
MUM I AACCCCCCCCCNNAAAAAAAAAAGGG I II I I I I I I I ll I AAAAAACCCNCNGN 
TCNCCCNNNCCAN AAAAAAAAAAANGGGGGTCCCCN I I I I I I M I I NTTNNCCCCNGGGN 

nrnrm i inttctnnnnnnggggggggggaaaaaaaaanaaa i m i i i i i i i i 

TGGNGGGGGGGGNNGCNCCCNCCCCCCNCGGGGGGGGNGGNTTAAAAAAAAANGNCCCCC 

CCNGGGGGGGAAANNANTTTTANTNNNNTCCCCCCCCCCCNGGGGGGGGGG 

Sequence 1169

CCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCA 
CTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAA 
ACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATTCTGAGGCTTTGCA 
TGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGA 
GAGGGGTTTGGAGTCTGGAAGCCTCATCNCTTCAGCATCAACTGGAATGG
Sequence 1170 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCA 

CTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAA 

ACTGCAAAAAATTGCCAAAATGCGACTTTCTAAMATGGAGCAGATTCTGAGGCTTTGCA 

TGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCANATGAAGACTCTGA 

GAGGGGTTTG 

Sequence 1171 

GNGGCGGCCCGAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTATAGAGG 

GTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAAAAAA 

TTGCCAAMTGCGACTTTCTAAAAATGGAGCAGATTCTGAGGCTTTGCATGTCTTGGCAT 

TCCTTCAGGAGCTGAATGAAAAAATGCAACA 

Sequence 1172 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCATCTAGGTCAG 

TTTAAGAAGAGTCAGCTCAGAGAAAGCAAGCATAAGGGAAAATGTCACGTAAACTAGATC 

AGGGAACAAAATCCTCTCCTTGTGGAAATATCCCATGCAGTTTGTTGATACAACTTAGTA 

TCTTATTGCCTAMMAAATTTCTTATCATTGTTTCAAAAAAGCAAAATCATGGAAAATT 

TTTGTTGTCCAGGCAAATAAAAGGTCATTTTMTTTAGCTGCMTTTCAGTGTTCCTCAC 

TAGGTGGCATTTAAATGTCGCCTGATGTCATTAAGCACCATCCAAAAAGTCTGCTTCATA

ATCTATTTTCAAGACTTGGTG ATTCT GAMGTTTTGGTTTTTGT 

MAAAAAATATTCCTACTTAAATTTTAAGTCTATAATTCAATTTAAATATGTGTGTGTCT 

CATCCAGGATAGGATAGGTTGTCTTCTATTTTCCATTTTACCTATTTACTTTTT^ 

GAAAAGGAGAAAAATGAATTTCTAAAGATGGTCCCCATG 

Sequence 1173 

AGGTACCGCTGTGTCCGGGTGGGTGGTCAGAATGCCGTGCTCCAGGTGTTCACAGCTGCT 

TCGTGGAAGACCATGTGCTCCGATGACTGGAAGGGTCACTACGCAAATGTTGCCTGTGCC 

CAACTGGGTTTCCCAAGCTATGTGAG7TCAGATAACCTCAGAGTGAGCTCGCTGGAGGGG 

CAGTTCCGGGAGGAGTTTGTGTCCATCGATCACCTCTTGCCAGATGACAAGGTGACTGCA 

TTACACCACTCAGTATATGTGAGGGAGGGATGTGCCTCTGGCCACGTGGTTACCTTGCAG 

TGCACAGCCTGTGGTCATAGAAGGGGCTACAGCTCACGCATCGTGGGTGGAAACATGTCC 

TTGCTCTCGCAGTGGCCCTGGCAGGCCAGCCTTCAGTTCCAGGGCTACCACCTGTGCGGG 

GGCTCTGTCATCACGCCCCTGTGGATCGTCACTGCTGCACACTGTGTTTATGACTTGTAC 
CTGCCCG 

Sequence 1174 

AGGTACCGCTGTGTCCGGGTGGGTGGTCAGAATGCCGTGCTCNAGGTGTTCACAGCTGCT 

TCGTGGAAGACCATGTGCTCCGATGACTGGAAGGGTCACTACGCAAATGTTGCCTGTGCC 

CAACTGGGTTTCCCAAGCTATGTGAGTTCAGATAACCTCAGAGTGAGCTCGCTGGAGGGG 

CAGTTCCGGGAGGAGTTTGTGTCCATCGATCACCTCTTGCCAGATGACAAGGTGACTGCA 

TTACACCACTCAGTATATGTGAGGGAGGGATGTGCCTCTGGCCACGTGGTTACCTTGCAG 

TGCACAGCCTGTGGTCATAGAAGGGGCTACAGCTCACGCATCGTGGGTGGAAACATGTCC 

TTGCTCTCGCAGTGGCCCTGGCAGGCCAGCCTTCAGTTCCAGGGCTACCACCTGTGCGGG 

GGCTCTGTCATCACGCCCCTGTGGATCGTCACTGCNTGCACACTGTGTTTATGACTTGTA 

CCTGCCCG 

Sequence 1175 

AGGTACCGCTGTGTCCGGGTGGGTGGTCAGAATGCCGTGCTNNAGGTGTTCACAGCTGCT 

TCGTGGAAGACCATGTGCTCCGATGACTGGAAGGGTCACTACGCAAATGTTGCCTGTGCC 

CAACTGGGTTTCCCAAGCTATGTGAGTTCAGATAACCTCAGAGTGAGCTCGCTGGAGGGG 

CAGTTCCGGGAGGAGTTTGTGTCCATCGATCACCTCTTGCCAGATGACAAGGTGACTGCA 

TTACACCACTCAGTATATGTGAGGGAGGGATGTGCCTCTGGCCACGTGGTTACCTTGCAG 

TGCACAGCCTGTGGTCATAGAAGGGGCTACAGCTCACGCATCGTGGGTGGAAACATGTCC 

TTGCTCTCGCAGTGGCCCTGGCAGGCCAGCCTTCAGTTCCAGGGCTACCACCTGTGCGGG 
GGCTCTGTCATCACGCCCCTGTGGATCGTCACTGNTGCACACTGTGTTTATGACTTGTAC
CTGCCCG 
Sequence 1176

CCGGGCAGGTACAACAAGCGTTTGTAATGTTTCCCAAANATTAGCTTTGAAATCCAAATG 

TCAAGCAATTAAAGTTCAAAAAACTATAAATGTAATTGTTAATAAAACAATGGATGAAAA 

AAGTCATTGAAATTTTTTTCTACTTGGATTAAGMCATAAATTAAAAGTGCAACTGCAAA 

AATMTATTAGTTTGATAAGTAAMTAAAACAAACTAGATATATTTTGAAAATAAAAAAC 

MATGAAACAAAATAAAATTTAGGTAAAGAAAATTCAACGTAATTTGTTGTAGCTATATT 

TTTTGTMTAATTACAAAAGTAGAMTAATAGCTCATAAAGCAAAMCAMATTTATTCT

ATGT TCTTTTTTTCAGTCMTTCAGAATTTTAGCTTCATATTTGAAGCA I I I 1 1 I ICTAA 

TTTTGTTTGTGAATTTGAATTTGmGCGGATTTNGATT^ 

TATTTAATTATATTAATCTTCGTCAGCTTTAATTGCTCTTTCTTTAACAATTTGATCTGA 
AATTTGTTTTGGTGTTATTTCATAGTGATCAAATTGCATTTGATMGTTCCACGACCTGA 
TGTCATAGACCTTAATTGTGTTGAGTATCCAAACATTTC 
Sequence 1177

TAGGGCGAATTGGACTCCACCGCGGTGGCGGCCGCCCGGGCAGGTACCTACGGAAATCCT 

AACTACCACTGGCAGGAAACTGCATATNTTCTGGTTTACATGAAGANGGAGGGCTAANGG 

AAATGCCCAAAACCTTCAGAGATTGACACCGCTGTCATTNTCCATNTCNGTTCCTGGAAT 

CTACCGGGAGTNTTNATAAGMGANTTTTGCAAATNGAGGGAAGAAGCMTTGTTTTCAA 

ACTATATMCTGGAGNCCTTMTTTATMTTAGGiGGATATTTAATCAAAAATATNGTAAA 

CCATGGAGGGCCCCCTCAGNGTNCTGGATCAGGTTCAAGAAATNGAAATGNTTTTCACCC 

AAGNCANGACCCCGGCCATGTGGGCATGNTCCGGGTNCCTGGGGGTGGCNTCGNCTGGCT 

TGTGGCGAANGAACAATTAAGCCCCTTTTAAGTTTATTGMGCCCTGNGGGGAMCTTTA 

AGGGGGTTTCCANAGTTGGGGGANGAAGCANTNGGNNAGTTGGNGAAGGGCATTTTGGGG 

GGG 

Sequence 1178 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGACAAGGC 

CATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGAT 

GAAAGACMACTGCAAAAMTTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATTCTGA 

GGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGA 

AGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGG 

GGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCC 

TCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATTGGACTAAGAC 

ACTGGCCATACCACTGGACAGGGTTATGTTAACACCTGMTTGCT 

Sequence 1179 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCA 

CTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAA 

ACTGCAAAAMTTGCCAMATGCGACTTTNTAAAAATGGAGCAGATTCTGAGGCTTTGCA 

TGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGA 

GAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAG 

AATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAG 

TTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATTGGACTAAGACACTGGCCAT 

ACCACTGGACAGG 

Sequence 1180 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCTATTGCCAGGAAGATAGGCAGCTCATCTGTG 

TCCTGTGTCCAGTCATTGGGGCTCACCAGGGCNNCCAACTCTCCACCCTAGACGAAGCCT 

TTGAAGAATTAAGAAGCAAAGACTCAGGTGGACTGAAGGCCGCTATGATCGAATTGGTGG 

AAAGGTTGAAGTTCAAGAGCTCAGACCCTAAAGTAACTCGGGACCAAATGAAGATGTTTA 

TACAGCAGGAATTTAAGAAAGTTCAGAAAGTGATTGCTGATGAGGAGCAGAAGGCCCTTC 

ATCTAGTGGACATCCAAGAGGCAATGACCACAGCTCATGTGACTGAGATACTGGCAGACA 

TCCAATCCCACATGGATAGGTTGATGACTCAAATGGCCCAAGCCAAG 
Sequence 1181

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGACAAGG 

CCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGA 

TGAMGACAAACTGCAAAMATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATTCTG 

AGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATG 

AAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATG 

GGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTC 

CTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATTGGACTAAGA 

CACTGGCCATACCACTGGACAGGGTTTATGTTAACACCTGAATTGCTGGGTCTTGAGAGA 

GCCCAAGGAGTTCTGGGAGAGGGACCCAGATTGGGGGGGTA 

Sequence 1182 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGAA 

GACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAA1CAAGGGAGAAA 

GAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCA 

GATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAA 

GCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGC 

TGGAATGGGGAATGAAGAATAGNAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGC 

TGAAATTCCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATTG 

GACTAAGACACTGGCCATACCACTGGACAGGGTTATGTTAACACCTGAA 

Sequence 1183 

TCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAA 

GACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAA 

GAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCA 

GATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAA 

GCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGC 

TGGAATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCT 

GAAATTCCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATTGG 

ACTAAGACACTGGCCATACCACTGGACAGGGTTTATGTTAACACCTGAATTGCTGGGTCT 

TGAGAGAGCCCAAGGAGTTCTGGGAGAGGGACCAGATTGGGGGGTAGGTCCCGGGCTTGG 

TGATAGAMTATTTCTCGATGACTTTCTTGAGTGCAATTTGNACTGTAACATTTGCTTAA 

TCACCTT 

Sequence 1184 

ATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAGATAGTCATCTCAGTAAAAGGTCTAT 

TATCTAACTTGCCAAACTTGTTTACTGAGAGCCCTAAGGAACTAAAACNGCCATAATGCC 

GTGCACAGNTTGAAMGCMTTAGAGTAAGCAAGATTAGTTTTTCCTCCCTTCCAGTTTN 

CTCAAGCAGGCCTGGCTGAAGGCCCAGGAGGGAAGGAAATATAAGAACCAACAATAAAAA 

TAGCAATAGCAATAAGAAGAATGCCATCCCATGGAGCACACCATAATTCTGGAACCACCT 

NTCCCGGATCAGGCTTCCATTGCTCACGATGCTCACGCTGGGCAGCCGCAACTNTACTTT 

GCAGAACCTCACCAACTTGCCCAGGTNTTCTCCCCGGTCTTGAAGAAATGGCTCTCCACC 

TGAAAAGTNNGATCTTCTCCATACCAGCTTCTTAAGCAAAAGCAATCCTCTCTTTGCTTC 

CTCAAGGGGCA 

Sequence 1185 

TAGGGCGANTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGACAA 

GGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAA 

GATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATTC 

TGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGA 

TGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCNTTCAGCATCAAGCTGGAA 

TGGGGAATGAANAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAAT 

TCCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATG 

Sequence 1186

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGACAAGG 
CCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGA

TGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATTCTG 

AGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATG 

AAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATG 

GGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTC 

CTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATTGGACTAAGA 

CACTGGCCATACCACTGGACAGGGTTATGTTAACACCTGAATTGCTGGGTCTTGAGAGAG 

CCCAAGGGAGTTCTGGGAGAGGGACCAGATTGGGGGGTA 

Sequence 1187 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCA

CTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAA 

ACTGCAAMAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATTCTGAGGCTTTGCA 

TGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGA 

GAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAG 

AATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAG 

TTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATTGGACTAAGACACTGGCCAT 

ACCACTGGACAGGGTTATGTTAACACCTGAATTGCTGGGTCTTGAGAGAGCCCAAGGAGT 

TCTGGGAGAGGGACCAGATTGGGG 

Sequence 1188 

CCGCGGTGGCGGCCGAGGTACAAGATANTCATCTCAGTAAAAGGTCTATTATCTAACTTG 

CCAAACTTGTTTACTGAGAGCCCTAAGGAACTAAAACTGCCATAATGCCGTGCACAGCTT 

GAMAGCMTTAGAGTAAGCMGATTAGTTTTTCCTCCCTTTCNAAGTCCTCAGCAGGCC 

TGGCTGAAGGCCCAGGAGGGAAGGAAATATAAGAACCAACAATAAAAATAGCAATAGCAA 

TAAGAAGAATGCCATCCCATGGAGCACACCATAATTCTGGAACCACCTCTCCCGGATCAG 

GCTTCCATTGCTCACGATGCTCACGCTGGGCAGNCGCAACTCTACTTTGCAGAACCTCAC 

CAACTTGCCCAGGTATTCTCCCCGGT 

Sequence 1189 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAG 

TGTGAAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGG 

GAGGAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAAT 

GGAGCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATG 

CAACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCA 

TCAAGCTGGAATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAG 

GGAGCTGAAATTCCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAA 

AGAATTGGACTAAAACACTGGCCATACCCACTGGACAG

Sequence 1190 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATGGCGAAGCTAGA 

GACTGTAACCTGAAGATTGGGACMATTAAAGAAAAAAATGTGATTTAACACAATTACAA 

AAACTGTTACGTTAGGGTCAMCMGAACCATTTTATGAAACTGAATTACAACAAATGAC 

ATTATATCTMCTCTTCCGGGTCTCCACAACACTTATACTTACTTAAGCAGCTTAAACAC 

TTCCGAGTCTCCACAGCACTCTGATACTTACTTAAACAGCTCTTTTAACCTGCCCTAGTA 

TTCTTAAGTGCAGCATATCTAA I I I I I I I I I I CCTCAAGTAGTTTGAA 

Sequence 1191 

TAGGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGCCCGGNCAGGTACCAAATGAACGTGT 

GAAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAG 

AAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGA 

GCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAA 

CAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCA 

AGCTGGAATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGA 

GCTGAAATTCCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGAT 

TGGACTAAGACACTGGCCATACCACTGGACAGGGTTATGTTAACACCTGAA 
Sequence 1192

CGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGAAGAC 

AAGGCCNTCCACCACTTTATAGAGGGTGNAAAAATAAACCAGAAATNAAGGGAGAAAGAA 

AAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCANAT 

TCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCC 

GATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGG 

AATGGGGAATGAAGAATAGAGA 

Sequence 1193 

TGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGACAAG 

GCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAG 

ATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATTCT 

GAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGGAAAMTGCAACAAGCAGAT 

GAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAAT 

GGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATT 

CCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATTGGACTAAG 

ACACTGGCCATACCACTGGACAG 

Sequence 1194 

NGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACATAAATCACCTGGAACCTTG 

TTAAAATGCAGATCCTGACTCAGGAGGTCTGAGTTAGAGCCCAGGATTNCATATTTCTAG 

CCAGCTCCATGATGAGCTGCTGGTCCGCAGATCATGCTTGCNGGTTTTGACCAGAGTCAG 

TGTTGGTTANAGTAAGAGGATGAGGCANACATNTGGGAAAAGTCCAGCTGGGGCAAGCAT 

TTGAAGTCTGCCTTCCTACCANGTCAAAATCAAGGCAACGACCTTCCATAGATAACTATC 

AAAGCTTGAGGGGGNGCCTTGAACCCAACTCCTAAATCCCTAAGACCTGCCCACCTCTTG 

TGTCTCCTGTNTNAGCAAACATTCCCACACTCTTGCATATTGTTAAAAGTAACCTCTGCT 

TACCAGGCTTTTG 

Sequence 1195 

GCGAATTGGAGCTCCCCGCGGTGGCGGCCGGCCCGGGGGGTGTCCGAACAAGGCAGGTTG 
GTGGGTTAAGGTCTTAATCTTGACTCGAGATCTCTCTCCGGAGTTCACAGNGTNGGCGAC
GAAGCCGAAGCAGCTGGAGCGCGACCCGGAGGAGTCTGACTTCTCGTTGTCTTCATAATT 
TTCATTCGTTGCTTTCTTCGTGGACTTGCGGCTGGGGGAGGATCCCCGCTGGTCGCCGAG 

CAGGCGGGCGGGTAAAGGTAGGCCGCCGAGAGCGAGGTTAGGAGAGGAGAGGAGGCCGCA 

GTACCT 

Sequence 1196 

ATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGAGGAGGCGGAAGCGC 

AGCGGGGGCGGGAAGGTTGTAGTGCCGCGAGTTGAGCTCCTCTTGCCTAAGTGGTCGCGC 

CCCCTTTAAGAGCAGCGATTGTAAGGAGAGGCGGTCCCGGTGTCCTCGGGTCCCAGGTGA 

TTGTGAAGTGCTGACCAATTGCCACTGGACATACTTGAAACAAAATAGGAAAATGGCAGC 

AAACCCTGTCTCTAAATCAATCAATCAAGCGAGCCAGAATGCAGTAGTGGCCTGAGAGAG 

GCATCCTGGAACGCAGTGCGGTCTGGCTAGGCTTAGMGTATTCATGTGATTTTTACCTG 
ACAAGGG 

Sequence 1197 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGA 

AGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAA 

AGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGC 

AGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACA 

AGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAG 

CTGGAATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGG 

Sequence 1198 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGAGAGCG 

AGCTTCGGAGAAGCAGTGGTGGGTTCCATGTGGTGGTGGAGTAGGAGGCAGGTCTCCGCG 

GTGGTTTCCACAAGAAAAATGGCACAATGTTTCTCAGAAGACAATTACATAAGAATCAGC 
ATACTTTAAATTCACAGCAAATAATCAGACAATTGATGAAAATACTTACCCAAACACTAA

TTGTAGACTGTGCCTTCTGAATATGTTTTGTCATAAACTTGGAGTAAGGAATCCTCACAG 

GCACTGGACMTTCAAAAAACGTAMGTTTGTTTGTTAGAATACCTGGGTGCTTTTGGAT 

AGAAACCCTCATCCATATCCTGGTAAGGCTTGAAGTTGCACAGGAGTTTTCATTTGTCAA 

AACCCAGAAAACCATAAGCTTTAGATTTGGG 

Sequence 1199 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCA 

CTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAA 

ACTGCAAAAAATTGCCAAAATGCGACTTTNTAAAAATGGAGCAGATTCTGAGGCTTTGCA 

TGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTTTGA 

GAGGGGTTTGGAGTCTGGAACCNTNATCCTTTCACNTTCAACTTGAAATGGGGAATGAA 
Sequence 1200 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAANTGAAGTGTGAAGACAAG 

GCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCATAAATCAAGGGAGAANGAAAAG 

ATGAAAGACAAACTGCAAAAAATTGCCAAAATGCNGACTTTCTAAAAATGGAGCAGANTC 

TGAGGCTTTGCATGTCTTGGCATTCTTCAAGAGCTGAATGAAAAAATGCAACANGCAGAT 

GAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCNTTCAGCATCAAGCTGGAAT 

GGGGAATGAAGAATATAGATGTGGTGCCCACTAGGNTNCTGCTGAAAGGGAGNTGAAATT 

CCTCCACCAAGTNGGTATTCAAAATA 

Sequence 1201 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCACGGTACGCGGGGGTAAC 

TGAAAATCCACAAGACAGAATAGCCAGATCTCAGAGGAGCCTGGCTAAGCAAAACCCTGC 

AGAACGGCTGCCTAATTTACAGCAACCATGAGGCCACTTAAGGATGCAGCAAGAAGGAGC 

CATCTGCAATCCAGGAAGAAATTCCTTGCCAGGAACCAAATTGGTTGTCACCTTCATCTA 

GGACTTCTAGCCTCGAGAACTTACAAATGGTGATGATCAT 

Sequence 1202 

TATAGGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGMGTGTGAAGAC 

AAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCATAAANNAAGGGAGAAAGAA 

AAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCANAT 

TCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCA 

NATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGG 

AATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAA 

ATTCCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAAG 
Sequence 1203 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAATATAAAATAATA 

CATATGAAGCCACAATATTCAATTACAGTTCAAACTAGAAATTACTAACTACATGTAGCT

GCTTCATTTTTCGTTTTGCATTTTGGCCCCTAATTCATATTTACAACTGATACCTGCTGA

GMAAAGATCCAMCTTTTAMCTTTGTATGTTTTGTGGAGGGTGCACAATTTCTTCTAA 

TATATCTTCAGGTGTTTTAAMTTTMTTTGTTTTTAATCATAAGATATCATCATGGCCA 

AGAGACTGGGAAMTMCMTTTTATTCTTTCTCCTMGATTGNGATTTTATTATTCCAA 

GATCTTATGCTTGAATTACTTAGCAAGAAGGCATGATTATGCANAAGACAGGGAAATGAA 

GAGAAAAGAGCGGGAATATACGAAAATGAAGCTTCCTTAACAGAGTTCATGGTGGAGATG 

GTAGACACTGGTGGAGTTTTTTTCCAGACTTAA 

Sequence 1204 

TCGAGCGGCCGCCCGGGCAGGTACACTCTAAAGAAAGCCATGAGGATGATAATCCACTTT 

GATACTTCCAATCTGCTGGTCTTGCTGAACTCTTTGGATCATGGATATCATAAGTTCGAC

AAAATATTTTTTTTGTAGAAGCACAAATGTGAAGNGTCACTCGTTCTGAGACTTCCTCCT

CTGTGAAATTCCACAATCTCTTTCTATTTATAGACTTTTCCACAGCAAACATTAGTCTAC 

GCAGAGCATTTTGAAAATCATTTGCCAGTTCTAAAGTAGTAATAATAAATACTCCAAGAA 

CTAAAAGTCCCCCTGGTAGCATTCTGGATACCTGGCAGGCATGTTCTGTGGCCCATTC 
Sequence 1205 
NNGNCCTTTCGAGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGAAGACAAGGCCATCC

ACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAG 

ACAAACTGTAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATTCTGAGGCTT 

TGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACT 

CTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAAT 

GAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAAT 

Sequence 1206 

AGCGTGGTCGCGGCCGAGGTACATAAAACATTATTCCTTCCTTGGCCTAAAAACTCATCG 

CCACCTACATTAMGCTAATATGCCTGATTACTGTTTTTAGAGMCTTATTTTATTAGGG 

CAGTTCCAAGCTCAAAAATACGCTAACTGGCACCTTGNTAGCTACATAAAAATGCACCCT 

AGACCCGAAACTTACTAGACTCATTATAAAATTTTCTTTAAGGTGTCCACGCAGTCCCTG

GTCACACTTGAAGCAGTCCGGAGAAATATCAGCCCTACCCCAGTAATCCCCAGAAGGAAC 

TTACACTTTTTTTTMTCTTTTCCTACAACTTCATATTTTATAAATA

Sequence 1207 

CCCTTAGCGTGGTCGCGGCCGAGGTACCATCTAGGTCAGTTTAAGAAGAGTCAGCTCAGA 
GAAAGCAAGCATAAGGGAAAATGTCACGTAAACTAGATCAGGGAACAAAATCCTCTCCTT 
GTGGAAATATCCCATGCAGTTTGTTGATACMCTTAGTATCTTATTGCCTAAAAAAAAAA 
TTTCTTATCATTGTTTCAAAAAAGCAAAATCATGGAAAATTTTTGTTGTCCAGGCAAATA

AAAGGTCATTTTAATTTAGCTGCAATTTCAGTGTTCCTCACTAGGTGGCATTTAAATGTC 

GCCTGATGTCATTMGCACCATCCAAAAAGTCTGCTTCATMTCTATTTTCAAGACTTGG
TGATTCTGAAAGTTTTGGTTTTTGTC

AATTTTAAGTCTATAATTCAATTTAAATATGTGTGTGTCTCATCCAGGATAGGATAGGGT 

TGTCTTCTATTTTCCATTTTACCTAT 

Sequence 1208 

CCCTTTCGAGCGGCCCGCCCGGGCAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCAC 

CACTTTATAGAGGGTGTAAAAATAAACCAGANATCAAGGGAGAAAGAAAAGATGAAAGAC 

AMCTGCMAAMTTGCCAMATGCGACTTTCTAAAAATGGAGCANATTCTGAGGCTTTG 

CATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCANATGAAGACTCT 

GAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGA 

AGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCA 

AGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATTGG 

Sequence 1209 

AGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATA 

AACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGNCAAAATG 

CGACTTTNTAAAAATGGAGCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAG 

CTGAATGAAAAAATGCAACAAGCAGATGAAGACTNTGAGAGGGGTTTGGAGTCTGGAAGC 

CTCATCCCTTNAGCATCAAGCTGGAATGGGGAANGAAGAATAGAGATGTGGTGCCCACTA 

GGCTACTGCTGAAAGGGAGCTGAAATNTCCTTCCACCCAAGTTGGTATTTCAAAATATGT 

NATTGACTGGATANGGGCAAAAAGGATTTGGACTAAGACACTGGGC 

Sequence 1210 

GCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGACAACGGCCATCCACCACTTTATAG 

ACGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAA 

MAATTGCCAAAATGCGACTTTCTAAMATGGAGCATAATTCTGAGGCTTTGCATGTCTT 

GGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCNGATGAAGACTCTGAGAGGGG 

TTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGAATAGA 

GATGTGGNGCCCACTAG 

Sequence 1211 

CGGAAANTTGGGGGGCCCCCTTNCTTAAGAAAAGGCCATTGGCTTNCCGAAGGCGGGGGC 
CCCGCCCCAAGTTGGTTGGMNTGGGGGATTATTNCTTTGGCCAAGAAMNTTTCCGGGG
GGNGTTTTNAAGGGNGGGGGGGGGGGCCCCCCCGGGCCCCCCAAAAGGGGTNAACCCCCC 
TGGGGGGNAANAGGGGGGGAAANNTTTNNNAAACCGGNNAAAAAACCCCCCCAGGGGGGG 
CCCCCCGGGGGGGGAAAAAANCCCCCNGGGGGGGAAACCCCCCCGG
Sequence 1212 

GACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGT 

GAAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAG 

AAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGA 

GCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAA 

CAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCA 

AGCTGGAATGGGGAATGAAGAATAGAAGATGGTGGTGCCCACTAGGCTACTGCTGAAAGG 

GAGCTGAAATTCCTCCCCAAGGTTGGGTATTCAAAATATGTAATGACTGGGTATGGCAAA 

AGATTGGACTAAGACACTGGCCATACCACTGGACAGGGTTATGTTAACACCTGAATTGCT 

GGGTCTTGAGAGAGCCCAAAGGAGTTCTGGGAGAGGGACCAGATGGGGGGGTA 

Sequence 1213 

CCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTATA 

GAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAA 

AAMTTGCCAAMTGCGACTTTCTAAAAATGGAGCAGATTCTGAGGCTTTGCATGTCTTG 

GCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAGAGGGGT 

TTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGAATAGAG 

ATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGTTGGTAT 

TCAAAATATGTAATGACTGGTATGGCAAAAGATTGGACTAAGACACTGGCCATACCACTG 
G 

Sequence 1214 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAG 

TGTGAAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGG 

GAGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGTCAAAATGCGACTTTCTAAAAAT 

GGAGCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATG 

CAACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCA 

TCAAGCTGGAATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAG 

GGAGCTGAAATTCCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAA

GATTGGACTAAGACACTGGCCATACCACTGGACAGGGTTATGTTAACACCTGAAT 

Sequence 1215

TACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGA 

CAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGA 

AAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGA 

TTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGC 

AGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTG 

GAATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGA 

AATTCCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAA 

Sequence 1216 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTG 

AAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGA 

AAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACCTTCTAAAAATGGAG 

CAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAAC 

AAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAA 

GCTGGAATGGGGMTGAAGAATAGAAGATTGTGGTGCCCACTAGGCTACTGCTGAAAGGG 

AGCTGAAATTCCTCCCCAAGGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGA 

TTGGACTAAGACACTGGCCATACCACTGNCAGGGTTATGTTAACACCTGAATTGCTGGGT 

CTTGAGAGAGCCNAAGGAGTTCTGGGAGAGGGACCAGATGGGGG 

Sequence 1217 

CCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTATA 

GAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAA 

AAMTTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATTCTGAGGCTTTGCATGTCTTG 
GCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAGAGGGGT

TTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGAATAGAG 

ATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCCCAAGTTGGTATT 

CAAAATATGTAATGACTGGTATGGCAAAAGATTGGACTAAGACACTGGCCATACCACTGG 

CAGGGTTATGTTA 

Sequence 1218 

CCGCGGTGGCGGCCGAGGTACTTCTTACAGTCTTCAGGAAATTCATTAAATCAGTGCCTC

CAGTTCCTTTGGCTTCCAGTTTTGMGGGTCTTCAGAGGTCTTATTCTCCTTTGGCTGCT 

GGCTTGCAGGAATCAGGATGTACTGTTCCTGTTGGCCGAGTGGAGACTGGNGTTCTCAAA 

CCCGGNATGGTGGTCACCTTTGCTCCAGTCAACGTTACAACGGAAGTAAAATCTGTCGAA 

ATGCACCATGAAGCTTTGAGTGAAGCTCTTCCTGGGGACAATGTGGGCTTCAATGTCAAG 

AATGTGTCTGTCAAGGATGTTCGTCGGGGCAACNTTGCTGGTGACAGCAAAAATGACCCA 

CCAATGGAAGCAGCTGGCTTCACTGCTCAGGGTGATTATC 

Sequence 1219 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCCTGATGCTACAGACGAGGACATCACCTCACA 

CATGGAAAGCGAGGAGTTGAATGGTGCATACAAGGCCATCCCCGTTGCCCAGGACCTGAA 

CGCGCCTTNTGATTGGGACAGCCGTGGGAAGGACAGTTATGAMCGAGTCAGCTGGATGA 

CCAGAGTGCTGAAACCCACAGCCACAAGCAGTNCAGATTATATAAGCNGGAAAGCTTATT 

GATTANMGCMTGNGCrfTTTCCGATNTGATTGATNNGTNAAGNAACTTTTTNAAANGTN 

ANCCCTGMTTNCCNNNACCCAATTAMTTTTTNCNANCCCCTTTAAAMATTTTNCTNG 

GGNTGGGGCCCCCCCNAAAANTTAGGGNANAAAAAATATNAAANCCCCNNAAAATATTTT 

NNNTTNTTTTCTAMAAAMTAAAAAACCCNCCCNTTTTTTGGGGGGGGCATTTAAAAGG 

GGGAAAAMAATTCCAATTTTTNCCCTTTTCNTTTTANCCNAAAAAAAAAAAAAT 

Sequence 1220 

CCGCGGTGGCGGCCGAGGTACATTGGCACGTCACGATGTCTTGAGTTTCATTCACTAGGT 

GGCAGCCTGCATCGTTCCACTGCAAATGACTGAAATCCCAAAACACACAATGAGGCTGGC 

TCAGGTTTGACTCTATCTTGGAAAAAAATAGGMMCTTCATTTATGGAATAGTTTTGAA 

TAACCGTGGATATCACAGGTCCATTGACCTGAGCATTTCCATTTTTGGAAACGGGTAGAA 

TGTTCCCCAGAGTCAACGAGGCCATGCTGATAATAGTTTCTGGAAGGGATCTCTGGAATT 

GGTCTGACCCMTTMCACACGGCCTCTGATGGGAATAGATGTATTTTGGGGACACATTT 

TMTCTGATAGCTGTAACCCCTTTTGAGTTGGCTTTTGTTCACTGGAATCCCTTTCCAGT 

CA 

Sequence 1221 

ATAGGGCNAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGACA 

AGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAA 

AGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATT 

CTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAG

ATGMGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGA 

ATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAA 

TTCCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGGTATGGCAAAAGATTGGACT 

AAGACACTGGCCATACC 

Sequence 1222 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCA 

CTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAA 

ACTGCMAAAATTGCCAAMTGCGACTTTCTAAAAATGGAGCAGATTCTGAGGCTTTGCA 

TGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCMCAGGCAGATGAAGACTCTGA 

GAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAG 

AATGNAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCCCAAGG 

TTGGGTATTCAAAATATGTAATGACTGGGTATGGCAAAAGATTGGGACTAAGACAC 

Sequence 1223 

CGACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAA 
TGAANGTGTGAAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAA
TCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCT 
AAAMTGGAGCAGATTCTGAGGCTTTGCATGTCTTGGCNTTCCTTCAGGAGCTGAATGAA 

AAAATGCAACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCT 

TCAGCATCAAGCTGGGAATGGGGG 

Sequence 1224 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCA 

CTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAA 

ACTGCAAAAMTTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATTCTGAGGCTTTGCA 

TGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGA 

GAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAG 

AATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAG 

TTGGTATTCAATATGTAATGACTGGTATGGCAAAAGATTGGACTAAGACACTGGCCATAC 

CACTGGACAGGGTTATGTTAACACCTGGAATTGCTGGGTCTTGAGAGAGCCCAANGGAGT 

TCTGGGGAGAGGGACCAGATTGGGG 

Sequence 1225 

CCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTATA 

GAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAA 

MMTTGCCAMATGCGACTTTCTAAAAATGGAGCAGATTCTGAGGCTTTGCATGTCTTG

GCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAGAGGGGT 

TTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGAACAGAA 

GATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGGTTGGG 

TATTCAAAATATGTAATGACTGGTATGGCAAAAGATTGGGACTAAAGACACTGGCCATAC

CACTGGACAGGGTTTATGTTAACACCTGAANTGCTGGGGTCTTGAGAGAGCCCAANGAGT 

TTNGGAGAGGGCCAGATGGGGGGGTAG 

Sequence 1226 

AGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATA 

AACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATG 

CGACTTTCTAAMATGGAGCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAG 

CTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGC 

CTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGAATAGMGATGTGGTGCCCACT 

AGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGTTGGTATTCAAAATATGTAAT 

GACTGGTATGGCAAAAGATTGGACTAAGACACTGGCCATACCACTGGACAGGGTTATGTT 

AACACCTGAATTGCTGG 

Sequence 1227 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTCCAGGAAGTGAAGTAAAA 

CCTGGTCTTGGTTGATAGGCCCCAGGTTGGCTTGGAGCCATTCCAGGTTGAGAGGCAGGA 

GCCACAGTATAATTAGTAGGCTGAGAAGTTTGGGCAGTGTAAGTTTGTGCAGGATAATTG 

CTCGCCTGGTACTCTTGGAAGTCCACCTCGTTGTCCCTGTTGCTGTCCAAGTTGCTCATC 

AGCTTCTGGAAAGCAGCTTCACCTGTCCTTTTCCCCAAGAAGCTGGGCAGCTCCCGGGTC 

AGCAGCTCCTTTAGTTCTGACTTGTTGAGCTTGMCTTGTCACCCTCTTTGCCCGAGTAC 

CTGCCCG 

Sequence 1228 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGA 

AGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAA 

AGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGC 

AGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACA 

AGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAG

CTGGAATGGGGAATGAAGAATAGAAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAG 

CTGAAATTCCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGGTATGGCAAAAGAT 

TGGACTAAGACACTGGCCATACCACTGNCAGGGTTATGTTAACACCTGAATTGCTGGGTC 
TTGAGAGAGCCCAAGGAGTTCTGGGAGAGGGACCAAATGGGGGG
Sequence 1229 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTG 

AAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATNAACCAGAAATCAAGGGNGA 

AAGAAAAGATGAAAGACAAACTGCAAAAMTTGCCAAAATGCGACTTTCTAAAAATGGAG 

CAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAAC 

AAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAA 

GCTGGAATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAG 

CTGAAATTCCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATT 
GGACT 

Sequence 1230 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGGTCTTC 

TAGTCCGGTAAACAGAGGGCCTGCCCCCGACAGCTTCTGCTTCCGGGTCACGCCTTGACA 

GCGGCTTTCAACCCCCACCTCAGCCCAGCAATTCGTTTGGAGCATGTGAACACCTTGAGC 

CTTGATGAGTTCCAGTNTGTGGTATATTATGCAGNGCNTTCAGNGAAAATNCTTTTTNTN 

CGGGNNTTNAANNAAAAANANTNGGGTGCCATGNTNTTNCCCCCNNNNNNTNGGGGGGGG 

GCCCCCCTCAAANGGGGGGGGGGNACTATNANNNNCCCTNTTTTTTGGGGNNCNANNTNN 

ACNCCNTTTNNTTNGGGCCCCNTTTTTTGGGGGNAAAAAAACCCCCCCCCTNNGGGGGGG

GTATTTTCNTTTTNNGAAAAAAAAAGGCCCCGGGGNNGACCCCCCCCCNGGTGGGNTTAN 

ANAAAAAAAAAAAANTCNCCCNNTTNTTTTTTTTTTTAAAA

Sequence 1231 

AGGTACGCGGGGCTTTTCCGTGCTACCTGCAGAGGGGTCCATACGGCGTTGTTCTGGATT 

CCCGTCGTAACTTAAAGGGAAATTTTCACAATGTCCGGAGCCCTTGATGTCCTGCAAATG 

AAGGAGGAGGATGTCCTTAAGTTCCTTGCAGCAGGAACCCACTTAGGTGGCACCAATCTT 

GACTTCCAGATGGAACAGTACTCTTGGAAGTCCACCTCGTTGTCCCTGTTGCTGTCCAAG 

TTGCTCATCAGCTTCTGGAAAGCAGCTTCATCTGTCCTTTTCCCCAAGAAGCTGGGCAGC 

TCCCGGGTCAGCAGCTCCTTTAGTTCTGACTTGTTGAGCTTGAACTTGTCACCCTCTTTG 
CCCGAGTACCTGCCCG 

Sequence 1232 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGAAA 

CAAAAAGGAACCAGAGGCCACTTGTATATATAGGTCTCTTCAGCATTTATTGGTGGCAGA 

AGAGGAAGATTTCTGAAGAGTGCAGCTGCCTGAACCGAGCCCTGCCGAACAGCTGAGAAT 

TGCACTGCAACCATGAGTGAGAACAATAAGAATTCCTTGGAGAGCAGCCTACGGCAACTA 
AAATGCCATTTCACC 

Sequence 1233 

GCNATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAANTGAAGTGTGAAGACAAGGCC 

ATCCACCACTTTATAGAGGGNGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATG 

AAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATTCTGAG 

GCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAA 

GACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTNATCCCTTCAGCATCAAGCTGGAATGGG 

GAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTNCT 

CCCCAAGNTTGGTATTCAAAATATGTAATGACTGNTATTGGCAAAAA 

Sequence 1234 

GCNATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGAAGAC 

AAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAA 

AAGATGAAAGACAAACTGCAAAAAATTGCCAAAAATGCGACTTTCTAAAAATGGAGCAGA 

TTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGC 

AGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTG 

GAATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGA 

AATTCCTNCCCAAGTTNGGTATTCAAATATGTAATTGCTGGTATGGCAAAAGATTGGACT 

AAGACACTGGCCCTACCACTGGACAGGGGTTATTNTTTAACCCCCTGAATTTGCTTGGGT 
CTTTGAGAGAGCCCCAAGGGGGTTTTGGGAGAGGGGACCCANAATTGGGGGGTAGGTC
Sequence 1235 

TGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGACAAGGGCCATCCA 

CCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGA 

CAAACTGCAAAAAATTGCCAAAATGCCGACTTTCTAAAAATGGAGCAGATTCTGAGGCTT 

TGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACT 

CTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAAT 

GAAGAATAGAAGATGTGGTGCCCACTAGGCTACTGCTTGAAAGG 

Sequence 1236 

TGGACTCCACCGCGGTGGCGGCCGAGGNACAATACACTAGAAACCAACATAATGTATTTT

TTTTAAAACCTGTGNGAAAAAATAAATGTTCCACNAGTAGGGATAGGGGAAAAGNAACCA 

AAAGAGAGAAAGAGAAAGGAATGCTGGTTNATCTTTGTANGTNGTAATCGAATGGAGAAA

TTTGCAGTATTTTANCCACTATTAGNGAAATTTTTTTTTTTTTTGTCAAAATGANAGACT

GGMCCTCTGTTCAANATGCTTTNATTGNMCTCTGGTTTTGAAGACCGGGNNNGGNAAA 

GCAANNAAAAACGTNGGGAAACCCTNNGATGGACNTAAAGGGGCNNTGGNNGCCAAAGGG 

ACCTTGGGGGAAAANGGTCCACTTTGMTANANAAGCATGGGGNGGGGNGNATTTTTCCC

CCCCCTTTTAAAAMNATGGTNTGGMTAATTTTAAANNGGNATATTAAAAACCACCTTT 

NTT 

Sequence 1237 

GGCNAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGTGATATCCACATAT 

TTTCGATAAAAATTCCAAAGCCAGGCGAATGTGGATTGGAATAAAGACATAGGCAGTGTA 

TACCACCATAGCAATAATGGTTAGTAAGATGGTGTTAAACATAGATCGCTCCCAGGGCTC 

TAAAACAGCACAGCAGCTMTGATTTGGTATTGATAGTAGAGCCAGGAGAAATATTCCTT 

CACACGCCTCAAATCCATGGTTGGCTCCTTCAAGCTGCAGTAAGTTTGTCCTAAGAAAGT 

CCAGGTCTGGTTCTTCAGCCTTGCTCCTTCGCGAAATGATCCTGTGTGGGTTAGTTCTCC 

TCTCTGGGTTGCTGTTTCCTCATCTCCCAGTTGGGTGTATCTCCCTGCGGCTTAGGTGAG 

CGCCGAGGCTTTGGC 

Sequence 1238 

AGGGCNAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGA 

AGACNNTGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGA

AAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAG

CAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAAC 

AAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAA 

GCTGGAATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAG 

CTGAAATTCCTCCACCAAGTTGGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGAT 

T 

Sequence 1239 

AGCTCCACCGCGGTGGCGGCCGAGGTACGCGGGGGGCAGMGAGGAAGATTTTTGAAGAG 

TGCAGCTGCCTGAACCGAGCCCTGCCGAACAGCTGAGAATTGCACTGCAACCATGAGTGA 

GAACAATAAGAATTCCTTGGAGAGCAGCCTACGGGNAACTAAAATGCCATTTCACCTGGA 

ACTTGATGGANGGGAGAAAACTCCTTGGMTGATTTTGMGACAAMGTTATTTTTTACC

CGGCACTGAAGATTTNCAGCAATCCGTTGGAATNTCAAAAGGCCCACCAAANGGTGCCAA 

CNCCTACATGNGCNCTTATCNTAAANAGGCACCCCTCCAANAGGGGGNCNAATAAACGNA 

GGGNCNAAGNNNCCCTGGGNAAATTGGCCTTTACNGGNTAAAAAAGGCCTTGGANNGGAG 

GTTTTAAANTNCCTACGGCCAAAGGNAAGCCATTGGCNTGNACCCCAAGGGCCAAGGAAA 

AATCCANNTAAAAAGNTTCTTGGGNGTCCACCCCNTGGGG 

Sequence 1240 

AGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGACAAGG 
CCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGA 
TGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATTCTG 
AGGCTTTGCATGTCTTGGCATTTCCTTTCAGGGAGCCTGAATTGAAAAAAA 
Sequence 1241

AGCTCCCCGCGGTGGCGGCCGCCCGGGCNGGTACGCGGGGGAGACATTCCTCAATTGCTT 

AGACATATTCTGAGCCTACAGCAGAGGAACCTCCAGTCTCAGCACCATGAATCAAACTGC 

CATTCTGATTTGCTGCCTTATCTTTCTGACTCTAAGTGGCATTCAAGGAGTACGGGAAGG 

CGMGAAMGAATAGAGAAGATAGGGAMTTAGAAGATAAAAACATACTTTTAGAAGAAA 

AAAGATAAATTTAAACCTGAAAAGTAGGAAGCAGAAGAAAAAAGACAAGCTAGGAAACAA 

AAAAGCTTAAGGGGCAAAAATTGTACCTTCGGCCCGCTCTAGAACTAGTGGGATCCCCCG 

GGCTGCAG 

Sequence 1242 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTG 

AAGACATGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGA 

AAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCCGACTTTCTAAAAATGGA 

GCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAA 

CAAG 

Sequence 1243 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGT 

GTGAAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGG 

AGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATG 

GAGCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGC 

AACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCAT 

CAAGCTGGAATGGGGAATGAAGAATAGAGATGTGGTGCCCACTANGCTACTGCTGAAAGG 

Sequence 1244 

CCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTATA 

GAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAA 

AAMTTGCCAAMTGCGACTTTCTAAAAATGGAGCAGATTCTGAGGCTTTGCATGTCCTG 

GCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAGAGGGGT 

TTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGAATAGAG 

ATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGTTGGTAT

TCAAAATATGTAATGACTGGTATGGCAAAA 

Sequence 1245 

TGGTACTGCTAMGTCATGACAGCCCAACAGGTGATGTTTTACTGGATGAAACTCTGAAA 

CACATCAAAGCAACTGAACCCACAGAAACTGTCCAAACATGGATAGAGCTACTCACTGGT 

GAGACCTGGAACCCCTTCAAATTACAGTACTGTTCCTGTTGGCCGAGTGGAGACTGGTGT 

TCTCAAACCCGGTATGGNGGTCACCTTTGCTCCAGTCAACGTTACAACGGAAGTAAAATC 

TGTCGAAATGCACCATGAAGCTTTGAGTGAAGCTCTTCCTGGGGACAATGTGGGCTTCAA 

TGACAAGAATGTGTCTGTCAAGGATGTTCCGTCGTGGCAACCGTTNCTGGTGACAGCAAA 

AAATGACCCCACCAA 

Sequence 1246 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGT 

GTGAAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGG 

AGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATG 

GAGCAGATTCTGAGGCTTTGCATGTCTTG 

Sequence 1247 

AGGGCNAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGA 

AGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAA 

AGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGC 

AGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACA

AGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAG 

CTGGAATGGGGAATGAAGAATAGNAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAG 

CTGAAATTCCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATT 
GGACTAAGACACTGGCCATACCACTGGACAGGGTTTATTGTTAACACCTGAATTGCTGGG
GTC 

Sequence 1248 

GAGCTCCCCGCGGTGGCGGCCGNGGTACATAAAACATTATTCCTTCCTTGGGCTAAAAAC

TTTTTGCCACCTACATTAMGCTMTATGCCTGNTTACTGTTTTTAGAGMCTTATTTTA

TTAGGGCAGTTCCAAGCTCAAAAATACGCTAACTGGCACCTTGTTAGCTACATAAAAATG 

CACCCTAGACCCGAMCTTACTAGACTCATTATAAMTTTTNTTTAAGGTGTCCACGCAG 

NCCCTGGTCACACTTGAAGCAGTCCGGAGAAATATNAGCCCTACCCCAGTAATCCCCAGA 

AGGAACTTACACTTTTTTTTMTCTTTTCCTACAACTNCATATTTTATAAATAAAAAGAC

ANAAATGTCAGGCCTGTGAGCTGAAGCTTAGCCAT 

Sequence 1249 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGAGTGG 

GAAGCTCGCAGCAGCTGGGGAGGAGCCAAAGCCTCGGCGCTCACCTAAGCCGCAGGGAGA 

TACACCCAACTGGGAGATGAGGAAACAGCAACCCAGAGAGGAGAACTAACCCACACAGGA 

TCATTTCGTGAAGGAGCAAGGCTGAAGAACCAGACCTGGACTTTCTTAGGACAAACTTAC 

TGCAGCTTGMGGAGCCAACCATGGATTTGAGGCGTGTGAAGGAATATTTCTCCTGGCTC 

TACTATCAATACCAAATCATTAGCTGCTGTGCTGTTTTAGAGCCCTGGGAGCCGATCTAT 

GTTTAACGCCATCTTACTAACCATTATTGCTATGGTGGTATACACTGCCTATGTNTTTAT 

TCCAATCCACATTCGCCTGGCTTGGGAATTTT 

Sequence 1250 

GGCNAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGACAAGGC 

CATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGAT 

GAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATTCTGA 

GGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGA 

AGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGG 

GGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCC 

TCCACCAAGTTGGTATTCAAAATNTGTAATGACTGGTATTGGCAAAA 

Sequence 1251 

CTNCTATAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAA 

GACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAA 

GAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCA 

GATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAA 

GCAGATGAAGACTCTGAGAGGGGTTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAG 

CTGGAATGGGGAATGAAGAATGGAAGAATNGTGGTGCCCACTAGGCTACTGCTGAAAGGG 

GAGCTTGAAATTCCTCCACCAAGGTTTGGTATTC 

Sequence 1252 

TACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAATCTTGAGAAGGATTT 

GAAGGACAAGTTTGTGGCCCTGACCATAGATGATATCTGCTTCTCGCTCAACGACAACTC 

ACCAAACATCAGATATTCTGAGAACGCCGTGAGGATTGAGCCAAACTCCGTGAGTCTGGA 

AGACTGGTTGGACTTCTCCAGCACCAATGTGGAGAAGGCTGACAAGCAGCGGAACAACTC 

CCTGATGCTGAAAGCCCTGGTGGATCGAATCCTGTCCCAGACAGCCAATGGATCTGTGCA 

AGCCAGTGTGATTGTGGTGGACACCGGCATTCAAGAATGGGCCTGAAGGGATCAAAGGGA 

TGCCAGGGACAAGCTGGGCTTGATCATCTGGCCCAAGGTATTNGGAAAGAGATTGCTTCC 

CAGGGAAGAAAA 

Sequence 1253 

ACTNAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGAC 

AAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAA 

AAGATGAAAGACAAACTGCAAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGA 

TTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGC 

AGATGAAGACTCTGAGAGGGGTTTNGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCT 

GGMTGGGGAATGAAGAANGGNAGATTGGTGGTGCCCACTAGGCTACTGCTGAAAGGGAG 
CTGAAATTCCTCCCCCAAGGTTGGTATTCAAAAATATTGTAATGAACTGGGTATTGGCAA
Sequence 1254 

CCGCGGTGGCGGCCGNGGTACAATGATTGTCATCTCAGTAAAAGGTCTATTATCTAACTT 

GCCAAACTTGTTTACTGAGAGCCCTAAGGAACTAAAACTGCCATAATGCCGTGCACAGCT 

TGAAAAGCAATTAGAGTAAGCAAGATTAGTTTTTCCTCCCTTCCAGTTCCTCAGCAGGCC 

TGGCTGAAGGCCCAGGAGGGAAGGAAATATAAGAACCAACAATAAAAATAGCAATAGCAA 

TAAGAAGAATGCCATCCCATGGAGCACACCATAATTCTGGAACCACCTCTCCCGGATCAG 

GCTTCCATTGCTCACGATGCTCACGCTGGGCAGCCGCAACTCTACTTTGCAGAACCTCAC 

CAACTTGCCCAGGTATTCTCCCCGGTCTTGA 

Sequence 1255 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCATGGTACGCGGGGG 

GGTGGAGAGAGGCCTCTAGACTTCAGTTTCAGTTTCCTGGCTCTGGGCAGCAGCAAGAAT 

TCCTCTGCCTCCCATCCTACCATTCACTGTCTTGCCGGCAGCCAGCTGAGAGCAATGGGA 

AATGGGGAGTCCCAGCTGTCCTCGGTGCCTGCTCANAAGCTGGGTTGGTTTATCCAGGAA 

TACCTGAAGCCCTACGAAGAATGTCAGACACTGATCGACGAGATGGTGAACACCATCTGG 

GACGTCCTGCAGGAACCCGAACAGTTCCCCCTGGNGCANGGAGTGGCCATAGGNGGCTCC 
TATGGACGGAAAAC 

Sequence 1256 

TGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGACAANGGCCATCCA 

CCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGA 

CAAACTGCAAAAAATTGCCCAAAATGCGACTTTCTAAAAATGGAGCAGATTCTGAGGCTT 

TGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACT 

CTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAAT 

GAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCT 

Sequence 1257 

CTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGACA 

AGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAA

AGATGAAAGACAAACTGCAAAAAATTGCCCAAAATGCGACTTTCTAAAAATGGAGCAGAT 

TCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCA 

GATGAAGACTCTGAGAGGGGTTTGGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTG 

GAATGGGGAATGAAGAATAGAAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTG 
AAA 

Sequence 1258 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGT 

GTGAAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGG 

AGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATG 

GAGCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGC 

AACAAGCAGATGAAGACTCTGAGAGGGGTTTTGGAGTCTGGAAGCCTCATCCCTTCAGCA 

TCAAGCTGGMTGGGGAATGAAGAATAGAAGATTGTGGTGCCCACTAGGCTACTGCTGAA 

AGGGAGCCTGAAATTCCTCCCCAAGGTTGGGTATTCAAAAATATGTAATGACTTGGTATG 

GCAAAAAGATTGGGACTAAAG 

Sequence 1259 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTCTTTGTTT

TGGCACACTTTTCCTGACAAACAGCCAGTGTTCTCAATACATAAATACTAGTCCACGTTA 

ACAACAATAGCATATGAGACCGCTCTCCGTAAAGATGCCAGATTGGATGCAAATGGACTG 

GAAATACCTTGGAGGGTTTCACAAAAATAAGACAAAGGGCAAAGGAACTTTGCCAAAGGA 

GATGGAGAGCAATTCTTTAAAGATAGTGGGAGGGAGGAAGCAAAGAGCTCATAAATACAA 

GCCTCTTAAAATGGGACGCATTTGCCTCGCGCCTCTGGGGTGTCTGCAGCTCAGCNTTGG 
TGCCCCACACGGGACACCCGACTTTT
Sequence 1260 
TAGGGCNAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCTTCAATGTCATT

AACCATAAATGAGCATTTACAATCTGGATTAAATGTCACATGGTATTAAGTCTACACTTA

GAGTAATGCTTTTACTGATTTTTAAAAATATATGCATATGTTTAGTGATCGAGAAAAGTG

AAATACTGGAGTACTTTTTTTTTTTTTTTTTTTTTGGCTTGATGAGTAGGTGAGTTTATT

GGGACTTACACACAGGTCAATCCTGGGCGGCGACAAGACAGCTCTAGAGATCTGAGCTTC 

CTCCCAATGCTAAACTGCTTTCATGCTAATTTTCTGACTGTTTACTTACCCGGGGTAAGA 

GCGATGGGGACTGTTTTCATTGG 

Sequence 1261 

TNCTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGT 

GTGAAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGG 

AGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATG 

GAGCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGC 

AACAAGCAGATGAAGACTCTGAGAGGGGTTTTGGAGTCTGGAAGCCTCATCCCTTCAGCA 

TCAAGCTGGAATGGGGAATGAAGAATAGGAGATGTGGTGCCCACTAGGCTACTGCTGAAA 

GGGAGCTGAAATTCCTCCACCAAGGTTGGGTATTCAAAATATGTAATGACTGGTATGGCA 

AAAGATTGGG 

Sequence 1262 

TGGCGGCCGCCGGCAGGTACGCGGNGCAGAAGAGGAAGATTTCTGAAGAGTGCAGCTGCC 

TGAACCGAGCCCTGCGGAACAGCTGAGAATTGCACTGCAACCATGAGNGAGAACAATAAG 

AATTCCTTGGAGAGCAGCTACGGCAACTAAAATGCTTTCACCTGGAACTTGATGGGAGGG

AGAMACTCCTTGGATGATTTTGAAGACAAAGTATTTTACCGGACTGAGTTTCAGAATCG 

TGAATTCAAAGCCACAATGTGCAACCTACTGCCTATCTAAAGCACCTCAAAGGGCAAAAC 

GAGGCAGCCCTGGAATGCTTACGTAAAGCTGAAGAGTTAATCCAGCANGAGCATGCTGAC 

AAGGCAGAAATCAAAAGTCTGGTCACCTGGGGAAA 

Sequence 1263 

CGAGGTACCCAGGCCTGCAAATCTCCTGGCAGGATGGTCAGGAACTGCTCTAGCACCAGC 

AGTTCCAGAATCTGTTCCTTGGTATGGATCTCTGGCTTCAGCCACTGACAGCAGAGCTCC 

CGGAGGCGGCTCAGTGCCTCGCGAGGTCCAGGAGAATCCTGGTAGCAGAACTGCCTAAAA 

AGCTGCCTGCAGAGCTCCTGCTTAAGGAGTTCACTTCTCTGTAAGCAAGTGTCCTGCCCA 

TGGATAAATTCTTCCTCTTCTATCTTCACTATCAGAAGTCCTTCATCCTCTGGAGAGTTC 

TGGGCTGCAGCTTTCTTTGGTTCTGTAGCCATCTCTCGGACAGGGCTGATTCCGATCGGA 

CACTTCCGGTGGAAGGACTGAGCGGCGCTACACTTCAAGAATTCCGTCCACAGGGACTTG 

TGAGTCTGCGCAGAAGGCGGGATGCCTTTGGACTACGATTCCAAGAATCCTTCCTGGGTC

TCTTCGGGCGCAGACTTTTCGCCAAAGTCCTGAAGATCTCAGGGCTTGAAGGAGGGGGCA 

TCCTTCTTCTTCATTGNAGTAGTGTGTCTTGCTAAATAACAGAAGGGACTCCTGAAAAGA 

AAATGACGTTGGCCCGGGCGCGGGGGCTTACGCCTGCAAGTCCACACTTTGGGAGGCCGA 

ACNGGCNGATCACGAAGGTCAGGAGATCGANGACTATCTGGGTTACGCGGNGACACCCTG 

NGTTTCTTAAAATCCANAAAAAAAAAAAAAAAA 

Sequence 1264 

GGCGAATTGGACTCCACCGCGGTGGCGGCCCGAGGTACAGAGATTTATAATGTGCTGCTC 

TAGGTCCTATCGGGTAAAGGGATCAGCAGATGTGAAGTCAAGAGTCTCCTGTAAGATTTG

A CTTTCT TGGAMCATATTTTMTCCTGGGCCTCCTNTTTCMATCACCTATTTCTTTTA 

Gi i 1 1 1 iGCAGTGATACTGTGTGTTGCTTCTAACAGAGGTTCAGTTTCACAGCCTTTCCC 

TCAAGTGTCTTATCCTAAAAGTAAAACCTAGATGATCTAAGGTGGTGGNTTTCAACAGGG 

TGCAAATTTGCCTCCTATACTCGCAACACCCAGNGACAGTTGGCTATGTCTNGGAGACAT 

TTTTGNGTTNTCACACCTGGANNNAGGGTGGGGGAGGTGGNGCTAATGACANCAAGNTGG 

CCNTAANCCAATNATGCTGATANAAATNCTACANTGCACAAGGATAGGNTCCCACANAAC 

ANAAGNCTTANCCAAACCCCAAATACTAACAAT 

Sequence 1265 

CCGCGGTGGCGGCCCGAGGTACAAGATAGTCATCTCAGTAAAAGGTCTATTATCTAACTT 
GCCAAACTTGTTTACTGAGAGCCCTAAGGAACTAAAACTGCCATAATGCCGTGCACAGCT 
TGAAAAGCMTTAGAGTMGCAAGATTAGTTTTTCCTCCTTCCAGTTCCTCAGCAGGCCT

GGCTGAAGGCCCAGGAGGGAAGGAAATATAAGAACCAACAATAAAAATAGCAATAGCAAT 

AAGAAGAATGCCATCCCATGGAGCACACCATAATTCTGGAACCACCTCTCCCGGATCAGG 

CTTCCATTGCTCACGATGCTCACGCTGGGCAGCCGCAACTCTACTTTGCAGAACCTCACC 

AACTTGCCCAGGTATTCTCCCCGGTCTTGAAGAAATGGCTCTCCACCTGAAAAGTTGATC 

TTCTCCATACCAGCTTCCTTAAGCAAAAGCAATCCTCTCTTTGCTTCCTCAAGGGGCAGC 

ACMAGGATGTTTTGGCTGTGTGGAAACAGAAGCCCGCATTTGTAGTTGCACTGGCGAGT 

GAAGTGATAGTTGACGCTGGTTGGGGTGGT 

Sequence 1266 

CCGCGGTGGCGGCCGCCCGGGCAAGGTACTTGCTAACTTTGACGCCAGCATCTCTGAAAG 

ATCCCCATCGMGGCCGGTCATTGCAMTACAGGCTGTTCCTTTTCACCCTTGATCTGCA 

AGACATCAAGTGGAACTGTCTCTCCTTTCACAATGGCAAGTGTGGCATCAGTAATATGTT 

GGACTTTGTTTCCACTTTCGGCAAAGAGGGTATGACTCAAACTACTGGTCTCTCCCAGTG 

GGATAAATCCAATGGGAATCTTACTGAAGGTAGCCTCATCTGTTCGTCGAAGAACACCAA 

GTAACAACCTNCTGCAGTGTCCCATCTCCTTCCTGNAACAATGGATCACATTCCGGNGTT 

TTCCATCAGTTNCMGGAGGTTTNTTNGGCTTGGGCCCTTAANAAATCTGNGCTTAAACA 

AAAAGGCCCNATTCCTTCCCAAATAAAAANGGAAAAANTCGGGGGCCNCAA I I I I I 1 1 I 

Sequence 1267 

CACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGT 

GAAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAG 

AAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGA 

GCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAA 

CAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCA 

AGCTGGAATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGA 

GCTGAAATTCCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGGTATGGCAAAAGA 

TTGGACTAAGACACTGGCCATACCACTGGACAGG 

Sequence 1268 

GGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAA 

ACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGC 

GACTTTCTAAAMTGGAGCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGC 

TGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCC 

TCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAG 

GCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGTTGGTATTCAAAATATGTAATGA 

CTGGTATGGCAAAAGATTGGACTAAGACACTGGCCATACCACTGGACAGGGTTATGTTAA 
CACCTGAATT 

Sequence 1269 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTCTACAG 

AGATCCTGAAGAGATTGAAAAAGAAGAGCAGGCTGCTGCTGAGAAGGCAGTGACCAAGGA 

GGAATTTCAGGGTGAATGGACTGCTCCCGCTCCTGAGTTCACTGCTACTCAGCCTGAGGT 

TGCAGACTGGTCTGAAGGTGTACTCTTGGTTTATCAATGGGACGTTCCAGCAATCCACAC 

AAGAGCTCTTTATCCCCAACATCACTGTGAATAATAGCGGATCCTATATGTGCCAAGCCC 

ATAACTCAGCCACTGGCCTCAATAGGACCACAGTCACGATGATCACAGTCTCTGGAAGTG 

CTCCTGTCCTCTCAAGCTGTGGCCACCGTCGGCATCACGATTG 

Sequence 1270 

AGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATA 

AACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATG 

CGACTTTCTAAAAATGGAGCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAG 

CTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGC

CTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGAATAGGAGATGTGGTGCCCACT 

AGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGTTGGTATTCAAAATATGTAAT 
GACTGGTATGGCAAAAGA 



Sequence 1271

ACACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCTCAT 

TTTTA TTTTGA AMCCATTCAGCACATTTATCTTATGTAACATGCAGAGCATATATCTAT 

CTGTATTTTTAAAATTTTCCTGTTACTCATTGATACATAGTACTTCCTTGATGTTGT^ 

AGTCCGTGAGAAACATGGCGACTCGATCAATGCCCATGCCCCAGCCAGCTGTGGGGGGCA 

GCCCATATTCCAGGGCAGTACTCAMGGTGATATTTGCTTTTTTCAATGCTTCAGGGGAA 

AAATCCTTTTCTTTA 

Sequence 1272 

TAAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAANGACA 

AGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAA 

AGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATT 

CTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAG 

ATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGA 

ATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAA 

TTCCTCCACCAAGGTTGGTATTCAAAATATGTAATGACTGGGTATGGCAAAAGATT 

Sequence 1273 

AATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCGTGGTGACGTGGTTCCCAAAGATGTC 

AATGCTGCCATTGCCACCATCAAAACCAAGCGCAGCATCCAGTTTGTGGATTGGTGCCCC 

ACTGGCTTCAAGGTTGGCATCAACTACCAGCCTCCCACTGTGGTGCCTGGTGGAGACCTG 

GCCAAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAA 

AATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAA 

AATGCGACTTTCTAAAAATGGAGCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCA 

GGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGG 

AAGCCTCATCCCTTCAGCATCAAGCTGGAAT 

Sequence 1274 

ATCCCCGGCGNGGCNNNNCGCCCGTTCAGGTACTCTTTGTTTTGGCACACTTTTCCTGAC 

AAACAGCCAGTGTTCTCAACACATAAATACTAGTCCACGTTAACAACAATAGCATATGAG 

ACCGCTCTCCGTAAAGATGCCAGATTGGATGCAAATGGACTGGAAATACCTTGGAGGGTT 

TCACAAAAATAAGACAMGGGCAAAGGAACTTTGCCAAAGGAGATGGAGAGCAATTCTTT 

AAAGTTAGTGGGAGGGAGGAAGCAAAGAGCTCATAAATACAAGCCTCTTAAAATGGGACG 

CATTTGCCTCGCGCCTACTGGGTGTCTGCAGCTCAGCTTGGTGCCCCACACAGGACACCG 

ACTTTAAGTGGCTGCCTTTGCAAGGCTGAGAGGCCATGGAGGGGTTGATGCCTGAAGTGT 

CAGCGCCATCTAGTGGAAACATGGGGCATGGCCC 

Sequence 1275 

GGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGAAGACAAGGCC 

ATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATG 

MAGACAMCTGCAAAAAATTNCCAAAATGCCGACTTTCTAAAAATGGAGCAGATTCTGA 

GGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGA 

AGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGG 

GGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTNC 

TCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAA 

Sequence 1276 

AGGTACMAATTATCATCATTTAGAGTTGATTTTTTTCACCAGCCCTGAATTTTCAAAC^ 

TGTAATATGCTGTTTCACMTCTTTTTATTAAATTACTTAATGATTCCAGTCTGCAAAAT 

GAGCCATAAGACTTTGCTNCTGTTTGNTATANGATNCATNTGGANATTGGGGGNGGGNAA 

ACCATANGTAAGGTTAAACCTATCCGTCACCTGCTTCATGTAAAAGACTCCACCATTTGN

TTGGATNTATTTTTTTCTCCAGGCNACTAGTAAGMAAAAGGTGMCAAAGGTGGATTTN 

CAT CCCT NNCMANTGGGCCCTTNTGGCNCMTTCTTTTTTCANTAATCCTATGGTANAC 

CNNTTTTGGTAGATTACCTTGGTGGTNGAATTTTNAGCNGTrsrTTGGNGGC 

AAAAATTCTTTGGGGATTTAAAATTTAAANCAAAAAAACCAAACCAAATAAAANTTCTAA 

TCCACCCNTGNGGAAAATTATTTTGGAAAAGGAAAAATTTTCAGGTTAAAAAACAAAGGG 




ANTGGNTGGTTCCTTCCATTAGGTTTAAAGGGGAGGAGGNANNATTAAAAAAAATTAAAA 

AAATTGGTTCATTTTAAAACAAGGTTTNGAAATTTNAAGGGAA 

Sequence 1277 

AGGTACAAAATTATCATCATTTAGAGTTGATTTTTTTCACCAGCCCTC 
TGTAATATGCTGTTTCACMTCTTTTTATTAAATT^ 

GAGCCATAGACTTTTGCTCTGCTTGTTATAAGATCANTTGGAATTGGGNGGGGGGGANAA 

CCANTAGTTAAGNCTAAATCTAATCCGGTCACTTGGCTTCATTGTAAAAGAACCCCACAA 

TTGGTCTGAATTMTTTTTTGCCCAGGCACCANGAMGNMNATNGNTGTACCAMGNTN 

GAANTACATTCCTTGGCAAANTGGGGCCCTCTTTGGCCCAAATCC I MM I CCAATTATC 

CTTATTGGTTAAAACCCTTTTTTTGGTTAAGTNTNACCTTGGGTGGTTGGM 

GCCGGNCTTGGNNGGCCTTAATTTTGGTAAAAAATTTCT^ 

MCC CMNMNAACCANACCAAANAAATANTTCNTNATTNCACCCCTTGGGGNAAATTNA 
TTTTGGGAMAAGGAAAANATTTTCNAAGTTTAAAAAACCCAAAGGMNTGGGTNGTTCC
TCCAATTANGGTNTTAAAAGGG 
Sequence 1278 

AGGTACATTTACACAATATTAACACTAAAMTCTGTGTTTTTTTAAAACACCATAGAAGT 

CAMCCACMAAACCCAGGATCTTGTTTTAAATGTGTTTATGAAGACTGCTGCTGAGCTC 

AAMGCATTGCAGGTAATCATGACCACCTAGATGAAAGCTGGATGTTTGAAAACTCCTTC 

ATGTCCAATGMT GTAAATTTTTTC ACCTCATCCCCMGGTATTCTCCCATACTTTGTTC 

TACTTTTGACCTTC I M M M I M I GGGNCACCTCTTTTCATGGCATAAGGGCCTNGACT 

TGAGGGGGTACAGGTTCTTTTTNGTGGTWTAAAAGGAATT^ 

CTCCTTGGTTTCCTTTAATTTCCCTTTAAATTTTCCTTCAATMTTGGTAW 

TTTTTCNTTTTTAAGNGGACC 

Sequence 1279 

NCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGA 

CAACGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAG 

AAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAG 

ATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAG 

CAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCT 

GGAATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTG 

AAATTCCTCCACCAAGGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATT 

Sequence 1280 

GTGGCGGCCGAGGTACCAANTGAAGTGTGAATGACAATGGCCATCCANTANTTTATAGAG 

GGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAAAAA 

ATTGCCAAMTGCGACTTTCTAAAAATGGAGCAGATTCTGAGGCTTTTGCATGTCTTGGC 

ATTCCTTCAGGAGCTGA 

Sequence 1281 

CCGCGGTGGCGGCCGCTCGGGCAGGTACCATTCCTCTACATCCATTTGGTAGCAGAACCT 

CMGTGTAAGCAGTCAGTGTAGCATGMTATGMCTGGCTCAGTTTATCACTTCCTGTTT 

NGACCTGAAGCACCACCCAGCTATGCAGAAGTGGTAACAGAGGAACAAAGGCGGAACAAT 

CTTGCACCAGTGAGTGCTTGTGATGACTTTGAGAGAGCCCTTCAAGGACCACTGTTTGCA 

TATATCCAGGAGTTTCGATTCTTGCCTCCACCTCTTTATTCAGAGATTGATCCAAATCCT 

GATCAGTCAGCAGATGATAGACCATCCTGCCCCTTTTGTTGAAGGAACACTTGGTTGA 

Sequence 1282 

GAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATAAAACATTATTCCTTCCTTGGCC 

TAAMACTCATCGCCACCTACATTMAGCTMTATGCCTGATTACTGTTTTTAGAGAACT 

TATTTTATTAGGGCAGTTCCAAGCTCAAAAATACGCTAACTGGCACCTTGTTAGCTACAT 

AAMATGCACCCTAGACCCGAMCTTACTAGACTCATTATAAAATTTTCTTTAAGGTGTC 

CACGCAGTCCCTGGTCACACTTGAAGCAGTCCGGAGAAATATCAGCCCTACCCCAGTAAT 

CCCCAGAAGGAACTTACAC I M M M I AATCTTTTCCTACAACTTCATATTTTATAAATA 

AAAAGACAAAAATGTCAGGCCTGTG 
Sequence 1283

GGTGGCGGCGCCCGNCAGGTACCAAATGAAGTGTGAAGACCNGGCCATCCACCACTTTAT 

AGAGGGTGTAAAAATAAACCAGAAATCATGGGAGAAAGAAAAGATNAAAGACAAACTGCA 

AAAAATTGCCMAATGCGACTTTCTAAAAATGAGCAGATTCTGAGGCTTTGCATGTCTTG 

GCATTCCTTCAGGAGCTGAATGAAAAAATGCAACANGCAGATGAAGACTCTGAGAGGGGT 

TTGGAGTCTGGAAGCCTCATCCCTTCANCATCAANCTGGAATGGGGAATGAAGAATAGAG 

ATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGTTGGTAT 

TCAAAATATGNAATGACTGGTATGGCAAAAGATTGGACTAAGACACTGGCCATACCACTG 

GACAGGNTTATGTTAACACCTGAATTGCTGGGTCTTGAGAGAGCCCNAGGAGTTCTGGGA 

NAGGNACCACATTGGGG 

Sequence 1284 

CGCGGTGGCGGCCGCCCGGGCAGGTACCCCGGGAGAGCCCGCTTCCCCCTCCTCCCTGTG 

CTGTCTGCACCCGAGGAGAGCGGCCTGCCCGGAAGTGGGCCACCATATCTGGAAACTACA 

GTCTATGCTTTGAAGCGCAAAAGGGAATAAACATTAAAGACTCCCCCGGGGACCTGGAGG

ATGGACTTTTCCATGGTGGCCGGAGCAGCAGCTTACAATGAAAAATCAGAGACTGGTGCT 

CTTGGAGAAAACTATAGTTGGCAMTTCCCATTMCCACAATGACTTCAAMTTTTAAAA 

AATAATGAGCGTCAGCTGTGTGAAGTCCTCCAGAATAAGTTTGGCTGTATCTCTACCCTG 

GTCTCTCCAGTTCAGGAAGGCAACAAGCAAATCTCTGCAAGTGTTCAAAAAAATGCTGAC 

TCCTAGGATAGAGTTATCAAGTCTGGAAAGATGACCTCACCACACATGCTGTTGATGCTG 

TGGTGAATGCACCAATGAAGATCTTCTTGCATGGGGGAGGCCTGGCCCTGGCCCTGG 
Sequence 1285 

CGCCGGGCAGGTACCAAATGAAGTGTGAAGACANGGCCATCCACCACTTTATAGAGGGTG 

TAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTG

CCAMATGCGACTTTCTAAAMTGNGCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCT 

TCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTC 

TGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGAATAGAGATGTGGTG 

CCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGTTGGTATTCAAAATA 

TGTAATGACTGGTATGGCAAAAGATTGGACTAAGACACTGGCCATACCACTGGACAGG 
Sequence 1286 

TCGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGCCCGGGCAGGTACAGGTAAGATATA 

CTGGAGTCACAGAGCAATATGCATTAACAGGATACAACAGTTCATAAAAACTGAGTAACT 

ATGCACACAAATTTCTTAAACAGCCACCTAAAGAGAAAATGCACAGATGTATGGTGGAAA 

CTGTATCTAACACTGAAACTACTACAGGACTCCATCAATGAGTCCAACTTTTAGTGATAA 

AAAACTACTGTACACTACATGAAGAACCATATGTTTATAATTATCCAAATAAAAATGAAG 

TTATTAAACTTC AAGAT MTATGGTAATTTGCATTGMCCGATGATTTTACAAAATTCTG 

CMAGGTCAMATTTTAAA AGATG GCTGAACAGTAATTGCAGCATCTAATAAAAACGCAG 

CTCATTACCGAGCMACGGTTTTAATTAAAAATTCAAAAGGAATAATCCTGACAGGAGAA 

ATAMAAAATAGATGTCAMAGAAGATAAMTTATTTTCAAAGGAGTAGTAACTCAAGTT 
TTAACACC 

Sequence 1287 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCA 

CTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAA 

ACTGCMAAAATTGCCAAAATGCGACTTTCTAAAAATGNGCAGATTCTGAGGCTTTGCAT 

GTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAG 

AGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGA 

ATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGT 

TGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATTGGACTAAGACACTGGCCATA 

CCACTGGACAGGGTTATGTTAACACCTGAATTGCTGGGTCTTGAGAGAGCCCAAGGAGTT 

CTGGGAGAGGGACCAGATTGGGGGGTAGGT 

Sequence 1288 

CCGGGCAGGTACAAGATAGTCATCTCAGTAAAAGGTCTATTATCTAACTTGCCAAACTTG 
TTCACTGAGAGCCCTAAGGAACTAAAACTGCCATAATGCCGTGCACAGCTTGAAAAGCAA

TTAGAGTMGCMGATAGTTTTTCCTCCCTTCCAGTTCCTCAGCAGGCCTGGCTGAAGGC 

CCAGGAGGGAAGGAAATATAAGAACCAACAATAAAAATAGCAATAGCAATAAGAAGAATG 

CCATCCCATGGAGCACACCATAATTCTGGAACCACCTCTCCCGGATCAGGCTTCCATTGC 

TCACGATGCTCACGCTGGGCAGCCGCAACTCTACTTTGCAGAACCTCACCAACTTGTCCA 

GGTATTCTCCCCGGTCTTGAAGAAATGGCTCTCCACCTGAAAAGTTGATCTTTCTCCATA 

CCAGCTTCCTTAAAGCAAAAGCAATCCTCTCTTTGCTTNCTCAAGGGGCAAGCACAAAGG

GATGTTTTTGGCTGTGTGGAMCAGAAAGCCCGCATTTGTAGTTTGCACTGGCCAGTGAA 

GTGATAGTTGACCCTGGTTGGGGTGGGGG 

Sequence 1289 

CCGGGCAGGTACCAAAATTGTAAGAAGAAGCTTGGGAAGCTGCCACCTCAGTATGCCCTG 

GAGCTCCTGACGGTCTATGCTTGGGAGCGAGGGAGCATGAAAACACATTTCAACACAGCC 

CAGGGATTTCGGACGGCTTGGAATTAGTCATAAACTACCAGCAACTCTGCATCTACTGGA 

CAAAGTATTATGACTTTAAAAACCCCATTATTGAAAAGTACACAGGAGGCAAAGTGTTTC 

ACATCATAGACTTCACTTCCMCTCCTTGGMTGTTCATTTCTTTGGCTTACAGGAGAGA 

CTAGACAGGAAGGCCAGGCAATGCTTAGGCAACTAAAATGAGGTTGGGGGTAATGCTAAC 

GTCACCCTCACAGGGATGGCCACGGGGACTGTTATTCGCMGCTGGTTTTCTAGGCCTGT 

TAGCTGGAAGCATGGTGAGCACCATTTCTGGACGCTCAGGCCGTGTCGGGCTTNAAGTCA 

TCTTNCACCACACAGGTACCTTNGGGCCGGNTCTAGNAACTAGTGGGATCCCCCCGGGCT 

GGCAGGAAATTCGAATATCAAAGCTTTATGGATACCCGTCCGACCTTCGANGGGG 

Sequence 1290 

AGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTATAGAGGTTGTAAAAATA 

AACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATG 

CGACTTTCTAAAAATGGACAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGC 

TGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCC 

TCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAG 

GCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGTTGGTATTCAAAATATGTAATGA 

CTGGTATGGCAAAAGATTGGACTAAGACACTGGCCATACCACTGGACAGGGTTATGTTAA 

CACCTGAATTGCTGGGTCTTGAGAGAGCCCAAGGAGTTCTGGGAGAGGGACCAGATTG 

Sequence 1291 

AGGTCATAAMCATTATTCCTTCCTTGGCCTAAAAACTCATCGCCACCTACATTAAAGCT 

MTATGCCTGATTACTGTTTTTAGAGMCTTATTTTATTAGGGCAGTTCCMGCTCAA^ 

ATCGCTAACT G GCA CCTTGTNGTAC ATAAAAATG CACCCTAG ACCCG AMCTTACTAG AC 

TCATTATAAMTTTTCTTTAAGGTGTCCACGCAGTCCCTGGTCACACTTGAAGCAGTCCG 

GAGAAATATCAGCC CTAC CCCAGTAATCCCCAGAAGGAACTTACACI MUM I AATCTT 

TTCTACAACTTCATATTTTATAAATAAAAAGACAAAAATGTCGGGCCTGTGAGCTGAAGC 

TTAGCCATTGTAACCCCTGTGACCTGCACATATCCGTCCAGGTGGCCTGCAGGAGCCAAG 

AAGTCTGGAGCAGNCGAAAAACCACAAAGAAGTGAAACAGCCAGGTTTCTGNCTTAACTA 

ATTAACCCAC 

Sequence 1292 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATAAAACATTATTCC 

TTCCTTGGCCTAAMACTCATCGCCACCTACATTAAAGCTAATATGCCTGATTACTGTTT 

TTAGAGAACTTATTTTATTAGGGCAGTTCCAAGCTCAAAAATACGCTACTGGCACCTTGT

TAGCTACATMAMTGCACCCTAGACCCGAAACTTACTAGACTCATTATAAMTTTTNTT 

TAAGCTGTCCACGCAGTCCCTGGTCACACTTGAAGCAGTCCGGAGAAATATCAGCCCTAC 

CCCAGTAATCCCCAGAAGGAACTTACAC I I M M I I AATCTTTTCCTACAACTTCATATT 

TTATAAATAAAAAGACAAAAATGTCAGGCCTGTGAGCTGAAGCTTAGCCATTGTAACCCC 

TGTGACCTGCACATATCCGTCCAGGTGGCCTGCAGGAGCCAAGAAGTNTGGAGCAGCCGA 

MMCCACAAAGMGTGAAACAGCCAGTTTCTGCCTTAACTAATTAACCCACCTTACGAC 

ATTCCACCATTATGACTTTGTCCACCATTATGACTTGTTCCTGGCCTGCCCCAACTG 

Sequence 1293 
CCGGGCAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTATAGAGGGTGTA

AAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCC 

AAMTGCGACTTTCTAAAAATGACAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCA 

GGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGG 

AAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGAATAGAGATGTGGTGCCC 

ACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGTTGGTATTCAAAATATGT 

AATGACTGGTATGGCAAAAGATTGGACTAAGACACTGGCCATACCAGTGGACAGGGTTAT 

GTTAACACCTGAATTGCTGGGTCTTGAGAGAGCCCAAGGAGTTCTGGGAGAGGGACCAGA 

TTGGGGGGTAG 

Sequence 1294 

CGAGGTACCGCTGTGTCCGGGTGGGTGGTCAGAATGCCGTGCTCCAGGTGTTCACAGCTG 

CTTCGTGGAAGACCATGTGCTCCGATGACTGGAAGGGTCACTACGCAAATGTTGCCTGTG 

CCCAACTGGGTTTCCCAAGCTATGTAGTTCAGATAACCTCAGAGTGAGCTCGCTGGAGGG 

GCAGTTCCGGGAGGAGTTTGTGTCCATCGATCACCTCTTGCCAGATGACAAGGTGAGTGC 

ATTACACCACTCAGTATATGTGAGGGAGGGATGTGCCTCTGGCCACGTGGTTACCTTGCA 

GTGCACAGCCTGTGGTCATAGAAGGGGCTACAGCTCACGCATCGTGGGTGGAAACATGTC 

CTTGCTCTCGCAAGTGGCCCTGGCA 

Sequence 1295 

CGAGGTACCTGTGAAGACAGCTACACCTGGTTTCCTCCCTCATGCCTTGATCCCCAGAAC 

TGCTACCTTCACACGGCTGGAGCACTCCCAAGCTGTGAATGTCATCTCAACAACCTCAGC 

CAGAGTGTCMTTTCTGTGAGAGMCAMGATTTGGGGCACTTTCAAAATTAATGAAAGG

TTTACAAATGACCTTTTGMTTCATCTTCTGCTATATACTCCAAATATGCAAATGGAATT 

GAAATTCMCTTAAAAMGCATATGAMGAATTCMGGTTTTGAGTCGGTTCAGGTCACC 

CAATTTCGAAATGGAAGCATCGTTGCTGGGTATGAAGTTGTTGGCTCCAGCAGTGCATCT 

GAACTGCTGTCAGCCATTGAACATGTTGCCGAGAAGGCTAAGACAGCCCTTCACAAGCTG 

TTTCCATTAGAAAGACGGCTCTTTCAGAGTGTTCGGAAAAGCCCAGTGTAATGACATTGT 

CTTTGGATTTGGGT 

Sequence 1296 

CGAGGTACAGGAGCAACCTTCTTTCCACCATTACTGGGAATTCCACCACTATTTGCTCCC 

CCAGCCCAGAATCATGATTCTTCTTCATTCCATTCAAGGACTTCGGGAAAAAGTAATCGA 

AATGGTCCCGAAAAAGGTGTAAATGGGTCAATAAATGGAAGTAATACATCATCTGTAATT 

GGTAT CAACAC ATCTGT AC I I I I I I I I I I I I I I I I I I I I I ATCTAAAAGCAACATAATTA 

TTTTTCTTGCGATTTTTTCAAGAACTCTTTTTAATTC 

AGCTTCTGATT 

Sequence 1297 

CGCCCGGGCAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTATAGAGGGT 

GTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAAAAAATT 

GCCMAATGCGACTTTCTMAAATGANAGATTCTGAGGCTTTGCATGTCTTGGCATTCCT 

TCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTC 

TGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGAATAGAGATGTGGTG 

CCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGTTGGTATTCAAAATA 

TGTMTGACTGGTATGGCAAAAGATTGGACTAAGACACTGGCCATACCA 

Sequence 1298 

CCGCGGTGGCGGCCCGCCCGGGCAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACC 

ACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACA 

AACTGCAAAMATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATTCTGAGGCTTTGC 

ATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTG 

AGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAA 

GAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAA 

GTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATTGGACTAAGACACTGGCCA 

TACCACTGGACAGGGTTATGTTAACACCTGAATTGCTGGGTCTTGAGAGAGCCCAAGGAG 
TTCTGGGGAGAGGGACCAGATT
Sequence 1299 

CGCCCGGGCAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTATAGAGGGT 

GTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAAAAAATT 

GCCMAATGCGACTTTTAAMATGGAGCAGATTCTGAGGCTTTGCATGTCTTGGCATTCC 

TTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGT 

CTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGAATAGAGATGTGGT 

GCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGTTGGTATTCAAAAT 

ATGTAATGACTGGTATGGCAAAAGATT 

Sequence 1300 

CGNCCGCCAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTATAGAGGGTG 

TAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTG 

CCAAAATGCGACTTTCTAAMATGGACAGATTCTGAGGCTTTGCATGTCTTGGCATTCCT 

TCAGGAGCTGAATGAAAAAATGCAACAAGCANATGAAGACTCTGAGAGGGGTTTGGAGTC 

TGGAAGCGTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGAATANAAGATGTGNT 

GCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGTTGG 

Sequence 1301 

CNAATTGGAGCTCCCCGCGGNGGCGGCCGAGGTACAGTATGGCTTAAAAGGCTCTGCCTT

AGATTCTAGAATCCAGAACATTTTCTCCAAAGACAATCAGGGTATGGGGGAGAAGTTAGT 

TCCAGAGAAGAGAGCGAGTCCAGGGTAGAAGGGATTCTTCTCTCCTGAGGGTCTATGGTC

TCCCAI 1 1 1 I IAAAGCAGCAGNGGTATCTATCCCACTCATGGCCTAGAGGTTGCACAGAG 

CTGTCTGGCACCCGCTTCTTTGGCTTTTCTCTCCTGACACCCAGCAATGCTTACTCAGAG 

CGTTGAAGGCGGCCAGCACCTCGAAAGAGATTCTCTGATTTTTTGTGAACACCTGGATGG 

TGMCCCATCMGGGACTTCTGGATCTCGAAATTGTTTTTCAACCCTTCGTGAACAGACA 

GAACTTCAGCTTATCCC 

Sequence 1302 

CCGGGCAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTATAGAGGGTGTA 

AAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCC 

MMTGCGACTTTCTAAAAATGGAGCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTT 

CAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCT 

GGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGAATAGAGATGTGGTGC 

CCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGTTGGTATTCAAAATAT 

GTAATGACT 

Sequence 1303 

CCGGGCAGGTACTACCATGCCGGGCCAA I I I I I I I I I I GTTTGTAGAAATGAGGTCTTGC 

TATGTTGCCCAGGCTGGTCTCAAACTCCTGGTCTCAAGCGATCCCCCCGCCTCAGCCTCC 

CGMG TGCTGGGATAAMGGCGTGMCCACCATACCCAGCCAGTATTATCTTTTCATTTC 

ATTTTCCAGT TGAGTATAT TATTGGCTACATTTGCATACCGCACAATTGTTCATTTT^ 

AAAACCAATATTTTGTTTTGTTCTGTTGTCTACAATAAGGAGAATTCAGATGATAAACTT 

ACAACCAATCATGGCCAAGTCCACTTGAGGAATTGTCTCTGTAGATTTATCTGTAGACTC 
CCTAATA 

Sequence 1304 

AGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTATAGAGAGTGTAAAAATA 

AACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATG 

CCGACTTTCTAAMATGGAGCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGA 

GCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAG 

CCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGAATAGAGATGTGGTGCCCACT 

AGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGTTGGTATTCAAAATATGTAAT 
GACTGGTATG 

Sequence 1305 

AGGTATTCGACCCACGCGCCCGTAGTTTTTATCTTTGACCAACCGAACATGACCAAAAAC 
CAAAAGTGCATTCAACCTTACCAAAAAAAAAAAAAAAAAAAAGTnnnnnnnnnnnnnnnn

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnAnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnGCTGTTTCCTGNGTAGAANATTGTTATCnnnnnnnnnnnnnnnnn 

nnnnnnnnnNAGCCCGGGGAGNCATAAAAGTGTAAAAGCCTGGGGGTGCCTTAATGAAGT 

GGAGCTAACCTCCACATTTMTTTGCGTTTGCnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnCTGTCGGTGGCCAGCTGCATTAATGAAATCGGGCCAAACGCCGCGGGGGAAAGAAGG 

CCGGTTTGCGTTATTNGGGCCGCTCTTCCCGCTTTCCTCGCTCACTGGACTCCGCTGNCG 

CTCGGGTCNTTACCNGCTTGCGGGCCGAAGCCGGGTAATTCAAGCTTCAAC 

Sequence 1306 

AGGTGTTAGTTACCACTTCATTACTGGAGGGCACTGTCACAAACTTCTGACTATCCAGAC 

TTGAAGCTGGAAGCAAATACAAGTCTGAGGGGCTCTAAGCTGGGAGGTTCTGGCCTCTCC 

CTAGCTCTCTATGGCTCTACCTCTCTGCTTGAAGCTCCCTGCACTGCACTCCCATTACTC 

TGACTGGGGATAGGACCACTGCTGACAGGGCCCCACCTCAACTTCTTTCATTGCCCTCTT 

CCAGGAAATCCCACCCTGGGATACTTCAAAGACCTCATATGCTACAAAGATCAAGGCCAC 

CTAATGAGTGCTCTAGAGATCAGCACCAAAGATGCTTGCCAGAGTCTTCTCTATATGTCC 

TCCCTCCTGTATCAAATAAGTCACCAAGTATAGCTGACTATCTCCATAACATCTCTCCAG 

TTGTGGCCTTAGTTCCAGCCTTCATCACTCTGCATTCTGAGGAGCTGATATCTGTCTCCT 

GCTGCCAGCCCCTCACCCCCAGTCGGTCTCCAACATACI I I I I I I I I I M 1 1 I ICGGACN 

CGTGGGTCGAAAGCCTTGACCCTGCC 

Sequence 1307 

AC I I I I I I I I I I I I I I I I AAGCTGCTCCTTGAGGATAAGGGCTAACTCACAGGCAGTGCA 

CCAAGAGCCACTATAAAAAGATCCTTAATGAGCAAAATATATCCCCTATTATTTTCCTAC 

AAGTTGCTTTTTACTTGAGTAGGAACCCTTGATTGATTTTTGCGGACGCGTGGGTCGAAG 

CTTGACCT 

Sequence 1308 

ACTTAGGGCGAA TTGGAGCTCCCCGCGGTGGCGGCCGAGGTCGGCCGCAC I I I I 1 1 I I I I 

I I I II I I I I I 1 1 I I I I GCCTTC AAC AAAAAAGGAATTTATTGGCTCACATAAGNG AAAGC 

TGANAAATAGATCTCAATTCAGGTCCAATTTGATCCANAAGTTCCCAAGGGCTCCAATAG 

ACTCCCTNTCACCCTGGTACCTGCCCGGGCGGCCGGCCGCCCGGGCAGGTGTCAAGCTTC 

GACCCACGCGTCCGTCTTANCCTACAAGGNGGCTNTTATGTACATAGNTGGGTAATACAT 

CCAATTAATGATGT NTGA CATGCTATTTTTGTAGGGAGAAMTATGTGCTMTGATATTT 

TGAGTTAAMNATCTTTTGGGGAGGATTTGCTGAMAGTTGCACTTTTGTTACAATGCTT 

ATGCTTGGTACAAGCTTATGCTGTTTTAMTTATTT^ 

AGAAACCANCTGGTTTAGAMAGTTTNNTATGTGACGATAAAACTAAGAAATTTCCCTTN 

TTATTCCTAGTATTTTTCAGCACTCCATNAATTCTATTACCCTAA 

Sequence 1309 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCGT 

CCGCAAGACCCAAGAGAGCTGATGGTGCAGTTCCAGTCTCACAGAAGTCTGCTGGAGAAT 

TTCCTCTGGATTGGAGAAGGTGGTCTTTTCATTCTACTCAGGTCTTCTATTGATTGGATA 

MGGTGCACATACATCATGGNGGGCMTNTTTTTTACTCANAGGACATCTGTGCCAATGT 

TATTCCCATNCAGMACAGCCTCCAAGTTGACACATNAAATTTAACCAATACACTTTTCC 

TTATCTTNACCCAAAANTTAAGGTTAAAGAAGTTTTNGGATNNCCCTGAATAANNNATTG 

GCCA TTNGGAATGCCCACTTTrrNCCTTTTTTGGGNGAAAAMTTNTAAAATTNACCNNT 

TTTTNTNMMNTAMAMAACTTGTTGGGCTTANGGNCCACCTNGGTTTNCANCCTTAT 

TAATTCCCAGCCCNTTTTTNGGANGGNCCCNANNGNCAGGNNNGGATTCCCTTTGNGCCC 

CCCGG GAANTTTCMNNNNTTTNNGCCCNGGGGNTAANAAANAAGGNGNAAATCCTNTTT 

TTTTTCANAAAAAAAAAAAAA 

Sequence 1310 

GGGGGGGCATTTCACGCTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGG 
GCAGGTCGCAC I I I I I I I I I I I I || I 1 1 I I I ACCTAGCCCATGTTACCAATCTAAATGAA 
CTGTCTATGCATAGAMACTGCTTTATGCCTAAGATAATTACTGGGATTTAAGAAAGTGA
GAAAAAAGAATAGGTGGGATTGAGAAATTAGGTAAAAACAGAAGAGGCCAACTAAACCCA 
AGTGCTGCCCTTCAAGGGCTCTAGTAACCGGACGCGTGGGTCGAAGCTTGACCT 
Sequence 1311 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTAATCCCGTCTT 

ACAGMGAGAMACTGAGATTTAGCAACATAAAAGTATTTCCCGTAAGTAAACAGTAGAG 

CCAAGATCTTGACCTACGCCATCTGATACCTGAGCCCATGCTATAAAAGAGGAGCATTAG 

AAATATTTGAMGATAGAAATGAGMCTAGTCAATATTTATTTTGCTTAGCACTGTATTC 

AGTATTATGGCATCTTAAAGTAGTTMGACTCAATATTTCATCAAAAAAGTTTAAAATCT 

AATCAGAGAAT 

Sequence 1312 

TACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCG 

TCCGGCCACATTTTCMTTTAGCCATTTTTCTCTTATTCACCTTTTTCTGCTM 

TGTAATTCCACTAAGAAAAGTCAATAGATAATTCCAATAATGACTTCACTCCTGAGAATT 

TTATTAGCTGCTMCGCTTGTCTCATCATMGCACTCATATGTTCATTGAGTAAATATTT 

ATTGAGTATTTGCTATGGTCCAGGCACTGTGCTAAGTATTGAGGATAAAATGGTGATTGA

MCATTTTCCCTTCTTGATTTTAACATCTACAAAATAAAAAGTATGTTAATATCAAAAAA 

AAAAAAAAAAAAAAAAAAAAGGN 

Sequence 1313 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAAAGCCTCACTCACAATTATT 

TTTGAGATAGCTCCCAATGAGTTTAATCACTGCTATGCCAGGTGTGTGAGGCTGCTGTGG 

GACAACAATCTTGATTCCTAGAAGAGTCATAAATTTCTAGGGACTACAGGCTCCTGCCAC

CATGCCTGGCTMTTTTTGTAGAGATGGGGTTTCACCATGTTGCCCAAGCTGGTCTTGAA 

CTTCTGAGCTCAAGCGATCCACCTGCCTCAGCCTCCCAAAGTGCTGGGATTACAGGTGTG 

GGCCATCACGCTTGCCCTAGAGTAATATTCTCTATTATCAAGGTAGAAAGTTCAACATAT 

ATTCATTAGATCTACTTTATAGATACTGTTACTCAGATCACTTATATCGTTATATGTATT

TTTTGTCTTCTTMCTTCMGTCTTGGATGAGAGMGAGGTGTTTTAAATTTCTCTGTTA 

TTTCTAGGGTTCTATTCATTTT 

Sequence 1314 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTAGATGACAACATC 

AAAACATACTCTGATCACCCCGAGAAAGTAAACAAAGATGATGAGGAATTCATAGAAAGC 

AATAAAATGCATGCTATTAATGGAAGAATGTTTGGAAACCTACAAGGCCTCACAATGCAC 

GTGGGAGATGAAGTCAACTGGTATCTGATGGGAATGGGCAATGAAATAGACTTACACACT 

GTACCTGCCCGGGCGGCCGGCCGCCCGGGCAGGTCCGGGCAGGTGCTGTGAGTGCTCTGG 

CGMGTTTGGAGCCCAGAATGAAGAGATGTTACCCAGTATCTTGGTGTTGCTGAAGAGGT 

GTGTGATGGATGATGACAATGAAGTAAGGGACCGAGCCACCTTCTACCTAAATGTCCTGG 

AGCAGAAGCAGAAGGCCCTTAATGCAGGCTATATCCTAAATGGTCTGACTGTGTCCATCC 

CTGGTCTGGAGAGGGCTCTGCAGCAAGTACCT 

Sequence 1315 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCCGCAGAT 

CAGACGTGGCGACCCGCTGAATTTAAGCATATTAGTCAGCGGAGGAAAAGAAACTCTGAA 

TCCGACCAGTGTAGGTGATTACATTAGCCTTTGAAGTCAACACAAAGTTTAAAACACCTG 

CCCGGGCGGCCGGCCGCCCGGGCAGGTGTACAAGCTTCGACCCACGCGTCCGGCTGAAGA 

CATCCCTAGGGCAGGTAGCAGAATACCTAATTCAACCTAGAGAGGCACAGGCTGCACGAG 

AGTCTCTCAGATAAAGCCCCATTGAAMTMATTTACMTCTAAAATTTAAAAACCCGTT 

AAAAAAGCAGCACAGCATGAGGAGTCAGTAGATACACTGAAAGCAAGATTAGATCTTCAA 

GACTTTCAAACTATAAAATTTAGAAAATTATAATAAATTATGAAATAGAGGCCCTTTCAT 

GTCAAAAAGTCATGAAAG 

Sequence 1316 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCCGCG 
AAAAGATGAGGCAACAAGTAAGAGAAAACAGCATTGAGCTTAGAGAATTGGAGAAGAAAT 
TAAAAGCAGCTTACATGAATAAAGAAAGGGCAGCTCAGATTGCTGAAAAGGATGCCATTA

AATATGAACAAATGAAACGTGATGCTGAAATAGCCAAAACCATGATGGAAGAACACAAGA 

GAATAATAAAGGAAGAGAATGCTGCAGAAGACAAACGAAACAAAGCGAAAGCACAGTACC 
TGCCCG 

Sequence 1317 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAGCGTGCATAG 

GGACTCTTGCCTTAAGGAGTGTAAACTTGATCTGCATTTGCTGATTTGTTTTTAAAAAAA 

CAAG AAATG CATGTTTCAMTAAAATTCTCTATTGTAAATMMTTTTTTCTTTG G ATCT 

TGGC AAAAA AAAAAAAAAAAAAAAGTGCGGCCGGCCGCCCGGGCAGGTACACTTGTTGAT 

AAGAGTTTTCTGAAAACAGTCTATCAAATATAAAGAATGGTTTCTATCCAAGAATCAGCA 

GTGAGGGAAGAAATACTAAACACCTGTCAAGAAATCAGTTATTCATTTTAAAAAATAACA 

GAACCAGTGCTGCTCTCTGTCATAAAAMGAACATGTAAMTTTATTTTTATAGGCTTTG

GTAACATTATATTCCCCACAGAGGCCTTCAATCCTACTTAAAGATA 

Sequence 1318 

AGGTCAAGCTTCGACCCACGCGTCCGGTTACTAGAGCCCTTGAAGGGCAGCACTTGGGTT 

TAGTTGGCCTCTTCTGTTTTTACCTAATTTCTCMTCCCACCTATTCTTTTTTCTCACTT

TCTTAMTCCCAGTAATTATCTTAGGCATAAAGCAGTTTTCTATGCATAGACAGTTCATT 

TAGATTGGTAACATGGGCTAGGTAAAAAAAAAAAAAAAAAAAAAAGTGCGACCTGCCCGG 

GCGGCCGGCCGCTCGTGATCTAGATCGCCGACCT 

Sequence 1319 

GCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCGTCCGGTTT

GGGTGGMTTATMTATTTTAGATAAGATTTAAGAGGATTCTAATTCTAGCTACTTGATA 

GGAATGCGAATGATGATMGGCTTTTAGAGTTAGATAAGAGAGAGGGCTAGCACCCTGAT 

ATTCTGTMTTGAMCAGAGTTTCAAGTCCTTTGGTCAAGTATTACCCTTATTCCTTCAG 

GAATAGTAGATATTTTAAGATTACAGATAGGTTATCTTATCTAATTTACCTACCTATTGT 

TGAAATTATTTAATTTGCATTTAACTGTGTTTTCACACCTGCCCGGGCGGCCCTCTTACC 

TGCTTCTGACCTTATGCTCAAGAACTCCCCTAACTCTGGCCAGAGCTCAGCTTTGGCAAC 

TCTGACCGTTGAGCAGCTCTCATCCCGGGTTTCCTTTACGT 

Sequence 1320 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCAC I I I I I 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 

AAGCTGCTCCTTGAGGATAAGGGCTAACTCACAGGCAGTGCACCAAGAGCCACTATAAAA 

AGATCCTTMTGAGCAA MTATA TCCCCTATTATTTTCCTACMGTTGCTTTTTACTTGA 

GTAGGAACCCTTGATTGATTTTTGCGGACNCGTGGGTCGAANCTTGACCT 

Sequence 1321 

GGGCGAATTGGAGCTCCCCGCGGNGGCGGCCGAGGTCAAGCTTCGACCCACGCGTCCGGT

TTGTTTTTTTCTTACGGCAACTCAAAGCAAAGAGCTGGAGGAGCCAGCCATTATNATTGC 

TTACTCTCATCGCTTAGCGCCCCAGGTGGGATGTGTTTCCAAMCACATTTTTGTATTTA 

TMGGAMTGTAGTTAGGATTMTTTTATTGTCCtMTTAGAACTCACATTTTGGTTAAA 

TCCTC AATTTC ATTAAAAAAAAAAAAAAAAAAAGTGCGGCCGGCCGCCCGGGCAGGTACA 

ATTTAATTTTTCTGCTTGCCCAAGAAACAAAGCTTNTGTGGAACCATGGAAGAAGATGAA 

AATGAGACTGGCAAAGAACAAATGCTGAATCTGAAGAAGATTTGGGCAAATAATCTGCAT 

ACTTTTAATTGGGAATAAGATGGAAAATATGAATGCTAAATCAAATTTTTTA 
Sequence 1322 

CCGCGGTGGCGGCCGAGGTACAAGCTTCGACCCACGCGTCCGCTCACTTCATCCTCCCAG 

CAACCTATTATGATCCATTGCCACACCAACTTGCTGATGAGGAAAGTGGGGCTTAAGGAA 

ATTAAAGAGCTGTTGTGGGACTTCCAAAGCAGAAGACAGTAGGCTTTCAGAAATTTGATA 

AAAATAGCACTTTGCATTTCTTGAATCTTGAGCTAAATGGAAATTAATACTAAACATTCT 

CCACTGGTAAAATAGAGAATAAGGATATTAACAGTAAAAGAAAAGAAGAAGAAAAGGAAA 

TGTGCTTCCACAGATTTAGAAACATAAGTAACAATCTAAGTTAAGGCTTTGGCACCTGCC 

CGGGCGGCCGGCCGCCCGGGCAGGTTCAAAGACTACCAAAGTATGTATTTGATTTTCACA 
TGCAAACAACTTAAA 




Sequence 1323 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCGTC 

CGCAGAAACCTTGCCATCATTCTTACTGCTGGTTTGCATCTCATTTATGGTGNNTCTGGG 

ATTTCTTCTCATGAACAGAMGATTCAGTGGCTGTTATATGCCTTGTCAATTTAATGTAT 

TGCCCTATCCTCTTTTTGATCAAAGATAGAGACTAAGACTGGGAATTATGACAGAAAAAG

TCATATTTTTCTTTAAATGATTTTGAMTGTTAAAATAGGCCAATATGAGTCAAAGTGCA 

AATTTTTTGGTGACCTGCCCGGGGCGGCCGGCCGCCCGGGCAGGTACTAAGCATTTCAGT 

TCCAGGAGMTMMGMATTCCTATTTGAAATGMTTCCTCATTTGGAGGAAAAAAAGC 

ATGCATTCTAGCACAACAAGATGAAATTATGGAATCAAAAGTGGCTCCTTCCCATGTGCA 

GTCCCTGTCCCCCGCCGCCAGTCCTCCACACCCMACTGTTTCTGATTGGCTTTTAGCTT 

TTTGGTGGTTTTTTTTTTTTT 

Sequence 1324 

CCGCGGTGGCGGCCGCCCGGGCAGGTGCCTMTATATTTACTCTCTGGTCCTTTACAGGA 

AAAGTTTGCCAACCTCTGGCTTAGATGATCACCTGAGGCCAAGGAGCCTCGCCCTTGAGC 

ACAAGACTATGTAGTCAGTAAAGCACAAACAAAATTGGGGCTTTCCCTAGCAAGGTTGGA 

AAGGCGGAGAAGAAATGGATTTGGATAGGTAGTCAACAATGTCTGTTTTATGTTACCACA 

CATTTTCTCGAGAAATTTCMTCAGCTCTCTGAGMCAGATTCATCTTTAAATGAATGTT 

CATAGGTAACAGCAACTCATGCATCMTGTTGCAAAGTGAGCTCATTTTCACATTGCTTC 

AGGTTAGGCAGMGGTTTGGTAMGGGATTMCGTMTTGTTTCCTTGNTGTTTACAAAA 

AGAAGTCCCAGTTGGCATGCCACATAAMTCTTCTGNATCTCACTCTTGGTTACATTTC 

Sequence 1325 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCACGCGTCCGCAGAGGCCC 

AGGCTCCCAAACCGACAAAGTGAAAAGAGACCAGAGAGGCCAAGCATATTGACTGGTGCT 

GTTCAGGGCCTGCTCTTTTCCACTCACCACTTGTTTTGCTGCTTGTCACGAGGAGAGTTG 

TTCCTGTATGTGGCTGCTCTCAGATCTTTCCMGCAAGCCAGTCATTTGMGAGGTTTTC 

TTTTCATGCTGGAGGGCAGGCTAAGATCMTGAGTGGMGAGAGAMGGCTGTTTTAGCT 

CAAGTTAAAGGAACACCTTCTAGCCATCAAAGCCGCCCAACAGAGGCAAGGGCCACCACA 

CATGAGAGAGCGCTCTNTCCTTAA 

Sequence 1326 

GCGAATTGGAGCTCNCCGCGGTGGCGGNCGCCCGGGCAGGTACCAAAATAATTACCAACA 

NTACATTATGTACACCATTTACAGGAGGGTMCACAMCCTTGACAGGTAGTMCTTTTC 

ACCCCACATNACTGMCGCTTMCACTCCTGGCTGTTMTTGTCAGTTCAGTGTTTTAAT 

CTGACGCAGGCTTATGCGGAGGAGMTGTTTTCATGTTACTTATACTAACATTAGTTCTT 

CTATAGGGTGATAGCGGACGCGTGGGTCGAAGCTTGACCT 

Sequence 1327 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATCTGTAGCCTATGACTTG 

AGTCTCTTGAACTTCTAGGAAGAGGCAAACTACAAACTACTAGGATTCTGATTTCAGATA 

TAGGCATTCCAGMTCTTCTCTTTACGAGTTCACCTGCTAGTATAATCTCCACAACTTGA 

ATGGCCTTGGTTGTTCTGTAATTGCTGCCAAAATCATCACAAGCTGTAGCTGCCCGGGCG 

GCCGGCCGCCCGGGCAGGTCAAGCTTCGACCCACGCGTCCGGATGGGAATTCAGGTATGA 

AAGAAAACAGGCAAGGAGGCACTGAGGGAGAAAGACACAGACTTTATCGCTCTGTGGCTC 

ATTGTTACTGGMTATTCTAAMCTCTTGTTCACATGCTATTATGACTTATAAAGCAGCA 

ACAGCTGAGGCGCACCAGGACACAGCTTCCATTTCTTTAACGT 

Sequence 1328 

AGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCGTCCGA 

AAAATGGGAGACAATTTCACATGGACTTTGGMAATATTTTTTTCCm 

TCAMCTTAGTTTTTATCTTTGACCAACCGAACATGACCAAAAACCAAAAGTGCATTCAA 

CCTTACCAAAAAAAAAAAAAAAAAAAGACCTGCCCGGGCGGCCGGCCGCCCGGGCAGGTA 

CAAG CTCGACCCACGCGTCCGMATAATAAAGCTAGMGTMTATTTTTCCTTTTGTCTA 

TTTTCCAMTTGACTCGATATTGATGGCTACTTTTGTAAGTTTTTA 

GAATATTTATTGATCACCTNTATGTGCTCAGTACCT 
Sequence 1329

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTGCTTCGACCCACGCGTCCGC 

TTGGGGATTTCGAGGMGGGTTCATMGGGAGATTTTAGCTGAGAAATACCATTTGCACA 

GTCAATCACTTCTGACCAAGTTATCAGAAAAAGGAGAAAAGAATGTCTCCCCACTAAATG 

TTCTAGGGTGGTGAGAAATCTAGGGTGGTTATCTAMTCAACAATATTTAGATATTCCAA 

TATCTAAATATTGTTGGAAATACTCTCCTGAAGTGTTCATTGAACTCTAAGAGAGACAGC 

TTGTGTATCAGTGGCAGGGTTTMGGTTCATTTTTTATTCCCATATTM 

TTAGACAAATTTCCTTCTGAGTTTAAGGATAAAATGGGATGGGTTCTGCCTGGGCCTGGC 

CCTCATGGGGACATCAAAGGGCAATGTTGCAAAAAAAAAAACC 

Sequence 1330 

AGGTCMGCTTCGACCCACGCGTCCGTGMCTTTTATCAAGGCTTTTGCTCTTTTAGACT 

TGAGTTTATCTTTATAATTMGGAGMTGGTTTTTAAMTTTAGTTCCTCTGACACCCCA 

AAATTATCAAMTA AATTA TGTTGTAGTGAATCTGTGTTTTGAAAGTCATTGATAGGACT 

TATATGAGTCAAAATTTTATGGATTATAMCTAGGCTTTATCTGGTTGGAAATAATTGCA 

ATACMGAAGCAACTTTATTAMTTAGACCTAMGTCACAATCTTCTTCTTTGCTGCTTT 

TTAAAMTTACCTATTACCTTTAAAAGATCCCAMTTTAGAAGAGGAATTAAAATAAAAG 

TTAATGCAATAAAACACTTCCACAATATTCTATTACTTCAACCTCTAATCAATGAAA 

Sequence 1331 

AGGGCGAATNGGAGCTCCCCGCGGTGGCGGCCGAGGTACCCTAAAATTTAAAGTATAATA

ATMTAMTTTTTGTTTTAAAAAAAGAGTGTTGTCTTTGTCTTGTATTTTCTC 

CATGTGATATTCTTA GGTAT AGA I I 1 1 1 1 1 \ AGTATTTGTCCTGTATATTGTTA7TCGAG 

CTTCTGGGATCTGTGTTTTGGTGTCTATCATTMCTTTGGMTATTCTCAGTCATTACTG 

CT TCAM CATTCATTCTGTTGCTTTTTCTCTTCTGGTATTATCATTACACATATATCACA 

CCTTTTGTAA TTCT CCCACAGTTCATAGATATTCTGTNGTATTTATTTATTTTT^ 

TGCCTTTTAGTTTTAGAGATTTCTATTGACATGACTTMAGATGATTGATGAGTTGATGA 

GAATTGAGAGAATTGATGAGAATTGTTGATGAGAATTATT 

Sequence 1332 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTTTCCTGCATTTC
TAATGMGAMTAGGCTGGTCCTCMTTTTGCAGAAGTTGTATCATCATAGGTCATACCT 
AACATTCGTTTGTCAAGAGCAAAAAAACCCCCTTGGGTTCTCTGGATCTCACACAGCCCA 
CAAACCTTCAGAATGTGGTTCCTTCCCGCAGGCTTTGTCACACTTAAGATCCAAGAACAA 

ATCAGCCTGGCTTTAACATGGGGTAGATGGCAAGAAGGATAATGCGGACGCGTGGGTCGA 
AACTTGACCT 

Sequence 1333 

CCGC GGTGG CGGCCGAGGTAGCGGTCGCAC I I I I I I I I I I I I I I I I I I I I I AAATAG AG A 

TGAGGTTTTGCTATGTTGCCCAGGCTGGNCTNCTGGACTCAAGCAATCTCCCACTTCAGG 

CTACCAAAGTGCTGGGATTTACAGGCATGAGCCACCTCTCCCAGTCTCAGTTATTATTTT 

AATAAATGAGACTGAACGTCCTC7TATAAGGCTCACTCCCTTGTTCCTACTACATTTGCT 

CTGTTTAAGTATCTCTTTAAATTCTTCAGTTAAGCGGACGCGTGGGTCGAAGACCTGCCC 

GGGCGGCCGGCCGCC CGGG CAGGTATTMCAGGTGCTTGCAGTTTGTTGACTTTTTTGAA 

AAMTCAAGTTGTAMCTTTTATTACAMTTAMMTGMGTTCTTAAAAATCTCAACTT 

GACCAGATATGAAACMTTTAAAMCCTTTMGGCGTATTGAGAAAAACCAGGCTTTTTT 
AAA 

Sequence 1334

ACTTAGGGCGAATTGGAGCTCCCCGCGGNGGCGGCCGCTCACCTGCCCGGGCGGCCGCTC 

GAGGCCGCTCACCTGCCCGGGCGGCCGGCCGCAC i H I I I I I I I II I I II I I I NNCAAAA 

CMMCATGCTTAGCATGCACACTTTTACCACTTTTTTCGAGTGGAAAGTTTATTGGCM 

TATTAAATTTCACCCTANATAGGATATGAGAATGTTTTGATAAATCACAATTTATAGTAT

ATTMTGCCATGTGAGAATTTTGTTTCCCAAGTAAGAGCTCACATGGAACTTGGTCATTA 

AACCTTAAAGAAACCTTTCTCACATATCTATAGGCCTCAAATTGAAATAATCTATAAATG

AATTTGTAGATTTCTTTTTAGTTTAATTCCTGAGTATACAGGGCMMGCTTATATCCTT 
TATATAAACTTCTGCTTTGGTCTAAAACTGATATATCTTCACGTTGAGGTTTCATCTGAA
ATGCNCCACCGTTTGCTGACTTGCTTCAATATGMTTTGGATGGCTATAAAATTGACCTC 
GGCCGCTCTAGAACTAGTGGGATCCCC 
Sequence 1335 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCGGCCGCTCGAGGCCG 

CA CI I I I M I I I I M I I I I H I I GGTAAACAGGCGGGGTAAGATTTGCCGAGTTCCTTTT 

ACTTTTTTTMCCTTTCCTTATGAGCATGCCTGTGTTGGGTTGACAGTGAGGGTAATAAT 

GACTTGTTGGTTGATTGTAGATATTGGCGGACGCGTGGGTCGAATCTTGTACCTGCCCGG 

GCGGCCCATAGTTTGTCAACCACTGGTGTAMACCTTAGTTATATATGATCTGCATTTTC 

TTGAACTGATCATTGAAAACTTATAAACCTAACAGAAAAGCCACATAATATTTAGTGTCA 

TTATGCMTMTCACATTGCCTTTGTGTTAATAGTCAAATACTTACCTTTGGAGAATACT 

TACCTTTGGAGGGAATGTATAAAATTTCTCAGGCAGAGTCCTGGATATAGGAAAAAGTAA 

TTTATGAAGTAAACTTCAGTTGCTTAATCAAACTAATGATAGTCTAACAACTGAGCAAGG 

ATCCTCATCTNGAGAAGTGCTTAAAAT 

Sequence 1336 

CCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCGTCCGGAAGATCCTGCGGAAGGAA 

TATGTTTTTGCTGACTCCAAAGTAAGTGACAGCAAACTTCTAAAATGGGCTGTGAGGTAG 

GGAGGGGACACAAGCGTTTTGAGGCTCGCTGNGTGCCAGGGAGTGTATCATTAGCTCACT 

CMTTCCCAGMCAACCCATTTCACACCTGGGAAAGGTGAACTTAGAGAAGTTGAGGATC 

ATGTTCCAGGTTGGCCTGGATTTGAGCCATCACTGTCTCAGGAGTAGGGAGGCTTCCCAC 

TTTGCCCAGCTGCCTCCCAGCCTCGAGGCCACATCCTTTATGACCCACATCTAACTCAGC 

CCCACACCTGGGGGMAGGCTTTCAGCTTCTCTGGGCTGGACTTGGGAAATCTTTGGGAC 

ACTCCTGACCTGCCCGGGCGGC 

Sequence 1337 

CCGCGGTGGCGGCCGCCCGGGCAGGTGTCCCCATGAGGGCCAGGCCCAGGCAGAACCCAT 

CCCATTTTATCCTTAAACTCAGAAGGAAATNNGTCTAAATATTAAAGGATTAATATGGGA 

ATAAAAAATGAACCTTAAACCCTGCCACTGATACACAAGCTGTCTCTCTTAGAGTTCAAT 

GMCACTTCAGGAGAGTATTTCCMCMTATTTAGATATTGGAATATCTAAATATTGTTG 

ATTTAGATMCCACCCTAGATTTCTCACCACCCTAGMCATTTAGNGGGGAGACATTCTT 

TTCTCCTTTTTCTGATMCTTGGTCAGMGTGATTGACTGTGCAMTGGTATTTCTCAGC 

TAAAATCTCCCTTATGAACCCTTCCTCGAAATCCCAAGGT 

Sequence 1338 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTGCTTCGACCCACGCGTCCGCTT 

GGGGATTTCGAGGAAGGGTTCATAAGGGAGATTTTAGCTGAGAAATACCATTTGCACAGT 

CAATCACTTCTGACCAAGTTATCAGAAAAAGGAGAAAAGAATGTCTCCCCACTAAATGTT 

CTAGGGTGGTGAGAAATCTAGGGTGGTTATCTAAATCAACAATATTTAGATATTCCAATA 

TCTAAATATTGTTGGAAATACTCTCCTGAAGTGTTCATTGAACTCTAAGAGAGACAGCTT 

GTGTATCAGTGGCAGGGTTTAAGGTTCATTTTTTATTCCCATATTMTCCTTTM 

AGACAMTTTCCTTCTGAGTTTAAGGATAAAATGGGATGGGTTCTGCCTGGGCCTGGCCC 

TCATGGGGACATCAAAGGGCAATGTTGCAAAAAAAAAAAAACCTGCCCGGGCGGC 

Sequence 1339 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGATTCTCTGATTAGATTTTAAACTT

TTTTGATGAAATATTGAGTCTTAACTACTTTAAGATGCCATAATACTGAATACAGNGCTA 

AGCAAMTAAATATTGACTAGTTCTCATTTCTATCTTTCAAATATTTCTAATGCTCCTCT 

TTTATAGCATGGGCTCAGGTATCAGATGGCGTAGGTCAAGATCTTGGCTCTACTGTTTAC 

TTACGGGAMTACTTTTATGTTGCTMATCTCAGTTTTCTCTTCTGTAAGACGGGATTAA 

AGTACCT 

Sequence 1340 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCTACTCAAGTAGTCT 

TTACCCCCTACTCAAGTAGGGGGTAAAGTGTAGAACAAGGAGTTTGATNTGTGTTNGCTG 

ATTGTGAACCATCMTTGAGATAACTCACTACCTTCAGGCCAGCCAGTTACATACTTTTG 



AAMGCCAAGAGTGAAGCAGGGTTGTTTTTCATCCAATTCTTGGT C 1 1 I I T GTTAAAGGC

AGCAATMGATAGGGTGGTTTCGGGCAATCACTTAGCTAATTGGCTCTCTATAGTCATAC 

CTGGATMTATTTGTAGTCATACCTGGGATAATATTTAAAGGGAAGAAACTAAACATAGT 

CCTTAAGTAGGAACCAACTACAAT 

Sequence 1341 

CCGCGGTGGCGGCCGAGGTCCTAGCTTGAGTCGACCCACGCGTCCGGCCGCTGTTCGTAT 

TTCTTATTCTACMCAAGGGNGCAGCCTANAGGCAMACACATCCCATTGTCATTTTTTT 

GTAMTAMGTTGTATTGGMCATGGCCACTCTCATTTGTTTTCTATTATTTATGGCTGC 

TTTCACTTACAACCTGAGTGGTTGCCACAGAAACTGTATGGCCTGCAAAGTCTAAAATAT 

TTACTATGTAGCTTTTTCTTrTCT^ 

GAGTGCGGTGGTGTGATCATGGCTCATTGCAGCCTCAAACTCCTGGGCTCAAGCAATCCT 

CCCGCCTCGGTCTCCCAAGTAGTTGGGACTACAGGCATGAGCCACCATACCCGGCTAATT

TTTTTAAAGTTTTTGGTAGAMTGGAGTTTTTTMTGT^ 

CCTGGTCTTAAATGACCCTTTTCCCATCAG 

Sequence 1342 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAGTTATTTATTGATTTAATCAT

TGTMTCTCCAATAGAGATTACAATAGAGATCTCCAACATGATTTCATGCATTTAGAGGA 

GAMTATTTCCTGGTTMGTGGAAMTTGTGCGGATGTGGCTTCTGGAAGACCTTCATTC 

TAAAGCAGCGGACGCGTGGGTCGAAACCTGCCCGGGCGGCCGGCCGCCCGGGCAGGTCTG 

CAATCCAGCTAGGCATGGGAGGGAACAAGGAAAACATGGAACCCAAAGGGAACTGCAGCG 

AGAGCACAAAGATTCTAGGATACTGCGAGCAAATGGGGTGGAGGGGTGCTCTCCTGAGCT 

ACAGAAGGAATGATCTGGTGGTTAAGATAAAACACAAGTCAAACTTATTCGAGTTGTNCA 

CAGTCAGCAATGGNGATCTTTTTGCTNGTCTTGCCCATTCCTGGA 

Sequence 1343 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAGCTTTAAAACCCATA 

CCCCTCC AGGGTT CTTTTCTGTTGCTGGTGMGTGCACTTTTrAAAAGAGTNATTCATAC 

CATCMGATTTTTGACMGAMATTTTAGAAAAACTGTGGAAGAAAACTGATTGCTCTTA 

GTTCTAGCCATGTGTAATTGCTGACCACCTGAAATGGTCCAAACTGAGATTTGCTAAAGC 

ATAAAATACACACCATATTTCAAAGGTTTTTAAMGMTGTAMACATTTCATTAATTTC 

GGACGCGTGGGTCGAAGCTTGACCTGCCCGGGCGGCCGAGGTGGATGGACCCATCCATTC 

AGGCAGGGGGTGTGGGGTGTCCCCTGTGCTTAGAAACCACCTAGCATCATAAGCTGCAAC 

AGCACTTTATTGGGATCTGAGTCTACAGTTCACATAGGGAGGTGAAGCCGTGGGAGAAGC 

AGGGNGTAAAAMAAAAMAGGGGGGGGGACTTTACCCCCCTAAGGACAGGNTGCTTTCC 

AAACCTAACAAAAAC 

Sequence 1344 

CCGCGGTGGCGGCCGAGGTCMGCTTCGACCCACGCGTCCGGTAATTTGTTGCACACTAT 
GTMCAAMCAACTGAAGATATGTTTAATAAATATTGTACTTATTGGAAGTAAAAAAAAA 
AAAAAAAAAAAAAAAAAG TGCGGCCGGCCGCAC I ITTTTTTTTTTTTn I GCTGGGGTTT 
TTTTCTTTTCI I I I I I I I I CAGCTACAGGAATTTAGCCAATTCANAGGAAATCTTCCCCA 
TMTTATGGMCTTTNTTACAGATTTTACCAAGTCTGGTCAACCCAATAAGAAAAAGACT 
GAAATAACAATAACAACTTCAACAAATAAAAAAACAGTTAAGCTAAATAAACAGATGATT 
GCAGAATTTATGTGATTACTGGGTACCTCGGCCGCTNTAGAACTAGTG 
Sequence 1345 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCAAGTTTGAGTTGAAACGGTATGTGACTTCCC 

CAGCTGCACCCTGGGCAGTGACTGCATGCATCACTGAGAGGTCCTGTCTACAGCAGATAA 

AACTCCACAGATCACTCCTCCTGTAATCCCTCTAAGTGCTCCAAGGCAGCAGAAAGGCCC 

AGTGCATTGAGGCTGGAAGCAGGAGCAGAGACTCTGGGATATAGTGCGAAAGTCTCTTTC 

CCCTGTAGTTGGGCTAATCTGGAAAAACTCAAAAACCTGGCCTGATTACCGAGGTTTCTT 

TTATGGATATTTAGTATTTAGATAAAATTTTTACAGTATTCTTGAMTGAACCCAATTAA 

ACACATAGTTCTCAGTCTTGACCACACATTAAGAATCATCTGGTAGACTTCTGTAAACTA 

CCAATGCCTGGCCA 
Sequence 1346

CCGCGGTGGCGGCCGCAC I rTTTTTTTTTTTTTTTTl I ACCTGAAAATGCTTATTCTAGC 

TTCACATTTGATTGTTTGGCTMGAAGAMATTATTTATTAGACTTMTTTTCCTCAC 

GTTTAAAGATTGCTTCAGATCTTAAACTTCTAATGAGGAAAGCTGAGAAGTCCAATGCCA 

TTCT GATTCTTGCMCTTACMGTAGTCTTTTTTTGTCTANACGCTTTCAGGACCTTC^ 

TTTTCCTCAGTCAGTGTATCCAAACCTTCACAGTGATATCTTTTGGGTACCT 

Sequence 1347 

CTCCCCGCGGGGGCGGCCGCCCGGNAGGCNAAGCTTCGACCCACGCGTCCGCTTTAAAGG 
GAATTCTNTGTAGAGTGGGAGGCGAACACGNCTGGNNCTTCCAACTCAGGAATTCTCGTG 
GCTGGGCTGGGTCAGCGATGGCTTTGTCTCTTTATGTCTAMGTGCCCTATGGCATGCTG 
MGGTTACCTAACCATTCTTTAAAAGGAGAATGACCCTCCATGGGAATGGCCAGCCTGCC 

AACTGTGCAATTGAAGAAGACCCGATGGATCAACCCCATGTCTTCCTTGGGGAGAAAGTG 
CATAAACCAGGGGTCCC I I I I I I I I I I 
Sequence 1348 

AGGTCAAGCTTCGACCCACGCGTCCGCAAAAATCAATCAAGGGTTCCTACTCAAGTAAAA 

AGCAACTTGTAGGAAMTAATAGGGGATATATTTTGCTCATTMGGATCTTTTTATAGTG 

GCTCTTGGTGCACTGCCTGTGAGTTAGCCTTATCCTCAAGGAGCAGCTTAAAAAAAAAAA 
AAAAAAAAGT 

Sequence 1349 

GCGCGTATACGACTCCTATAGGGCGAATTGGGAGCTCCCCGCGGTGGGCGGCCCGAGGTA 

CMACTATGTATCTGAMCACTTCTATTTGGCMTTTTATAACAAATCAAATTTTAAAAA 

GAACAAAAGAGATTGCAGATTACTTCGCAGATACAGAATAAAGCAATTGATGAAGTGCTT 

AAGCAAAAGAAACAACMAAAAAGAAMCACACTGCTTTTCTTTTTAAAAATAAAATCAC 

ATTGCTATAGATCAAATGGATAATACCCTTATTAAACAACCATTCCAGAATGTCTTATAG

TAGCAGTGCTTTTATTTGCACTTCACTTMTTTTATAAGACTCATTTTCATGTATATAGC 

TCTTTACCCCCATTGTTMCGMTAMGTCTCTCATMTTTTTACACTTTTAMTT^ 

AAAGCAAATGAGAAATGATTTATGTATCGTGGAACCTTTCCCATTTTGGAACCAAAGGTT

TTMTTCTATATTTTTGNCTAATATTTCTTTAAAAAAT 

Sequence 1350 

CC GCGGTG GCGGCCCGCCCGGGCAGGTACTATCTATAMGGAGGTTTGATGTTTTCATTA 
CTGTTTTTGTAMTATTTCAGCATTATCTTTAAAAAGTAAGGACATTGGCCGGGTGCGGT 
GGCTCATACCTGTAATCCCAGCGCTTTGGGAGGCNGGTGGGTGGATCACCTGAGGCTAGA
TAGTTTTATTCACTTGGCTGTTTCACCAAAAAAAAAAAAAAAAAAAAAGTGCGGCCACCT 

Sequence 1351 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGAGCAGGATTACCATGG 

CAACAACACATCATCAGTAGGGTAAAACTAACCTGTCTCACGACGGTCTAAACCCAGTAG 

AAACAAAGTGC GGCC GGCCGCCCGGGCAGGTGCCGCACI 1 1 I I I I I M 1 1 1 I I I I I I I A G 

AATAGGATTGMTTTTATTAAAAACAAAAATAAATCTAAAAAGCTTCCTTCAGTTACAAA 

TATGCACMGAATTTCTGCATTACATCATTTGACATAAAATGTTCTGAATGACAGAAGTA 

GAAGTAGAACTTACTACCATTTGAAGACAGGAGTTGAGCGCTGAAAACCACACACATTTA 

TAGAAAGAAACCAAAGTTTCACAGGGAAGACCTGTGATCTCTGGCTACAGGAGCTGAAAT 

TAGGAACATGAAAGAAACTTGGAGAGAGAAGACATTCAATACTCTAAAATACTTCAGCAA 

AAATAGTCAAACATNTGTNAACAACTTGGNACAAAACTTTATATGGTGGGGGGTGGCTAT 

GCCGGAAMANTCTTNACTGGNTATTATTCACCTCAMMGGGGGNTTTTAATGNTCACG 

AATCCTTCCTTTAAMTAMANAMGCNTGGNTTNTTTTTCTGGNGTCAAGAGTAAAANG 

TANTAGGGNNTACTCAGGGATGGTTTGGAATTTTAAAACGGGGCNTTCCACCCTTGGTGG 

CTNGTGGCANTTTANCCCAAAACGGCNAAANAACCGGCCCGNGGTCAACTTGNAAAACCT 

GGCGGNTTNAAATANGGACCCCGGGTGGGGGATTNANTNAGNTTTGANCCGNNNCTTGGG 
GGGGCCGGCCC 

Sequence 1352 
CGAGGTCAAGCTTCGACCCACGCGTCCGGTTACTAGAGCCCTTGAAGGGCAGCACTTGGG
TTTAGTTGGCCTCTTCTGTTTTTACCTAATTTCTC 

TTTCTTAMTCCCAGTMTTATCTTAGGANMGCAGTTTTCTATGCATAGACAGTTCATT 

TAGATTGGTAACATGGGCTAGGTAAAAAAAAAAAAAAAAAAAAAAGGTGCGACCTGCCCG 

GGCGGCCGCTCGAGGCCGCCCGGGCAGGTACTATGTCGATTCGACAGAACATTCAGAAGA 

TTCTCGGCCTTGCCCCTTCACGAGCCGCCACCAAGCAGGCAGGTGGATTTCTTGGCCCAC 

CACCTNCTTCTGGGAAGTTCTCTTGMCTCMGAACTCTTTATTTTCTATCATTCTTTCT 

AGACACACACACATNAGACTGGCAACTGT7TTGTAGCAANAGCCATANGTAGCCTTACTA 

CTTGGGCCCNTTTCTAGGTTTGAATTATTTCTMGCCTTTTGGGNATGATTAGAGNGAAA 

ATGGCNCCNGCAMCTTGNAGGGGCTTTTGGNNCCANAATGATTTTTAATAAAAAAAAGG 

GGATTGAATAGNTAMNTCMGGGAANGGTTTATGNAAAGGAAAAAAAAAAGCCTCCTTC 

NTGGTTTMTTTACAAAAGGTNTTTTNTNGGGGGACCGNCTNTAAAGNACTNGGGNTTNC 

CCCGCAAGGTGGGNNGGTATTNACCCNTTTNNGGNTTNAAAAAAAMANTTNGNNNGGGT 

NAACCCTGGAACNGGGGGGGTTNGGNG 

Sequence 1353 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGCCCGGCAGGTCTTCGACCCACGC 

GTCCCGGGGTTTCAMCTGMTAACTCATTGTTGGGTCTGGATCTGTAMMGGTTTTGT 

CACTGATGGGCAATATGGAGAGAATGTAAAAATATCTAACTTCAAACAGAAAAAAGAAAC 

AACTGGATGAAAGCTATAAACAATAGTTCAGAAGATTGGCGTAGAGGATTTACCTACAGA 

ACTTCAGGAGATTCCTAAGAAGGCCTTCAGTGATCTTTTCATCAATATCATCAGGCCTTA 

TCATTGTTTCACATTTGCTTCTCTTTACCATAGGGAATATAATAATTATTTACTGGTTAA 

CTTCCTAGGGAGATTGCCTGCGGGTTATTTAAGATCCAAATTTTAAAGTAATAATTTCTG 

TTGAAGCTGCTTGTGAGGTGGTTGGGTGGGCAGATAGAGTGAAGCCAGGGACACACACTA 

AATGAGCCCGGGATGTAGGCAGGTTTTGATGTTTTGCTTGCTTTTATCCCTAACATT 

Sequence 1354 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGATTCTCTGATTAGATTTTAAACTT

TTTTGATGAMTATTGAGTCTTAACTACTTTAAGATGCCATAATACTGAATACAGTGCTA 

AGCAMATMATATTGACTAGTTCTCATTTCTATCTTTCAAATATTTCTAATGCTCCTCT 

TTAAGCATGGG CTCA GGTATCAGATGGCGTAGGTCMGATCTTGGCTCTACTGTTTACTT 

ACGGGMATACTTTTATGTTGCTAAATCTCAGTTTTCTCTTCTGTAAGACGGGATTAAAG 

TACCT 

Sequence 1355 

AGCTCCCCGCGGTGGCGGCCCGCCCGGGCAGGTCTTCGACCCACGCGTCCGGGGTTTCAA 

ACTGMTMCTCATTGTTGGGTCTGGATCTGTAAAAAGGTTTTGTCACTGATGGGCAATA 

TGGAGAGAATGTAAAAATATCTAACTTCAAACAGAAAAAAGAAACAACTGGAATGAAAAG 

CTATMACAATAGTTCAGMGATTGGCGTAGAGGATTTACCTACAGAACTTCAGGAGATT 

CCTAAGAAGGCCTTCAGTGATCTTTTCATCAATATCATCAGGCCTTATCATTGTTTCACA 

TTTGCTTCTCTTTACCATAGGGAATATAATAATTATTTACTGGTTAACTTCCTAGGGAGA 

TTGCCTGCGGCTTATTTAAGATCCAAATTTTAMGTAATAATTTCTGTTGAAGCTGCTTG 

TGAGGTGGTTGGGTGGGCAGATAGAGTGAAGCCAGGGACACACACTAAATGAGCCCGGGA 

TGTAGGCAGGTTTTGATGGTTTGCTTGC 

Sequence 1356 

CGCCCGGGCAGGTACTATCTATAAAGGAGGTTTGATGTTTTCATTACTGTTTTTGTAAAT 

ATTTCAGCATTATCTTTAAAAAGTAAGGACATTGGCCGGGTGCGGTGGCTCATACCTGTA 

ATCCCAGCGCTTTGGGAGGCNGGTGGGTGGATCACCTGAGGCTAGATAGTTTTATTCACT 

TGGCTGTTTCACCAAAAAAAAAAAAAAAAAAAAAGTGCGGCCACCT 

Sequence 1357 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTCGCGTCCCGAGTGTTTCCA 
CTCTGTCCATAAAATGGGAGCTAATATTCTCCAACCTGTGTGCCTGACATGATGGTTAAA 
GGGATTAMCAAAACAATAGTTTGTAATTTATTCTGTCAGAGCAAACTGCTGGTAAATAA 
AAGGGCTAGTGACGAMAATAAATTTTAAAAAACCTMTAAAACAAGTTTGTAATTTATA 
ATTGTATACAAATAAAAGATGTTACAAAAAAAAAAAAAAAAAAAAAGGACCTGCCCGGGC

GGCCGGCCGCCCGGGCAGGTTTTATTTAACATTCAAACTTCATTAAGACATGTGCAATAT

GGCAATTTTACTGGGGATTAAACCCTACCTAGGATTGCTTGCTGGGGCTTAGCAACAGGG 

TCCAGTTCACACTTAGCACTMTTAAATACTTTATTGAATAAATACAATACCAAACAAAA 

TGCATTCAAA 

Sequence 1358 

CCGCGGTGGCGGCCGAGGTCAAGTTTCGACCCACGCGTCCGCATTATCCTTCTTGCCATC 

TACCCCATGTTAMGCCAGGCTGATTTGTTCTTGGATCTTAAGTGTGACAAAGCCTGCGG 

GAAGGAACCACATTCTGAAGGTTTGTGGGCTGTGTGAGATCCAGAGAACCCAAGGGGGTT

TTTTTGCTCTTGACAAACGAATGTTAGGTATGACCTATGATGATACAACTTCTGCAAAAT 

TGAGGACCAGCCTATTTCTTCATTAGAAATGCAGGAAACCTGCCCG 

Sequence 1359 

CGCGGTGGCGGCCGAGGTCAACGCTTCGACCCACGCGTCCGGGACCTCAGAATATAAAAA
TATGGI I 1 1 1 1 I I ICAGACTTACTAGI Mill! I GATAATTCCTCTACGAATGTTGATTT 
AACTTAGAMTATGTAAATTTAATATTCAAMCCAMTTATTTTTTAAAGAGGAAAAAAA 
TATAAAACCTGCCCGGGCGGCCGGCCGCAC I I I I I I 1 I I II I I I 1 1 1 } I j I I NAAACTTT 
MTAGNGTNCGGAAGNTGAATAATTTATGAAGGAGAGGGGTCAGGGTTGATTCG 
Sequence 1360 

ACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTGAGTCGACCCAC 

NCGNCCGGAGATGTATACTGCCACTATAGGAACTATAAGAAAAAGTCAAATGGAAATNTN 

ATAAATAAAAACCACAGTCACTATAATGAGGAAATACTTTGATANGGNGTCAGTGAACTC 

AAAAAT NANTCAA TNGAAACTACTGAAACTAAAACTCAAAGAGAAAAAAAANGATGGGAG 

ATMTTAT TTTTT AAGAATTGGTCATCAAAATGTAGCAACAAGTTCGCCTTATCCTATAT 

CATTTGAATTTTCAAAAMTAAGCTCATTATACAATCTTTAAMTATTTTGAATAGAACT 

GTTTCATGTGTTATTNGTGAAAAT 

Sequence 1361 

CCGGGCAGGTCTACTCAAGTAGTCTTTACCCCCTACTCAAGTAGGGGGTAAAGTGTAGAA 

CAAGGAGTTTGATCTGTGTTCAACTGATTGTGAACCATCAATTGAGATAACTCACTACCT 

TCAGGCCAGCC AGTTA CATACTTTTGAAMGCCAAGAGTGMGCAGGGTTGTTTTTCATC 

CMTTCTTGGTCTTTTTGTTMAGGCAGCAATAAGATAGGGTGGTTTCGGGCAATCACTT 

AGCTAATTGGCTCTCTATAGTCATACCTGGATAATATTTGTAGTCATACCTGGATAATAT 

TTAAAGGAAGAAACTAAACATAGTCCTTAAGTAGGAACAACTACAATTTTAAC 

Sequence 1362 

ACTGI I I 1 1 I I I I ATTTGTTGAAGTTGTTGTTGTTATTTCAGTCTTTTTCTTATTGGGTT 

GACCAGACTTGGTAAAATCTGTMGAMGTTCCATAATTATGGGGAAGATTTCCTCTGAA 

TTGGCTAAATTCCTGTAGCTGAAAAAAAAAAAAAAAACCTGCCCGGGCGGCCGGCCGCCC 

GGGC AGGTTACMGCTTCGACCCACGCGTCCGGGAAATTTTAATTAAAAATAGGTGAACA 

TTTTAA ATGAC CTMTACATATTTAGTCCACATTGAAACTTTGGCATTT^ 

TTAMATTTTTGATGGCATTAAMTTTTNATGCCATTAAAMTTTTTGATCAGTAGGTA^ 

CA 

Sequence 1363 

CCGCGGTGGCGGCCGAGGTACCACGGTTGTCCCCTGAAAGGTGTTGTGTCCCTCACCAGA 

CTGGGAGCACCTCMGGGCAGMCCCATGTCATGTTCCTTTTTGTATTTCCAGACCTGAA 

ACTGCC AGTAAATAAACCCTA AAAGTAGAAAGAAAAAAAAAAAAAAAAAAAGTGCGGCCG 

CCGCACI 1 1 I I I I 1 1 I I I I I I I I I I I NGGAAMCCAAACATGCTTTATTTCATTTTTTTC 

ACAATTTATTTAAACATCTCANATATACAAAATAGGTACCT 

Sequence 1364 

CCGGGCAGGTCAGGAGTGTCCCAAAGATTTCCCAAGTCCAGCCCAGAGAAGCTGAAAGCC 
TTTCCCCCAGGTGTGGGGCTGAGTTAGATGTGGGTCATAAAGGATGTGGCCTCGAGGCTG 
GGAGGCAGCTGGGCAAAGTGGGAAGCCTCCCTACTCCTGAGACAGTGATGGCTCAAATCC 
AGGCCAACCTGGAACATGATCCTCAACTTCTCTAAGTTCACCTTTCCCAGGTGTGAAATG 




GGTTGTTCTGGGAATTGAGTGAGCTAATGATACACTCCCTGGCACACAGCGAGCCTCAAA 

ACGCTTGTGTCCCCTCCCTACCTCACAGCCCATTTTAGMGTTTGCTGTCACTTACTTTG 

GAGTCAGCAAAAACATATTCCTTCCGCAGGATCTTCCGGACGCGTGGGTCGAAGCTTGAC 
CT 

Sequence 1365 

TACTATAGGGCGNATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCC 

GATTGATTAATACCTGTCACAGATACATTTTGGTTTACAAATCAATGAACAATGGAGGGA 

ACTCTGTCCTTAATCTTGGTACGAGACAATGAACCCCAGGTACTTACCCCAGACAACGAC 

GCCGCTTNACCATGATGATGGACAACAGGCAAC I I M I I I I I GGAGTTTCAGCTTGCTTC 

CAACAGGGACGGTGAGTGTGAGGTTTATTCCCATTTCTAAGACGATAGMGTTTTCAGCC 

TAAGCCGTATTCCTAGGTMGCAGCTGGATTGCAGTTTTTGTCTTGGAAATTNTCCTTAA 

TTGNNTNANNCGTTMNATTMCAAACTAGCTGGNTNTTAAATTTTTIvnrCNm 

TANAGGTNCCCCCANAMTTNAAAATNMATTTNTGCAATTAATTTTTGAACCTTGCCCC 
GGGGTGGGCCCTGGCCCCCCCCTNGACAANGN I II I I I I I I I I I I II I 1 1 1 1 I 
Sequence 1366 

CTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCTTGAGTCG 

ACCCACGCGTCCGGAGCTGCTCAATAGTGAGAATCAGGTGATATAATGCATGTGGAAAAA 

GAATGTGAAAMTCTMCACTTTAGATTGTATACAGTGTTTTTTAAAAAGACACAAAAAA 

ACTGTCAACATGAGAAACATAAGCAAAGTTTTACTCAAGACAAACATCCACGAGTCACAA 

CTTCAGTTATTCCCAGTCTTCAAAATAACAGAAGGGCAAAGCAAAGGTAAACATGCAAA 

Sequence 1367 

GACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGATTCTCTGATTAGATTTT

AAACTTTTTTGATGAAATATTGAGTCTTMCTACTTTAAGATGCCATAATACTGAATACA 

GTGCTAAGCAAMTMATATTGACTAGTTCTCATTTCTATCTTTCAAATATTTCTAATGC 

TCCTCTTTTATAGCATGGGCTCAGGCATCAGATGGCGTAGGTCAAGATCTTGGCTCTACT 

GTn"ACTTACGGGAMTACTTTTATGTTGCTAAATCTCAGTTTTCTCTTCTGTAAGACGG 

GATTAAAGTACCT 

Sequence 1368 

CCGCGGTGGCGGCCGTTAAAGGAATAATCTGCAGAACATCTTGATTTACAAGGGACAAAA 

TGATGCAAATTATATGCTGTCCAACCTACTGGTGAACTGGATCAGAATGGTCCAAGGACT 

GTTAAACAGAGGMGTATTTACATTCTGAAAACTTGCGGACGCGTGGGTCGAAGCTTGTA 

CACCTCGGCCGAGGTACCTTCTGTCACAAAGACCCAAGCTTCCTCCAGCTTCCAGGATAG 

CAGTCAGCCAGCTGGAAAAGCCGAAGGGATCAGGGAGCCAAAGGTGACTGGGAAGCTAAA 

GCAACAATCACCTAAATTACAGTCCTCCAAGAAAGTTGCTTTCCTCAGGCAGAATGCCCC 

TCCCAAGGGCACAGACACACAAACACCGGCTGTGTTATNCCCATCCAAGACTCAGGCCAC 

CCTGAAACCTAAGGACCATCATCAGCCCCTTTGGAANGGGCC 

Sequence 1369 

CCGGGCAGGTCGAGCGGCCGCCCGGGCAGGTTTCCTGCATTTCTAATGAAGAAATAGGCT

GGTCCTCMTTTTGCAGAAGTTGTATCATCATAGGTCATACCTMCATTCGTTTGTCAAG 

AGCAAAAAAACCCCCTTGGGTTCTCTGGATCTCACACAGCCCACAAACCTTCAGAATGTG 

GTTCCTTCCCGCAGGCTTTGTCACACTTAAGATCCAAGAACAAATCAGCCTGGCTTTAAC 

ATGGGGTAGATGGCAAGAAGGATAATGCGGACGCGTGGGTCGAAACTTGACCTN 

Sequence 1370 

CCGCGGTGGCGGCCGCCCGGGCAGGTGTCGACCCACGCGTCCGACGACTCACTATAGGGA 
TCTAGATCACGAGCGGCCGGCCGCCCGGGCAGGTACAGAGATTTAAATGAAATCTTCGAA 
AGAATAAATTTGCTTTTCAGTCCACTGTATTTTCAAAATTT 
Sequence 1371 

CCGCGGTGGCGGCCGAGGTACTTCAAAGTTATTGCACATACACTTGTTTACTTTGTATGT 
TTTGCAGGATTAAACTTTGTATAATCTTTTTACAAAA I I I ) XI 1 1 CAGTATGCAAGCTT 
GCAAGATGAAAATAAAACCTGTTTGCCTGATAAAAAAAAAAAAAAAAAAAAAAAGTGCGG 
CCGGCCGCCCGGGCAGGTCTTGAGTCGACCCACGCGTCCGCCGGAGAnnnnnnnnnnnnn 
nnnnnnnnnnnTGATCTAGATCACGAGCAGCCGCACTTTN I I I 1 1 1 I I I I I I I I 1 1 I IGA
CGAATTCAGGAGTCCTTTATTAGCCGGCAGCCGANAGACAGCTAGCGCTCAAAATTNTCT 
TGGCCCCGAAGMGGAGCTAGATTTTCTTTTATAC 
Sequence 1372 

TTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACATAGCACATTACATGTCAA 

ATCCCTTCTCGCCCCCATGGATGACCCCCCTCAGATAGGGGTCCCTTGACCACCATCCTC 

CGTGAAATCAATATCCCGCACAAGAGTGCTAAANAAAAAAAAGNNANAGGNCTGCCTNTG 

TnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTNGNTATCMGCTTAN 

TCGATACCCGTTNACCTTGNGGGGGGGGCCCGNTANCCCANTTTTTT^ 

GGGGGGGGTAAAT TTNGCCC CNNNTNGGGGNNAAAAAAAANGGGNAAAAAA AI 1 1 I I I I I 

CCTNGNGGGNAAANTTTTTTATNTNNTTTTTMNANANTTTTNNCCAAAAAAATNTAAAN 

NGGNCCNGGGGGNNNNNTAAAAGNTAAAAAANNNNTNGGGGGGNCCNTTTNNGGGGGGGN 

CCNNNANNNNCCAATTNNNNNNGGGGGGNNCGNCNNTNTTNCC C I I M I 1 1 I AAAAAANN 

AAMAAANCNCNGCCCCCCCCNTATTTTTNAATTNAAANCTCCCCCCCCNNGGGGGGGGG 
G 

Sequence 1373 

CCGCGGTGGCGGCCGCCCGGGCAGGTTATAGACAATATGCTCCTTAAGGTCCCTTTCAGT 

CCCGTTCTATGGATCTGTGTAGTTTCGCTTCTTTTTTCAATATGCTCAGAATTAGGACAC 

CAATGTTAATGGAAGATAAGGAAACTATACCACCTATCCCTTATAGAAGATTTGTGCACT 

AACTAATATGAGCCCTGGAAGATCAAGCCAGTAGAAGATAGAAGATCTATCCCTGCTTTA 

TACTTTGGATCATTTATTTGTGMGATCACMCTTTCAAAGTTTTATTATTTCT^ 

TTCATGGAAGTTCGGGGAAATTAAGTGGATCTACTTCTAGTCTAAATAAGCTCAGTGTTC 

AGAGTTCAGGGAATCGCAGATCTCAGTCATCTTCCCTGTTGGATATGG 

Sequence 1374 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCG 

TCCGAAGAAGTGCTATGTGGCGAGAMAAAAAGTTTTAATGTATTGGAGMGTTTTAAAA 

AACCCAGAAAAATGCC I I I I I I I I I 1 1 I I I I GAACATAAACTCAAGATTTTATTGTCTTC 

ATAATAAAAGATGACACTTAGAACTGGATCACTTGGCCCTTCTCTTCTTATCTCCTCCCA 

GTTCAAMTGCTTGCATNTTTTMTAGCCAGCATTNTTTTMNATCTGCAGGTNGGGCTT 

AAACNCCNCTTAAAGCCTTTAAACACAAA I I I i I I I I 1 1 GGNGGNTTTTTAAGCCCTTTTT 

TCCCGGMAAAANTCGGNTTTNAATTTTGGNCCNANAAAANNACCCCTTTTG1 MM I IT 

NNNAANAAAAACCNCNTTTTTTNNTNNGGGGGGGAACCNTGTCCCGGGGGGGGGGNCGTT 

TTTAAAAAAAAAAGGGGGNNCCCCCCCGGGCNGNGGGGGAI M M I I I NTAAANNTTTTT 

TTNNNCCCCCCCCCCC 

Sequence 1375 

CCGCGGTGGCGGCCGAGGTACTTCAMGTTATTGCACATACACTTGTTTACTTTGTATGT 

TTTGCAGGATTMACTTTGTATMTCTTTTTACAAAAI M M M I I CAGTATGCAAGCTT 

GCAAGATGAAAATAAAACCTGTTTGCCTGATTAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

GTGCGGCCTCGAGCGGCCGCCCGGGCAGGTACTCTATAAGAGGTGTGGGTGTCTTGTTTG 

GTCAGGATGTTAGAMGTGCTGATMGTNNCATGATCAGTGTATNCCAAMGGTTTTTAG 

GMG TATGGCAMAANTGTTGTATTGGCTTTTATGGGGACATGATNNTAGTCNNCTTCCT 

TTTTAAANANGNNTTATNTTGTTCNANTGGNTTAANTGGTTTTTAAAAA 

Sequence 1376 

TACGACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGAC 

CCACGCGTCCGGCCACATTTTCAATTTAGCCATTTTTCTCTTATTCACCTTTTTCTGC^^ 

ATTACTCTGTAATTCCACTAAGAAAAGTCAATAGATAATTCCAATAATGACTTCACTCCT

GAGAATTTTATTAGCTGCTAACGCTTGTCTCATCATAAGCACTCATATGTTCATTGAGTA 

AATATTTATTGAGTATTTGCTATGGTCCAGGCACTGTGCTAAGTATTGAGGATAAAATGG 

TGATTGAMCATTTTCCCTTCTTGATTTTAACATCTACAAAATAAAA 

Sequence 1377 

CCGCGGTGGCGGCCGCCCGGGCAGGTAATCACACAACACTTTCTTTTCCAACTGCTGCAA 
AGTGCATCTACMTATGCTATTACAGATCCACTTTTAAMGGTTTCCTGTGACATTACAG

CAAGCCTCTTTTTTCAAACAGAGGAATMTCCCAAATTCTTCCTCAAATAAACTCCATTC 

CAGTAAATGGTAAATACATAAAAAATTACAGTAAGCCAGACACTTAAAAGGACAGCCAAG 

AAGTCTTCCAACAGTTTATTAGAAAGAATGTAGACATTTAAAAAAATCCCCACTGTCATG 

AACATAAATTGAGGTTTTCAGCCCGGGTATAAGCTGAATCAAAAAAAGGAAATAAAAAAT 

CCMTAGTGTATTAMCATTTTTCACTCATTTGCCATACTGACAGTGCAAATCAAATCTG 

GACTAA 

Sequence 1378 

CCACTGGATTTGACTCAGAGAGGACCCCCAGAGGGTGTCTCCATCTTCCCTATTTATTTT 

CAGCCCTTGAGGGCTTCATTGTAGATCAAAGCCAAGGCCCCCAGGAAGGNGACATACTCC 

TGGMGTTCACCTCCTGGTCCTTGTTCCGGTCCMGTCTTCCATCAGCCTTGCAATTTCA 

GCATCCTGCAGCTTCNAGCCAATGGTGAGCTCCTTCTGGATCAGCTCCTTCAGCTCCTTC 

TTGCTCAGGGTGTGCTTGTCACCCTCCCTGCCGGAGATACCTGCCCG 

Sequence 1379 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCCAGGGATGGGAGG 

CACAAGTTG TGATTGGG CAMGTTTATTTTCTATGTCAGCCTGTCAGTCCACTGCCCCAT 

TTTGCAAGAC I I 1 1 I I I I AGCCTTGACAAAATGTCTCAGTTAAGTATAAAAGTTTTTCCA 

CTACTTAGTCCAAAAAAAACTATTAAATCTTAATGAAATAGCCACTCTCAAAAAAAAAAA 

AAAMAAAAAGTGCGGCCGGCCGCCCGGGCAGGTCTTCGACCCACGCGTCCGTCTTTTTC 

TTCCCAAACATAGACTTGCAAGACATGGCCTGTATGAGAAGAAAAAGACCTNAAGAAAGC 

AACGAAAGGAACGCAAGAACAGAATGAAGAAAGTCAGGGGGA 

Sequence 1380 

CTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATAGCACATTACAGTCAAA 

TCCCTTCTCGCCCCCATGGATGACCCCCCTCAGATAGGGGTCCCTTGACCACCATCCTCC 

GTGAAATCAATATCCCG C ACAAG AGTGCTTAAAAAAAAAAAAAAAAAAAAAAAAAAAAA A 

AAMGTGCGGCCTNGAGCGGCCGCCCGGGCAGGTACATNTATTTTGGATTGTATATTGNG 

TTTGTGTATTTACGCTTTGATTCATAGTAACTTNTTATGGMTTGATTTGCATTGAACAC 

AAACTGTAAATAAAAAGAA 

Sequence 1381 

CGACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTAATCTGTCAA 

ATATACTATGAAATGCATAGTCTCCACTTAAAATGCTGAATGACACACACGTTTTGCAAG 

CATTACTGCTTTCCACAAAAACTGCTGAATAGGAGTTCCGTCCCTGCCAAGATCAGTGTT 

TAAGAGATACTTTATGATGCTGATMGTATTATTGGTGGTGGTGGTGTTCAGAAAGTTTG 

TCACTCATGCAGATGTCTGAAATCTTGTTCCGAATCCATGGAACATAGGGTGGAGGCCAG

CTCCCCCTTTTTTTAGATGATCACATAGTTCTGAGCAGAGATGTGGTCCTCACCCTGCAG 

TTCCTGCAGGAGCTGCTGCTGCTGGGATGGCTGCTGG 

Sequence 1382 

CCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCG 

TCCGATT TTTTT AMGTATTTTCTAGTCTTTTCTCTCTCTGTGGAATGGTGAAAGAGAGA 

TGCCGTGTTTTGAAAGTMGATGATGAAATGMTTTTTAATTCAAGAAACATTCAGAAAC 

ATAGGAATTAAMCTTA GAGA AATGATCTAATTTCCCTGTTCACACAMCTTTACACTTT 

AATCTGATGATTGGATATTTTATTTTAGTGAAACATCATCTTGTTAGCTAACTTTAAAAA 

ATGG ATGTAGAATGATTAAAGGTTGGTATGA I I I I I I I I I AATGTATCAGTTTGAACCTA 

GAATATTGAATTAAAATGCTGTCTCAGTATTTTAAAAGCAAAAAAGGAATGGAGGAAAAT 

TGCATCTTAGACCATTTTTATATGCAGTGTACCTGCCC 

Sequence 1383 

CGAGGGAGTCTATTGGAGCCCTTGGGAACTTCTGGATCAAATTGGACCTGAATTGAGATC 
TATTTCTCAGCTTTCACTTATGTGAGCCMTAAATTCCTTTTTTGTTGAAGGCAAAAAAA 
AAAAAAAAAAAAAAAAAAAGT 
Sequence 1384 

TACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCAC l I I » I I II I) I M I I I I I I I 
TTTTAACGTTTTAATTAAATGGATTTATTTAAAAAGACTATAAAATCTGACATCAAGAGA

GATAAAAAAAAAAGACCCATAAGATTTAAATTGACAAATGTAAAATGATTGGCTACAATG 

TAAMATACATTTNCCAGCCCCCAAACAAMCACAAGTATAGTMTTATAAAATTTTTGG 

ACCTGCCCG 

Sequence 1385 

AGGTCCTAGCTTGAGTCGACCCACGCGTCCGGCCGCTGTTCGTATTTCTTATTCTACAAC 

AAGGGTCAGCCTACAGGCAAAACACATCCCATTGTCATTTTTTTGTAAATAAAGGTTGTA 

TTNGGMCATGGCCACTCTCATTTGTTTTCTATTATTTATGGCTGCTTTCACTTACMCC 

TGAGTGGTTGCCACAGAAACTGTATGGCCTGCAAAGTCTAAAATATTTACTATGTAGCTT 

TTTCTTTTC 

Sequence 1386

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCGTC 

CGAGAAGAG TTTG CAAATGCAACAAMTATTTAATTACCGGTTG1TAAMCTGGTTTAGC 

ACAATTTATATTTTCCCTCTCTTGCCTTTCTT^ 

TTTTAAATGACATTTTTGA 

Sequence 1387 

CTACTATAG GGCGATTGGAGCTCCCCGCGGTGGCGGCCGCA C I II < » I I 1 1 1 1 I I i I I 1 1 
MINIMI GCCTTCAACAAAAAAGGAATTTATTGGCTCACATAAGTGAAAGCTGAGAAA 
TAGATCTCAATTCAGGTCCAATTTGATCCANAAGTTCCCAAGGGCTCCAATAGACTCCCT 
NTCACCCTGGTACCTGCCCG 
Sequence 1388 

TAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCGTCCGG 

GGAGGACCTAGGCAACGGCCTGAGACTCCGAGACTCTATGTTGAAGATGCCTGGACTAAC

CTACTGAAGATACCGTGGTTTTACCAACAGCCAGCACCAATAGGAAGATATGAATGAAGC 

CATCTGAGACCAGCCATCTGGCAGCCAAACTGCCAACTGACTGCAAATGCATGAATGATC 

CCACTGACACCACGTAGAGCACAAATGAGTTGCCTCCACTGAGCCCAGCCCAAATTGTTA 

TCCTATAAAATCATAAAAACATMACAGTTGTTTTAAGTCAAAAAAAAAAAAAAAAAAAA 

ATTAAGTGCGACCTGCCCGGNNCGGCCGGCCGCCCGGGCAGGTACCCATTAATTTGCTCA 

GATATAGCAGGCTTMTGGTTCTATATTTTCAAMGTTTTTAAGAATGGTT 

Sequence 1389 

TTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAATAAGCCCACC 

CCACTAGGMCTATGTTAAAAAAAAATTCAAGAAAGAATTTAAGGGAGATTACAGTGTTA 

CTGTGACACCAGGAAAACTTAGAACTTTGTGTGAAATAGACTGGCCAGCATTAGAGGTGG 

GTTGGCCATNANAAGGAAGCCTGGACAGGTCCCTTGTTTCAAAGGTATGACACAAGGTAA 

CCCNTMGCCMGGCACCCAGACCAGTTTNCATACATAGMAGTTACAGCTGCTTTTATA 

CCCCCTTGCCCCGCCAACGTAGTTAAGAGAACAGCAGCATAAGCGGCTGGCAGAGGCAAG 

GAMGACCAGTAGAGAGAAMAAAGGCCATCTATACCMTTNTMGTTAATTTAGACTAA 

ACAAGGTCTTMTAGCAMGGATMTTGMATCCCMACTTACAAGGTTTTTTAAC 

Sequence 1390 

CCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCCGGGTTTCATGAATGGAAACCTAAG 

TMMCTAAGCTCATTAGTGACAGACTTGTTTTCTTCTTGTTATTCCTCCAGCAACTCCC 

TCACCACCACGCCTCCCTGCCTACCATCCCCGGAAGGGTGCTTATTCTTTAACAAAGAGA 

ATCTAAAAAAAAAAAAAAGTGCGGCCGGCCGCCCGGGCAGGTGAGAAAACAGACCATATT 

TACTCACATAATTCGCCTTCTACCTTTCACCTGCTTATGTAATAATTTAACACTGTAGAG 

GGGACATGGAGGTGACCGGAGTATTTAGTGGGTTCCTTGCTCCTGGGCTGGGCAGGTTCA 

CAGGCCCCACAGGCCTTGGGCCCCAGCATC 

Sequence 1391 

CCGCGGTGGCGGCCGAGGTACTCTGAGGTCACGTCAAACTATGCTTTAGAACCTTCATCT 
TTTGCTTTCTTGGGCTTTACTTTCCMAATGGACTACAGGATMTGAGGCTTTTT^ 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGT 
Sequence 1392 
CCGCGGTGGC GGCCGCACCTGCCCGGGCGGCCGCTCGAGGCCGCAC I 1 I I I 1 1 I I I I 1 M
11 I I 1 I 1 I I I TT A AAA ATTCAAAA AATT AGTTT ATT AG CTTAATAT AATT AGGTCAATG G 
MTCCTGTTTTGATCTCAATACTTCCCATATTGCAATATATAAATGNGACAAATTCAGCT 
GTTTTGTGGCATAAATAAGTGTCTAAGCTGGGCAGTTAGTCTACCC 
Sequence 1393 

CTTAGGGCG AATTG GAGCTCCCCGCGGTGGCGGCCGAGGTCI It I M I I I I I I I I 1 1 I CT 

TTTGAAACAATTTTTCTGAMTTTATTTCTAAMGTCAGAGACAAAACTTTAGGAGTGAC 

ACATTTATACTAAGCATACATGCGTGAGCAAAAAAAAATAAGCACAGAATACAAAAATGA 

MTAGTAAAATTTTAATACAGTATTCTGAATACAAGTAGAATACCACTAGATAAGAATTG 

TATTTACCTAAGAAATCTATGATAGNGNGGGNGGAGATAAACCAGTTTAGGATAGCCACT

TCACTATTCACATTTTAATCAGTGCTGACCAGAAGCTAAAGCAA 

Sequence 1394 

ACTTAGGGCGATTGGAGCTCCCCGCGGNGGCGGCCGNGGTACAAATAAGCCCACCCCACT 

AGGAACTATGTTAAAAAAMATTCAAGAAAGAATTTAAGGGAGATTACAGTGTTACTGTG 

ACACCAGGAAMCTTAGAACTTTGTGTGAAATAGACTGGCCAGCATTAGAGGTGGGTTGG 

CCATCAGAAGGAAGCCTGNACAGGTCCCTTGTTTCAAAGGTATGACACANGGTAACCCGT 

ANGCCAAGGCACCCAGACCAGTTTCCATACATAGAAAGNTACAGCTGCTTTTATACCCCC 

TTGCCCCGCCAACGTAGTTAAGAGAACAGCAGCATAAGCGGCTGGCAGAGGCAAGGAAAG 

ACCAGTNGAGAGAAAAAAAMGGCCATCTATACCAATTCTAAGTTAATTTAGACTAAACA 

A 

Sequence 1395 

CCGGGCAGGT ACAAA AGGGTTCCTCTATATGCCMCTMTTCCAMTTTTTACTTTTACT 

GCMAAAMCCTTTTTGGCATCAAAACTCCATTGTTTCTCTGCACTCTGACACCATCATT 

TCAMGGGGCTCACATAAATGATCACTACTGCTCTCTCCCTAATTTTTGAAAAAGGAGTT 

TTGAGMTA AAACA GTGCTTTTATTATTAGCCAACACAAAGTGTGAGAAAATCATTCCTG 

AGAATTAACATTTTMGCTAACAGMATTCAGTATACTTAAMCATAATTATATTTAATG 

AGTCATTATTTGGATCTAAAACGGACGCGTGGGTCGAAGACCTCGGCCGCTCTAGAA 

Sequence 1396 

CCGGGCAGGTACCAGTTTGAGTTGAAACGGTATGTGACTTCCCCAGCTGCGCCCTGGGCA 

GTGACTGCATGCATCACTGAGAGGTCCTGTCTACAGCAGATAAAACTCCACAGATCACTC 

CTCCTGTAATCCCTCTAAGTGCTCCAAGGCAGCAGAAAGGCCCAGTGCATTGAGGCTGGA 

AGCAGGAGCAGAGACTCTGGGATATAGTGCGAAAGTCTCTTTCCCCTGTAGTTGGGCTAA 

TCTGGAAAA ACTCAM AACCTGGCCTGATTACCGAGGTTTCTTTTATGGATATTTAGTAT 

TTAGATAAAATTTTTACAGTATTCTTGAAATGAACCCAATTAAACACATAGT 

Sequence 1397 

AGGTACTTTAATCCCGTCTTACAGMGAGAAAACTGAGATTTAGCAACATMAAGTATTT 

CCCGTAAGTAAACAGTAGAGCCAAGATCTTGACCTACGCCATCTGATACCTGAGCCCATG 

CTATAAAAGAGGAGCATTAGAAATATTTGAAAGATAGAAATGAGAACTAGTCAATATTTA

TTTTGCTTAGCACTGTATTCAGTATTATGGCATCTTAAAGTAGTTAAGACTCAATATTTC 

ATCAAAAAAGTTTAAAATCTAATCAGAGAAT 

Sequence 1398 

AGGTTTGAGTCGACCCACGCGTCCGGATTGATAGCTCTTTCTCGATTCCGTGGGTGGTGG 

TGCATGGCCGTTCTTAGTTGGTGGAGCGATTTGTCTGGTTAATTCCGATAACGAACGAGA 

CTCTGGCATGCTAACTAGTTACGCGGACCTGCCCGGGCGGCCGGCCGCCCGGGCAGGTGC 

AAGATTCCTGATCGGTATACAGTGATGTATTTACTAAACAGAGACCTGTGCAGAAATTAC 

ATAC TATC CATCTAGATAGGTTGTTACACTTTTGCCTATTGATGGMTAGTTCCATTTAT 

CMGTTTTATACATCAAMAGCTTTTGAAGTTCACCAGACTGTCCAT 

Sequence 1399 

CCGGGCAGGTACTGTAAATCTACTGTAATCCTGTTTTGCAGAATACTGCACGACGGAGAT 

TGAGAAGTGAGAGCTCTTATGACATAGATAACATTGTGATTCCCATGTCATTAGTAGCCC 

CAGCTAAATTTAGAGAAACTCCAATATAAGGGAAATACTTACTCCCAGGGTATGGTATAC 
TTTACCATCTTCATAN I I I I I I CTTTCCCTTCCCTTCCCTTAAAAAAACTNAANTTTTTC
NMGGTGGAAGAANTTTTTAATTNAANTGGAAAGGGANGCTTCCCTTCTTCCCCAGTTCC 
CTTCTTAGCCNATGGGAGGGGGAAACCGGG 
Sequence 1400 

CCGCGGTGGCGGCCGATTCTCTGATTAGATTTTAAAC I Mil I GATGAAATATTGAGTCT 

TAACTACTTTAAGATGCCATAATACTGAATACAGTGCTAAGCAAAATAAATATTGACTAG 

TTCTCATTTCTATCTTTCAMTATTTCTAATGCTCCTCTTTTATAGCATGGGCTCAGGTA 

TCAGATGGCGTAGGTCMGATCTTGGCTCTACTGTTTACTTACGGGAAATACTTTTATGT 

TGCTAAATCTCAGNTTTCTCTTCTGTAAGACGGGATTAAAGTACCT 

Sequence 1401 

CCGGGCAGGTACCAGTTTGAGTTGAAACGGTATGTGACTTCCCCAGCTGCACCCTGGGCA 

GNGACTGCATGCATCACTGAGAGGTCCTGTCTACAGCAGATAAAACTCCACAGATCACTC 

CTCCTGTAATCCCTCTAAGTGCTCCAAGGCAGCAGAAAGGCCCAGTGCATTGAGGCTGGA 

AGCAGGAGCAGAGACTCTGGGATATAGNGCGAAAGTCTCTTTCCCCTGTAGTTGGGCTAA 

TCTGGAAAAACTCAAAAACCTGGCCTGATTACCGAGGTTTCTTTTATGGATATTTAGTAT 

TTAGATAAAATNTTTACAGTATTCTTGAAATA 

Sequence 1402 

AGGTACTCCCATTTCCCTGAAACAAGCAGCCAGCAACTATCTCAGAAATGTGTCA I I I I I 

ACTGGTTATAATTCTTAAAAAGCTTGTTTTCCTAAGATATGAAATGCCTGCCAGTATACA 

MCTGCTGTAACTACTTCCCTTTTTGCTTTTAGCGGGGAAAAAATAGCTTAATGACAGCA 

TAGAATCATGTAGTAMTATAATTCATTTTTTGAMGGTTCAGCTATATCCTCTTCCATT 

TGTTTATTTTAAATGATCTAATTGCAMCATGTCATCACTCCCTTGATGTTTACCTNCTT 

GTTATGCA I I I I I AGCAGGCTTTATTGTCACC 

Sequence 1403 

AGGTCCTAGCTTGAGTCGACCCACGCGTCCGATTTTTGCCTCCAGACTACAGATCAGAAA 

ACTGAGACTCAGAATGTTTCAATTCCTTGTTTAAGATCACAAAACTAGTTTGAGGTATAA 

TGGAAACTGAAAAAAAAAAAAAAAAAGTGCGGCCGGCCGCAC I f I I I I I II I I I I I I I I I 

CMTATTATTTATCAAAATAAATTTATTAAAAGTATTCAAAGACCACTTCAAAGNGTAGC 

TGCCTTCAAGACAGATTTTTGGCACTCATMCGGACACTGCAGTTTTCAACACCATAGCA 

CTCATTCTATTTCACACATCATTTTTAACAA 

Sequence 1404 

AGGTGTTAGTTACCACTTCATTACTGGAGGGCACTGTCACAAACTTCTGACTATCCAGAC 

TTGAAGCTGGAAGCAAATACAAGTCTGAGGGGCTCTAAGCTGGGAGGTTCTGGCCTCTCC 

CTAGCTCTCTATGGCTCTACCTCTCTGCTTGAAGCTCCCTGCACTGCACTCCCATTACTC 

TGACTGGGGATAGGACCACTGCTGACAGGGCCCCACCTTCAACTTCTTTCATTGCTCCTC 

TTTTCAGGAAATCCCCACCCTGGGGATACTTCAAAAGACCT 

Sequence 1405 

AGGTGATTCAGCAGGTCTGGGGTGGGACTGAGAGCTTGCATCTCTAACAAGCTCCCAGCG 

AGGCTGATCCTGTTGCTCCAGGGACCACACCTTGAGAACCACTGGTTGGGCATTGATGAG 

GTCAACCAGGAGAAGCAGTGTCCCCTAGAACTGGCAGGAGAGAAAGGACAAGGCTAAGAA 

ACAGTGAACAGGAGTCAAGTAAATGCAGCTGCCAACAGGCGGGGGTCCTTGAGTTCACAT 

TCTTGGTTCCAGGTGACGTTTCCTGGGAGTCAACAACCCTTCTCCTATGAAAAAGAAAAG 

GGCCAGACACAGTGGCACACGGCTGTAACC 

Sequence 1406 

ATAGGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATGTCCCTAMTATTTAAC 

TGTTACTTGTAMCTTGTGTAATTTATTATTTATTTTMTCMAATTCTGAATATTTC 

TTAAATGAAAGTTGCAAAAAAAAAAAAAAAAAAAAGTGCGGCCCGGCCGCCCGGGCAGGT 

ACATACCTCCTTGCACAMTGGAGGGGAATTCATTTTCATCACTGGGAGTGTCCTTAGTG 

TATAAAAACCATGCTGGTATATGGCTTCAAGTTGTAAAAATGAAAGTGACTTTAAAAGAA 

AATAGGGGATGGTCCAGGATCTCCACTGATAAGACTG1 MM I AAGT AACTTT AAG G AC 

Sequence 1407 
AG GTACTCA ATCTMTCCAAMTTTTCTTTCTTAGCMTCTATTTTCTGTATTTAGAAAA 

ATGI Mil I ATTTCAMGAGCCTCTCAMGAGCATTTCACGTATCTTTTACTGT7TTTCT 

CTCCACCTCCAAGGGGTCTGTCTAGATCAGTGCGGACGCGTGGGTCGAAGACCTGCCCGG 

GCGGCCGGCCGCCCGGGCAGGTACTGAGGACAAATCAGTTCTCTGTGACCAGACATGAGA 

AGGTTGCCAATGGGCTGTTGGGCGACCAAGGCCTTCCCGGAGTCTTCGTCCTCTATGAGC 

TCTCGCCCATGATGGTGAAGCTGACGGAGAAGCACAGGTCCTTCACCCACTT 

Sequence 1408 

AGGTACATATCACACATTTCCAMTTTGAGACCACTMTGTTTTTTAATTTCAAATATGT 

ATATMATATGTATTCTTATTTCCAATTATTTCCTTGGCATGAATTCCTAGAAATTGATC 

TATTTAGTATAAGTGCTTTTTTAGCTATATGTCCACTAGTATGGTATGAGAATGCCCTGT 

TTATGCCAGTATTATCATCATTGMTATATTACTGCTGATGTTGTGGTAATACATTTAAA 

CCMTGTGATGGGGCAAAAAAATTATATTTTTACTTACATCTTTAAAATTACTGGNGATC 

TCTG NTATTG AC AAGCTGGGCATANAAAAAGTAAATTAATAG AATT 

Sequence 1409 

CCGGGCAGGTCGGACGCGTGGGTCGAAGCTTGTACAAAAACCCAAGTATCACCTGAATTA 

CAATTATTCTTAAAATTTGTCCTTAMTAGCTTACTCTTGGMGATTTGTTTCTATGTAG 

ACATTATGGTAAMGTT ACTC TGAMCTCTTTTCTTTAGTTATCTGTTTATTCTGAGCTC 

AACAAGATTGAAGTAAGT7TTCGGGAGCTACAGAAATTAAATCAAGAAAAGAATAATAGA 

GGATTATATTCAATTGAAGTGCTGGAGCTCTTCTGATATTATCAATTCTCCTTCATAGAC 

ATTTTATAAAGCTCTTTTATGTGAACTCTTGCTTCATCCAGGCAAG 

Sequence 1410 

AGGT CTTCGACCCACGCGTCCGTTTTAGATCCAAATAATGACTCATTAAATATAATTATG 

TTTTMGTATACTGMTTTCTGTTAGCTTAAMTGTTMTTCTCAGGMTGATTTTCTCA 

CACTTTGTGTTGGCTAATMTAAMGCACTGTTTTATTCTCAAAACTCCTTTTTCAAAAA 

TTAGGGAGAGAGCAGTAGTGATCATTTTATGTGAGCCCCTTTGAAATGATGGTGTCAGAG 

NGCAGAGAANCAATGGG AGTTTTGATGCCAAAAAGG I I I I I I I I GCAGTNAAAGTAAAAA 

TTTGGMTTAGTTGGCATTATAGAGGAACCCTTTTTTGTACCTGGCCCGGGCGGCC 

Sequence 1411 

AGGTGATTCAGCAGGTCTGGGGTGGGACTGAGAGCTTGCATCTCTAACAAGCTCCCAGCG 

AGGCTGATCCTGTTGCTCCAGGGACCACACCTTGAGAACCACTGGTTGGGCATTGATGAG 

GTCAACCAGGAGAAGCAGTGTCCCCTAGAACTGGCAGGAGAGAAAGGACAAGGCTAAGAA 

ACAGTGAACAGGAGTCAAGTAAATGCAGCTGCCAACAGGCGGGGGTCCTTGAGTTCACAT 

TCTTGGTTCCAGGTGACGTTTCCTGGGAGTCAACAACCCTTCTCCTATGAAAAAGAAA 

Sequence 1412 

CCGGGCAGGTGCCTAATATATTTACTCTCTGGTCCTTTACAGGAAAAGTTTGCCAACCTC 

TGGCTTAGATGATCACCTGAGGCCAAGGAGCCTCGCCCTTGAGCACAAGACTATGTAGTC 

AGTAAAGCACAAACAAAATTGGGGCTTTCCCTAGCAAGGTTGGAAAGGCGGAGAAGAAAT 

GGATTTGGATAGGTAGTCMCAATGTCTGTTTTATGTTACCACACATTTTCTCGAGAAAT 

TTCMTCAGCTCTCTGAGMCAGATTCATCTTTAAATGAATGTTCATAGGTAACAGCAAC 

TCATGCATCAATGTTGCAAAGTGAGCTCATTTTCACATTGCTTCAG 

Sequence 1413 

AGGTCAAGCTTCGNTCCACGCGTCCGGGAAAAACGGGGTTACTAGTAGCCGCCCATAGCC 
TGCAACCTTTGCACTCCACTGTGCAATGCTGGCCCTGCACGCTGGGGGCTGTTNGCCCCT 
GGCCCCCCTTTGGTTCCTGGCCCCTTAMNMCAGGCNGGTTTTATTAAACCCCAANNNN 
CCCGGNTTANAAGGGGAATTNAAAAAAGGGCCCCGGCTTTNGNAAAAAAAAAA 
Sequence 1414 

NCNGNCCAGGTCTACTCAAGTAGTCTTTACCCCCTACTCAAGTAGGGGGTAAAGNGTAGA 

ACANGGAGTTTTGATCTGTGTTCAACATGATTGCGAACCATCAATTGAGATAACTCACTA 

CCTTCAGGCCAGCCAGNTACATACTTTTGAAAAGCCAAGAGTGAAGCANGGTTGATNTTC 

ATCCMTTCTTGNNCTTTTTGTTAAAGGCANNAATAAGANAGGGTGGNTNCGGGCAATCA 

CTTAGCTAA 
Sequence 1415 

AGGT CTTCGACCCACGCGTCCGTTTTAGATCCAAATAATGACTCATTAAATATAATTATG 

TTTTMGTATACTGAATTTCTG1TAGCTTMMTGTTAATTCTCAGGAATGATTTTCTCA 

CACTTTGTGTTGGCTAATMTAAMGCACTGTTTTATTCTCAAMCTCCTTTTTCAAAM 

TTAGGGAGAGAGCAGTA GTGA TCATTTATGTGAGCCCCTTTGAAATGATGGTGTCAGAGT 

GCAGAGAMCMTGGAGTTTTGATGCCAAAAAGGTTTTTTTGCAGTAAAAGTAAAMTTT 

GGAATTAGTTGGCATATAGAGGAACCCTTTTGTACCTGCCCGGGCGG 

Sequence 1416 

AGGTGTACAAGCTTCGACCCACGCGTCCGGGATGAGTTTGTATGTGTAAAGTGCTTGAAA 

CAGTGCCTGCCACATACTMGTGTTGGATAAGTGTTTGTTATTAAAAAAAAAAAAAAAAA 

AAGTGCNGGCCGGCCGCCCGGGCAGGTCAGATGATTGCAGAATTTATGTGATTACTGGGT 

ACTCTMTGGTAAGGAGAMTTMGACCAGCTAGTTGTTAATCTTMCTTTTAGTCATTA 

AGGAGAATTTCCAAGACAAAACTGCAATCCAGCTGCTTACCTAGGAATACGGCTTAGGCT 

GAAAACTTCTATCGTCTTAGAAATGGGA 

Sequence 1417 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGTAATCCTGTGAGAAAG 

ACAGGACAGAAACCACTGTGCCTATTTTACAGATACGAAAACTGAGGCACAGGTAAAGGG 

GCTTGTCTGTAGTCCCATAGCTAGCAGATGGCTGGAGCCAAGACTGAGGCTCGTTCTTCA 

ATGCTGAGCCAGGGCTCCTTCCGCTGCACCACAAGAACGCTAGACCACTCGCCACCAGCC 

TTCTCATTCCCTCTTCCTCCATTCTAATCATTTCTAGCTGGCTGGCCTCCACAGAGCATA 

GGAAAACAGCCAGGGCCGGGCACGGTGGCTCATGCCTGTAATCTCAACACTCTGGGAGGC 

CGAGCCGGGTGGATCACCTGAGGTCAGGAATTCGAGACCAGCCTGGCCAACATGTTAAAA 

CCCCATCTCTACTAAAAATATAAAAATTAGCCAGGCATGGTGGCGCACACCTGTAATCCC 

AGCTACTCAAGAGGCTGAGGCAGGAGAATTGCTTAAATCTGGGAGGCGGAAGTT 

Sequence 1418 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCCGGNCCGAGGNNCAAAAGAGAGACAAAAGG 

GTTCTCTTGGAAACAAGAAGAGTGACTCCAGATGTGGCCTGAATAATTGCCATGTTAAGT 

TAATGCAAAAGATCAGAACAGGGCTACATTTGCACAGGCAGTTTCTCTCCGGGCCGTAGT 

TTTCACTGATGATCACCTTTCACAGCATTTTCCCCAACCAGCATTTCACTTAGTCTTCTC 

TATACCCAGCACCTCCCCCGGCACCCCCGGCAAGCCCACTATCACTTCCGACTTCCAACG 

TGGCATCCGTGAGATCTGTCCACATTAGGCGAAGCAGGAGAACACTGAGAGCAGCAGGAT 

GGGTTTGGAAAGAGCATGCCTCTGGAAACACAGCTTCCTGGGAATTCACATGAGGCCAGT 

CCT ACAG AGAGCAAGATGCACCCCAGGATTTCTTCATTTTCTAATAGATGTGGGAGTGCT 

CCATTTTCCCCGACAGCGAATTTCCCCTGAGAAACGATACTAGACCCTGGGTTTGCCCAC 

CTTGTAACTCTTCCTTATCTNCTCCTTTTCATCCCTAATTCA 

Sequence 1419 

CCGCGGTGGCGGCCGCCCGGGCAGGTACATCACCCTGCTGAGGGACATCCAGGACAAGGT 

CACCACACTCTACAGAGGCAGTCAACTACATGACACATTCCGCTTCTGCCTGGTCACCAA 

CTTGACGATGGACTCCGTGTTGGTCACTGTCMGGCATTGTTCTCCTCCAATTTGGACCC 

CAGCCTGGTGGAGCAAGTCTTTCTAGATAAGACCCTGAATGCCTCATTCCATTGGCTGGG 

CTCCACCTACCAGTTGGTGGACATCCATGTGACAGAAATGGAGTCATCAGTTTATCAACC 

AACAAGCAGCTCCAGCACCCAGCACTTCTACCTGAATTTCACCATCACCAACCTACCATA 

TTCCCAGGACAAAGCCCAGCCAGGCACCACCAATTACCAGAGGAACAAAAGGAATATTGA 

GGATGCGCTCAACCAACTCTTCCGAAACAGCAGCATCAAGAGTTATT 

Sequence 1420 

CCGCGGTGGCGGCCGAGGTACACTGTAMTAGCCTTTACCAAACGTGTTTGACAAGGACC 

ATMTTAACATCACTTAGTGAATTGTGATAAAGAAAAAAAAGCCATGATTTATTCGATGT 

GATTGGCTTGTTTTTATGTGGCGCCAAGAACGAACCTGTTTAGCAGCTGTAACCAATGGT 

ACGCGGGGGAGGCGAACAATGGCGGAGCTGGGCGAAGCCGATGAAGCGGAGTTGCAGCGC 

CTGGTGGCCGCCGAGCAGCAGAAGGCGCAGTTTACTGCACAGGTGCATCACTTCATGGAG 

TTATGTTGGGATAAATGTGTGGAGAAGCCAGGGAATCGCCTAGACTCTCGCACTGAAAAT 
TGTCTCTCCAGCTGTGTAGACCGCTTCATTGACACCACTCTTGCCATCACCAGTCGGTTT 

GCCCAGATTGTACCTGCCCCGGCCGCTCTAGAACTA 

Sequence 1421 

CCCCGCGGTGGCGGCCGAGGTACTTTGGGAGACCACCCCCAGCTATGGTTCCATACACTT 

ANACTGCGCCCAGCTACAGNTTNATACACTTNGGACAAANTATCTGATAAAATAGAGAAA 

AAAATCTTATTTACTATAGCATTACATAATAAATTTNTGAGAAAAAAATTAACCAGGGAT 

GTAAA AAACCTTTACAATAAAAAATAAATAAAAAAGGAAGATCCAAATAAATTTTAAAAT 

ATTTTATGTCTTTGGATTGAAAGAATAAATATTAATAAAGTGCCATATTATCCAAAGTGA 

TCTATAGATTCAATACACTTCCTATCAAMTTGCAGTATTTTTTTCACAGTAATGGAAAT 

TCAATTCTAAAATTTACATGAAACTAAAATAAACTTTGAATAGCCAAAACAGTCTTGAGG 

AAAAGGAACAAGGCAGAAGAATATCATACTTACAATTTCAATCTATATTTGAAGACTTTA 

TAGAANTAAAA 

Sequence 1422 

CGGGCAGGTACGATGGGAGGACAGCTTTGTAGAAAGGACATTATCCAGCTAATAGCAAAC 

TTTGTGGATCCCAATCCGAGATTTCCCTTGCTGAAAGACAAGAAAGTATCTCATATAAAA 

GTGCTGTAGCAAGTATTTGTATACTCCAGAAATAAGCTTCTGTAATTCTTAGCTGCCAAT 

GTGTTCAGGCGTGATGACTCGGTTTCTGTTTCTCTGAACATCAATACTAGGGTCTGTATA 

ATTTCAATGCATGCCACCAGCTTCATCAACCCTT 

Sequence 1423 

AGGTACAATCAGAATGCTGCATTCTCCAGCCATAAAGATCGCTCCCTCTTCTTTTCAAAC 

ATCCCTGTCCCTCAAGGTCTAGCTCAAGACGGTCACCTTAAGAAAAGCTCCCTTTGTCGA 

GCAGTGACTCCATACCAGGCCCTGCTTTAMCGCTTTATCTGCATTATCTTACTTGATTC 

TCGCAATAGCCCTGGGTGGTAGGTGCAATTATTATCTCCAGTTTATAAAAGAAGATACTG 

AGGGTCAGAGAAGTTAAGTGACCGGCTCAAGGTGTCACATTCAGTAAGCGTTGAAGGGGC 

CTGTGTTGGTCTGTCCTTGAAGATGCCCCCTACCGACTACACTTTCAATGATTTTCTGCC 

TTGAACCTGGCCCCATGACTAAA 

Sequence 1424 

NNCAAACCTCCTATGCTTTCCTTGGCATCGGCTACACATCATAGTATTCATTGCCTCCTT 

GAGGTCATCTTGCAGCTTGGACAGAAACTCATTTACTGACCGGCTCAGCTCATTCTCTGC 

CATTCGTTTCATCTCATACTCCTTTCGCTTTTCAGCATTGCTGACAATGTCCCAAGCTGC 

TCGCAAAACCTtGAAGGCCTCCTCAGCCCGGGGATGATGATTTTTGTCAGGATGAACCAT 

CACTGCCAGCTGTCTATAG 

Sequence 1425 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTTGAACTGATTTCCCC 

AAATGGCTCTTTGTG CAGCC GCAGCAGCTGTTCAGACTTATGAGCAGACAGGAACTGCCA 

GAGGGCAATGGTGTGTTTTAACTGGCATCTGTTTAAGGCCTTTAACACGTGAATCGTCTG 

ATCACCCATTTGCAGGATGTCTTGAGTATACACATTCAGCTGCATGTTTGGATCCCCACC 

AGCTGTGCTCAGAAACCCCAGAGTGACTTCTACGACAGACAGCACTTCACAGGCATCGCT 

GTAGGACTGCAGCTGTCCACTGATGGCACTAATGACCGAGCTGGGGAGGGAGTCCTGGGA 

AATGAAAAGCAGGAGAGGGATGTCTGTGGGCTGGGTTTCTGGCATCTCACCACCTGGTAA 

GAGAGCCGAGCCCCTTCACTGCCCAAGCCACATGCG 

Sequence 1426 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTGGGCCAAGGCTGCAAT 

CAGTGATTCAGCCGACTGCTCTTTGAGTCCAGATGTTGATCCAGTTCTTGCTTTTCAACG 

AGAAGGATTTGGACGTCAGAGTATGTCAGAAAAACGCACAAAGCAATTTTCAGATGCCAG 

TCAATTGGATTTCGTTAAAACACGAAAATCAAAAAGCATGGATTTAGGTATAGCTGACGA 

GACTAAACTCAATACAGTGGATGACCAGAAAGCAGGTTCTCCCAGCAGAGATGTGGGTCC 

TTCCCTGGGTCTGAAGAAGTCAAGCTCGTTGGAGAGTCTGCAGACCGCAGTTGCCGAGGT 

GACTTTGAATGGGGATATTCCTTTCCATCGTCCA 

Sequence 1427 

AATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACATCACCCTGCTGAGGGACATCCAGGAC 
AAGGTCACCACACTCTACAAAGGCAGTCAACTACATGACACATTCCGCTTCTGCCTGGTC 

ACCMCTTGACGATGGACTCCGTGTTGGTCACTGTCAAGGCATTGTTCTCCTCCAATTTG 

GACCCCAGCCTGGTGGAGCAAGTCTTTCTAGATAAGACCCTGAATGCCTCATTCCATTGG 

CTGGGCTCCACCTACCAGTTGGTGGACATCCATGTGACAGAAATGGAGTCATCAGTTTAT 

CAACCAACAAGCAGCTCCAGCACCCAGCACTTCTACCTGAATTTCACCATCACCAACCTA 

CCATATTCCCAGGACAAAGCCCAGCCAGGCACCACCAATTACCAGAGGAACAAAAGGAAT 

ATTGAGGATGCGCTCMCCMCTCTTCCGAAACAGCAGCATCAAGAGTTATTTTTCTGAC 

TGTCAAGTTTCAACATTCAGGGTCTGTCCCCAACAGGCACCAC 

Sequence 1428 

AGGTACAAATAGATACCTTCAAGGAGAATGAAAACGGGGAATATACTGAGCACTTACACT 

CGGCCAGCTGCCAGATCAAAGTTTTCAAGCCCAAAGGTGCAGACAGAAAGCAAAAAACGG 

ATAGGGAAAAAATGGAGAAACGAACACCTCATGAAAAGGAGAAATATCAGCCTTCCTATG 

AGACAACCATACTCACAGAGGTAMMGATTTCTTTTGGTGACMTTTCAGTTCATMTT 

TTTMTCTTAAAAATTCATCACTTCCAMCTGGTCAGMTTTACTTCTCCTAAGCCTTGA 

GGGACACAGTATCACATGGATTCTGTGTCCAGCGGCCTTAACAGGAAGATTGCTTTAGAA 

TTTGGCACGAACCATGCCACTGTCTCTGT 

Sequence 1429 

NCNGNCCAGGTACTCNNNNACANTGNAAACTNNTCANGNGCCCATCATTGCTGGATTTGT 

ATTTAACATTATGTTTCACCCAGACAACAGCTCAGAGAACTGGGCAATGGCTGCTNATGT 

GTTGAGCCGGGGCATACAGGATGAAGAGGGACAATGAGAGGGAATGAATTCTATTCTANA 

CACCCTGAGTTTGAGGAACCTATGGAAATGTCCAGGAGGCAACTAAATGAAACAGCCTGT 

GGTAGACAGAATAATGGCCCCAAAGATGTCTACAGCCTAATCCCAGGAGCCTGTGAAAAT 

GTTCCCTTCGCATGGTAAAGGGATGTGGCAGATATGATTAAGCTAAGGATCTTGAGATGG 

AGAGTTTATCCAGGATTATCCAGGTGTGCCCAGTATAAT 

Sequence 1430 

AGGTACGCGGGACACAGGGTCCTGTGCAACANGGNGGACTAACAGTAACACCGCCACGCC 
GGCAGCAMGCTCATTTTGGTCCCCGCCCCGTTCCTCTTTCTCI I I I I AACTCCTTCCCT 
C1XTGCGGATTCTAGMCGGAACCTTTTTTTAATTCTTCCCAGTAGAAACGTAGGAACAA 
TTTCGTGAACGCAATCNGGAGTGCCCAACATGGC 
Sequence 1431 

AGGTACCCCTGTTTAACAAGGGGTAGGGGCCTTCTGAGACTGTTTCCTCTACAGAGTAAG 

GGTTCGTTCAGCCTTTTCCGTGGCCTGCCAAGAACTCAACTCCATGTTCCCTCACTTCCT 

GTAATTGACCTTGTCCAGGACTTTCTGACCTTGGAGMTTCACCTTTGCTCTTTCTGCTG 

CTTCGTGCATTCTTCCACCAAATGTCTTAACTGACTGGGCTCCTTTCCAAACTCAAGGGC 

TTTGCCAAATGCCACCAGCTCAGGGAGGCCTTTNCTGGCCATGACACTTGAAGTTGCAAC 

ACTCCCCCGCAGTCTCCCGTGCCCCAGATGTAAGTTCCATGAGGGCAAGCCCTGTGCTTT 

TACCACCATATCCCCAGCATCTTGAGCTGTGCCTGGCCCAAGAAATATTTGTTGAATGAA 

TGAATTTAAAAGGGGATATTCATGANGGCTTACACATTCTCAATGGGTT 

Sequence 1432 

G GGCNAATTGGAGCT CCCCGCGGTGGCGGCCGCCCGGCAGGTACl I I I I I U I ICTTTTT 
CI I I I I I I I I I I I I I I IAAGTTTTTGGGATCGTGTCTCACTCCTGTTGCGCATGCCTGTA 
GTCCCAGCTACTCAGGAGGCTGAGGCAGGATAATTGCTTGAACCCGGGAGGTGGAGGTTG 
CAGTGAGCCGAGATCATGCCACTGCACTCCAGCCTGGGCAACAGAGTGAGACTTTGTCTT 
CGGAAAAAAAAAAAAAAAAGATTTGGCGGATGAAAATAACCAGAATGAAAATAGCTNGAA 
AACTCANCAAGCAGGAAGCTCCCCTTCTCACCCTTTTGTTCCCTTGCCGATAGAATCAGT 
CACTATTAGAAAAAATGAAAGACGCTCTGTTTAAAACAATGATGACAGCAGTACCT 
Sequence 1433 

GCGG TGGCGGCC GAGGTACTTCCCTTTTAAGAGATGAGTCACCGCAACTGAAACTTCTCT 
ATTTCTTTTCTTTTCTGATTGTTCTCCAGAATTAGGACTAGTAACAGTCCTGAANNCTTG 
TNTTTCCTTATCTAGAMACTCAGTATCTTCCCTTCCGTTTGTCTTAAAATATTAGTACA 
CGCTTTCTCAAGCCTAGCCGATTAGAAGGGGCTGCCGGGCTTCCACCACACCTCATCGAG 
GNAATGGTTTTNTGGNNAAAAAGCCCATGGAAATACTGAGCCCATGCCNCTCACGTTGNA 

AAAGCCCCGTTCCTTGCC 

Sequence 1434 

AGCTCCCCGCGGTGGCGTAACTTATCTCATTTTAGATNAGTTTGCAAAGAGAGTTGGTGG 

CTMGGCCATAGCTTAGCCTCCTGACCCCTACCTTCCCACGTTCTTTCCAAGAGATTCTC 

CTCAGGMTMCACTTGCAAGGGAGTTCCTGATGAAGTGGATTCTTGTTATTCTAGGAAT 

AGGCCTACATGGTGCACTGGCAATGTGAGATTATACCTCAGCATTTTCAAAGAGCATAAA 

MTC TAGAGC TGGGGGGTTTAAAACATGACAAACCTAATTTTAAGTAGGCAGACAAATAT 

TTAMTTTTTCCCCTACCCTTGTTTCTACATCGGTCCATTCAGACTCTGCACCATCTGGT 

TGGGCAGGTGCTACTGTGGMGATCTTCGTTTTCGACTACCATTTGGTGATTCTTGCTTT 

MAGTCTCMTATCAGTAAACTGAACAGATTNCCACCACCCCTTGTTTTATAAATATCAC 

CCTTMTTAGTTTAAGTTTCAATCTCCCCATCGGAGGCTAGTTCCTGGTGGGTCAGCATG 

TACCTGCCCNGGCNGGCCGTCTANAACTAAGTGGATCCCC 

Sequence 1435 

TACTTAGGGCGAATTG GAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTTNTTTTT 
T'T I M I I I I M I I I I I IGGAGCCAAAATTGTGTGCATTCCTACTGGGAAACACAGTGGCC 
MATCCTTTTGMTTGTTTCCTTCTAGAGACTTTAACTCTTCTGACTGCAAATCTTAGTG 
TCCTGTGAGTATTAGTTGATTAATTATACTTGCTGCTTAGTGAAATACAGCCAGCTATAG 
GTATCTTCTGGAGTAGCTCMCACAACTTTTCTCTTGCTAGAGTGACTCTTGCTAACAGA 
ACCCAAAGATGCGCACATATACCCACAGGAGCTGGAGGTCCCTCGCATGCTCCTCTCGTG 
CCAGCCTTTGCCTTACCCTTCACTCTCTCCCTCCAGGAGCCGTCGGTACCTCGG 
Sequence 1436 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAGATTTGTCTTGCTC 

ACCTCTTACCAAATCCAAGAATGGTTTCTATCCAGTGAATGGCAAATTCATCTGATAGGT 

MGGGA ATMTG GGTCAAAATGGTAGCAMCACTTCTTTCAAATTTCTACTAAAAGACTT 

GCTGTTGTTTTTCTTATAAAGGGGCAATTTCATCMCATACATCTTTTAAAGGAATCTCT 

AGAMTTTGAGTGACTTTTTGGCCATAATCCTGTTTGATATATTTTTGGTCAGCTGCTCA 

MACMACATTCTCCTTGTAAGGTTATCTATCTGAAAGATACTAATTCATTTAAAGCAGC 

TGCAGGTGAACAACCTAAAGATGACATGATTTGGGAGAAGAGGAAGGCAGATTACTGAAC 

TGACAAGTGACCCAAAGCATAATTAGGTTTGTGCACATGGTAGCATGGAGGTTCCACACC 

TACCTTCTACAGCGTATTAMATAAGAATATTGTCTTTGAAACATCTTCTAGCACCTTTT 

TAATAAAACAAAATTTCCCATCTTCMTTCTATTTTTTTCCCAMTCTACCTTTAAAAAA 

TTGT 

Sequence 1437 

CCGCGGTGGCGGCCGAGGTACAATAAACAGGGAATGAGAACTATTTACATGGAAGTTTCT 

TTCTCATGATGCGGTGGAGMGCCTCGGCCACTTGGTTCTGCCAGATGTTCCTGGGGTTA 

CTGTAAATGGGMGGACAGGCAGAGCTMACAAGGTAGGAGMTCGCCCCCCTTTTTTGA 

ATGTTTMAGAGTTTGCTGCAGTATGCTGCATTCCATGTGTGCTGCTTACGGGAGCCAGG 

GAMCTGGGATTCCACTAATTCAATTGTMTACTTGCGGGGGACCCTGGAGTTTTACGTA 

ACATTTTGATTTGGGAAAAAAAAAAAAANAAAAANANTGTTCCTGCCCGGGCG 

Sequence 1438 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGCCAGGC 

ATGCTCCTGCCTTGGGCAGAGTGTATATGTGTGAGTGCACCTGCCCTTCCCACAGCCTAG 

MCGTTCCTCCTCCAGACAGCACATATGGCCTGCTCTCTCACTTCCTTAAGGTCTTTATT 

CAAAAGTGACTTTCTCAGTGAAGCCCTGTCTGCTCACCCTGCGTAAAATTTCAGCTCTTC 

TTTCTATCTCTCTTCCCAGAT1 Mill I CTCCTTCATGTTCGTTGGTGTCTAAGGTTTAT 

CATCTATTTCGCTAATGGTCAGTAGAATGTMCCTCCACGTMGCAAGGAGTTTTGTCTG 

TTTTGTTCATGTCTATGTCCTTAGTGCCTGGAGCATTCCCTAGTATGCAGTAGGTGCTCA 

ATAAATGTCAGTTGGATTAATGGCTGAAAGAAAGGTCACCGCTATAAGGATGGAGTCAGA 

GAACAAACACAGTTMTTCCTGGTCCACTGTTTTTGCTTCCACTAMTTGTATTTGGTCT 

ACGGCTTCTCCGCTTGCCCTGGAACCCTGCTCAGAACACTGCTCCCTTCTCCTTCTTCTT 
CTCCCTCCGGATAAATTCT 
Sequence 1439 

CCGGGCAGGTACCGCGGGGGGCCGTGGTCAGAGCGAGCTTCGGAGAAGCAGTGGTGGGTT 

CCATGTGATGG TGGA GTAGGAGGCAGGTCTCCGCGGTTCATCTGTGTTGCTCTAAATGAC 

ACTGTTTCATTATTTTGATGGCtGGAGAATATTTCCTAAGTGTATGTATATGAAGAAGTT 

TCTTGATCTCTTTATCTGTGGATGAACAGCTACTTTGAAACATATGGTACCTCTGTGGTC 

AGACCATTTGCCAAGCTTGTGAGGCCTCCTGTTCAGGTATACGGTATTGAAGGTCGCTAT 

GCCACAGCTCTTTATTCTGCTGCATCAAAACAGAATAAGCTGGAGCAAGTAGAAAAGGAA 

GTTGTTGAGAAGTAGCACAAATCCTGAAGGAACCCAAAGTGGCTGCTTCTGTTTTGAATC 

CC 

Sequence 1440 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCANGGTACGCGGGTGCT 

TTCATAGCCGCGGACACAACTTGGGCCCACAGTTAACCGAGGAGGAAGGCAGAGCGTGCT 

GAGCAGAGCACCAAGGAAGAGAGCTCGGCTAGCCGGGMGGTCCGAATGGATTTATTTGG 

TGAGGCCAAGGAACCCACTGCCTCCACGGTGTTCCTCCAGGAGGCTCCTCCCTTCACTAA 

GGCAGCAAGGAGTGCACTGGTGAGGGGAAGAATTGTTGTCATCCTTTCTCTACGACTCCA 

AGAGAAACTTTATACTGGAGGMGMTATTCTACCACTTTGGGATGCTTCCAAAGAAATG 

GGATACCAAAGGAGTGAACTTCAGGTCATTGGAAGTGGCCAAGCTGGAGGCGACCTATGG 

AGACATGACCTTCAGTTCTTCCTGAGATCGATAAAGCAAAAAAAAAAAAAAAAAAAAAGT 

ACCT 

Sequence 1441 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGATATTGTCAATGCGCAACATGGAGAG 

ACTTTAAAACAAATGCTAGGGATTAGAGTATAGATCAGATAGCTGGCAAATCTATAGGAA 

GGGAA AAGTAATTTTAAACACACAGCATTTGTTTCCTGCTGCTCTATCACAATAGCTAGG 

I ll l l AAATAAGTAGGCTTTATACCAAGCCATAAAAATGAATTGCTGGGGCTCTTTGGGA 

CTAGGGAAGGCGGGAAATTTTAGATATTGCTGTTGGCTTAGTGAAAATGCATGCTTACCC 

GGTCACCTGTGGCTCCAGCAGGACCAGGGGCACCTACAGCACCAGGAGCACCCTAGTACC 

T 

Sequence 1442 

ATA GGGCG AATTGGAGCTCCCCGCGGTGGCGGCCCGCCCGGGCAGGTACCTTATTACATA 

TGAI i I I I ATTAGTTTCTG G AG G CAMTTTAATTTTTATTTTAAAATCAAAATCTATTTT 

AAAAGAAATAGTTCTCAAAAAGACAACGATGACTGGGTGTGGTGGTGTGTGCCTGTAGTT 

CAGGCTGCTCGGGAGACAGAGGCAGGAGAACCACTTGAGGCCAGTTCAGTCTAGCCTGGG 

TAACATAGCAGGACCCTGTCCCTTAAATAATAAAAAATTTAA 

Sequence 1443 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTAGGGTGCTCCTGGTGCTGTAGGTGCCCCTGG 

TCCTGCTGGAGCCACAGGTGACCGGGTAAGCATGCATTTTCACTAAGCCAACAGCAATAT 

CTAMATTTCCCGCCTTCCCTAGTCCCAMGAGCCCCAGCAATTCATTTTTATGGCTTGG 

TATAAAGCCTACTTATTTAAAAACCTAGCTATTGTGATAGAGCAGCAGGMACAAATGCT 

GTGTGTTTAAAATT ACTTT TCCCTTCCTATAGATTTGCCAGCTATCTGATCTATACTCTA 

ATCCCTAGCATTTGTTTTAAAGTCTCTCCATGTTGCGCATTAACAATATCCTAATGCACT 

GAGGCTTCTCAAAGCCTTCAATTATTACCAAAAAAAAAAAAANNTTTNNNAGGTACCT 

Sequence 1444 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAAGACCAAT 

TCCTTCCT MCCT GGATTCCACTGTCCTTGGTGAAAACTACTTTGATGGAACCTACCAGA 

TGCTTTATCTTTTGGTTAAAGGAACTATACCTGTNGAAATTCACACTGCCACAGNGATAT 

TTGTTTCTTTCCAATTATNTGTTGCMCANAAGATGACTTTTATACCTCTCACAATCTGG 

NTAAAAATCTTGCCTTGTTCCTAAAGATACCAAGTGACAAAATCCGTATCAGCAAAATAA 

GAGGGAAGAGTCTGAGGAGGAAGAGATCCATGGGATTCATAATTGAAATAGAGATTGGAG 

ACCCTCCTATTCAGTTCATAAGCAATGGCACCACAGGTCAGATGCAGTTATCTGAACTCC 

AGGAANTTGCTGGTTCTCTTGGACAAGCTGTNATTTTAGGAAA 



Sequence 1445 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGAGAGTCACTCCTGCC 

TTCACCATGAAGTCCAGCGGCCTCTTCCCCTTCCTGGTGCTGCTTGCCCTGGGAACTCTG 

GCACCTTGGGCTGTGGAAGGCTCTGGAAAGTGTAAGTTGGAGTCACTCTGGTCTAATCTG 

GGCTGCAGGGTCAGAGGTGGGGTCTCCTTGTGGTGTGGGTGTGTCCCCTTCTGTAGGCTC 

TGATCCCTCAGCTTAGTTTCGGGAGACCTCCCTGAGGGTGGAATACATGTCTGGCTGGGC 

TCCMGGTTTGTGTGGCAGTTTGAGCTTCTGGAAATGCTTCCTCTATGCAGCCATGCTGT 

CAGCCCAGGTCCCACTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCATACTNTGCCT 

TCTTTCTTTCACCTGTCTGCGACTCTTCAAAAAAAAAAAAAAAAAAA 

Sequence 1446 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTCTTAATCCAAGACAA 

MGM TAGCTCCTTTCAGAGTTCTCATCATTTCTTCACCCCAGCCTTAATGGGTATATTC 

CTTTTCCCCAGGTGTCCAGAGTTTTCCAGAATTGTCCCAAGGCTCAGTCCACACCAGTTC 

TTCTCCAGTGTGCTCTCCTGAGAGGCCAGGCACACTCAACAATTATCTAGATGAGTTCCC 

ACTTCTTTCCAAGTGAAGGTTCTGCTATCCTAGCATAGTCAGAATAAGTTAAATTATGTC 

TCTTMGAGGCACTGTTCCACTCCTTTTCAAGGNGTGTGGCATATTTGAAATATGTGACT 

TAAAAGTCTACAGTCTCTTACCAAAAGCCTTGGGCCAAATACATGTCAAAATTCAGAATT 

TTNCAGATTTTAGAAAAGTGACCCATATACCATACATTGCA 

Sequence 1447 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATCACCCTGCTGAGGGACA 

TCCAGGACAAGGTCACCACACTCTACAAAGGCAGTCAACTACATGACACATTCCGCTTCT 

GCCTGGTCACCAACTTGACGATGGACTCCGTGTTGGTCACTGTCAAGGCATTGTTCTCCT 

CCAATTTGGACCCCAGCCTGGTGGAGCAAGTCTTTCTAGATAAGACCCTGAATGCCTCAT 

TCCATTGGCTGGGCTCCACCTACCAGTTGGTGGACATCCATGTGACAGAAATGGAGTCAT 

CAGTTTATCAACCAACAAGCAGCTCCAGCACCCAGCACTTCTACCTGAATTTCACCATCA 

CCAACCTACCATATTCCCAGGACAAAGCCCAGCCAGGCACCACCAATTACCAGAGGAACA 

AAAGGAATATTGAGGATGCGCTCAACCAACTCTTCCGAAACAGCAGCATCAAGAGTTATT 

TTTCTGAC 

Sequence 1448 

TAGGGCGMTTGGA GCTC CCCGCGGTGGCGGCCGAGGTACAAATTGTCGTTTTTATTCCT 
CTTATTGGGATATCATTTTAAAAACTTTATTGGGTTTTTATTG 

ACCCTACAAAGAGCCTTCCTATTCCCCTCGCTGTTGGAGCAAACCATTATACCTTACTTC 

CAGC AAGCA AA GTGC TTTGACTTCTTGCTTCAGTCATCAGCCAGCAAGAGGGAACAAAAC 

TGTTCTTTTGCATTTTGCCGCTGAGATATGGCATTGCACTGCTTATATGCCAAGCTAATT 

TATAGCAAGATATTGATCAAATATAGAAAGTTGATATTCAACCTCACAAGGGCTCTCAAA 

GTATAATCTTTC TATAG CCAACTGCTAATGCAAATTAMACATATTTCATTTTAACATGA 

TTTCAAAATCAGTTTTTCATACTACCCTTTGCTGGAAGAAACTAAAAATATAGCAAATGC 

AGAACCACAAACAATTCGAATGGGGTAGAMCATTGTMATATTTACTCTTTGCAAACCC 

TGGGNGGTATTTTATTTTGGCTTCATTTCMTCATTGMGTATATTCTTATTGGAAATGT 

ACCTGCCCGGCCG 

Sequence 1449 

CCGCGGTGGCGGCCGCCGGGCAGGTACATCACCCTGCTGAGGGACATCCAGGACAAGGTC 

ACCACACTCTACAAAGGCAQTCAACTACATGACACATTCCGCTTCTGCCTGGTCACCAAC 

TTGACGATGGACTCCGTGTTGGTCACTGTCAAGGCATTGTTCTCCTCCAATTTGGACCCC 

AGCCTGGTGGAGCAAGTCTTTCTAGATAAGACCCTGAATGCCTCATTCCATTGGCTGGGC 

TCCACCTACCAGTTGGTGGACATCCATGTGACAGAAATGGAGTCATCAGTTTATCAACCA 

ACAAGCAGCTCCAGCACCCAGCACTTCTACCTGAATTTCACCATCACCAACCTACCATAT 

TCCCAGGACAAAGCCCAGCCAGGCACCACCAATTACCAGAGGAACAAAAGGAATATTGAG 

GATGCGCTCAACCAACTCTTCCGAAACAGCAGCATCAAGAGTTATT 

Sequence 1450 

CCGCGGTGGCGGCCGAGGTACAMTTGNCGTTTTTATTCCTCTTATTGGGATATCATTTT 
MMACTTTATTGGGTTTTTATTGTTGNTGNNTGATCCCTAACCCTACAAAGAGCCTTCC 

TATTCCCCTCGCTGTTGGAGCAAACCATTATACCTTACTTCCAGCMGCAAAGTGCTTTG 

ACTTCTTGCTTCAGTCATCAGCCAGCMGAGGGAACAAAACTGTTCTTTTGCATTTTGCC 

GCTGAGATATGGCATTGCACTGCTTATATGCCAAGCTAATTTATAGCAAGATATTGATCA 

AATATAGAAAGTTGATATTCAACCTCACAAGGGCTCTCAAAGTATAATCTTTCTATAGCC 

AACTGCTAATGCAMTTAAMCATATTTCATTTTMCATGATTTCAAMTCAGTTTTTCA 

TACTACCCTTTGCTGGAAGAAACTAAAAATATAGCAAATGCAGAACCACAAACAATTCGA 

ATGGGGTAGAAACATTGTAMTATTTACTCTXTGCAMCCCTGGNGGTATTTTATTTTGG 

CTTCATTTCAATCATTGNAGTATATTCTTAT 

Sequence 1451 

CCCCGCGGTGGCGGCCGNGGNACMATTGTCGNTNNTATTCCTCTTATTGGGATATCATN 

TTAAAAACTTTATTGGGTTNTTATTGTTGNTGTGGGNTCCCTAACCCTACAAAGAGCCTT 

CCTATTCCCCTCGCTGNTGGAGCAAACCATTATACCTTACTTCCAGCAAGCAAAGTGCTT 

TGACTNCTTGCTTCAGTCATCAGCCAGCAAGAGGGMCAAAACTGGTCTTTNNCATTTTG 

CCNCTGNGATATGNCATTGCACTGCTTATATGCCAAGCTAATTTATAGCAAGATATTGAN 

CAAATATNGAAAGTTGNTATTCAACCTCACANGGGCTCTCAAAGTATAATCTTTCTATAG 

CCAACTGCTMTGCAMTTAAAACATATTTCATTNTMCATGATTTCAAAATCAGATTTT 

CATACTACCCTTTGCTGGAAGAAACTAAAAATAT 

Sequence 1452 

CCCCGCGGTGGCGGCCGAGGTACAAATTGTCGTTTTTATTGCTCTTATTGGGATATCATT 

TTAAAAACTTTATTGGGTTTrTATTGTTGNTGTNGGGNCCNTMCCCTACAMGAGCCTT 

CCTATTCCCCTCGCTGTTGGAGCAAACCATTATACCTTACTTCCAGCAAGCAAAGTGCTT 

TGACTTCTTGCTTCANTCATCAGCCAGCAAGAGGGAACAAMCTGTTCTTTTGCATTTTG 

CCGCTGAGATATGGCATTGCACTGCTTATATGCCAAGCTAATTTATAGCAAGATATTGAT 

CAAATATAGAAAGTTGATATTCAACCTCACANGGGCTCTCAAAGTATAATCTTTCTATAG 

CCAACTGCTAATGCAAATTAMACATATTTCATTTTMCATGATTTCAAAATCAGTTTTT 

CATACTACCCTTTGCTGGAAGAAA 

Sequence 1453 

GAANCCCCCTTTNGACGAANANNCGCGAATCGNGAGCTCCACCGNGGNGGCGGGCCCGAG 

GGGGACNANGANTTTTCTTGNNCN I I I I I I I I NNAAAAACNGGNGACTATTTAATCCATC 

TAAAAATACAAATCAGGNAANGGGGGGAACCATAGGAAAATCCTCCACCTNTAACAAGAG 

CCGAAGNTACNGGGCTTTCTGCTTGCTCCAAANAAATCCCAAAGGGCTTGGATAGTTTGN 

GGAANGGGGAATTATCTGTGTCTTCAAACTAACTCCCAAGGATACCTCAAAGGACATTAA 

AGGTNTACCACCACCATTCTGGGGGAAGAAAAAGGGGGGTTTCTTGCCCCTTGCTTGAAA 

AGCCT TANAAAT NGGGGGAAGCCTCCAAATNGCCNTTNGGGGGNGNAAAAAGGGGNNCCC 

TNTNAATTTTTTNAAAAAAAMAAAATTGGGTTCCAAAAAANAACCCCCCTTTNGAAAGG 

GANCAAANGGGGGGGGGGGCCCCCTNCCCCNAAGGCNAAGGGGGGGGGCCCNNTTNGGGG 

GGGCCNCNTGGGGGGNNGGNNCCCCAAAAACCCCCCTAAACAAAANNTNGGGGGCTGGGC 

CCNTAANGAGGA GGGCCCCN CCAGAGCCCAAAANTTNGGGGTTTNNACCCCANNNAAAAT 

TTTGGGGGGGTN I I I I I I I I I ACCCCCNTTTNNGNNCCCCNCCCCCCTTGGGGGGGAAAA 

AAAACCCTTNAAAACCCCNTTNTTTNAAAACCCCCCCGGGGGGGGG 

Sequence 1454 

CTACTTAGGG CGAATTGGAGCTCCCCGCGGTGGCGGCCGGGCAGGTACTTTN I I I I I I II 

I 1 1 I I I I I I I I ATACCAGATTTTAAGGAAAGACTGCTTGCTTCTGANAAAGAATTCTCGG 

AGTTGATTCTCTGCTCCATTTGCTCCTTTCTCAACCTCTTTAGTCCTCGTTCTCTGCAGf 

AAGAAGCTAGTGACAACTGGAATTTAGCTCCCAGNGGGCTTTCTTCCGGGTGGNGCCTGG 

ACAGGCTGCTCCTGCTGCTAAGGCTTCTGGAGCTGTTATTGAAGATGTCAGCTTCTGCCA 

CTGNGGTCTGCTTTTTGGAGTCTGCATTGGTTTTGCCTCGATCTCTATCATTCTTCTCAT 

TATTTTCATGAATGAA 

Sequence 1455 

CGAGGTACTGACCTCGTNTGTCCCTTCCCCTNCACCGNTCCCCACAGCTTTGCACCCCTT 
TCCTCCCCATACANACACNNNCCATTTTATTNTTTGGGCCATTACCCCATACCCCTTATT 

GCTGCCAAAACCACATTGGGGGCTTGGGGGGGGGCCAAGGGGCCTTGGGCATGGGACCAA 

GGACC ACCTCCCC CTACCCATATCCCTCCCCGTGTTGTGGGGTTTNGGGGAAAAAAACCT 

TTTTGGI I I I I I I I GGGGGGGGTC I I I I I I I I I CCTCGGAAATTAAANAAAAAAAGGATT 

NCTTACCTACAAGAGAAANAAAGAAAAAAAGGGGTACCCTTGGCCCCGGGGGCCGGGGCC 

CGGCTTTCTTAAGGAACCTAAGGTGGGGAATTCCCCCCCCGGGGGCCTTGCCAAGGGAAA 

TTTCCGNATTATTCNAMGGCCTTTAATCCGGAATNACCCCGGTNCGNACCCTTCGGAGG 

GGGGGGGGGGCCCCCGGGGTAACCCCCAAGC I I I I I IGGTTTCCCC 

Sequence 1456 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGAGATAAGACCC 

TGAATGCCTCATTCCATTGGCTGGGCTCCACCTACCAGTTGGTGGACATCCATGTGACAG 

AAATGGAGTCATCAG7TTATCAACCAACAAGCAGCTCCAGCACCCAGCACTTCTACCTGA 

ATTTCACCATCACCAACCTACCATATTCCCGGGACAAAGCCCAGCCAGGCACCACCAATT 

ACCAGAGGA ACAAAA GGAATATTGAGGATGCGCTCAACCAACTCTTCCGAAACAGCAGCA 

TCMGAGTTATTTTTCTGACTGTCAAGTTTCAACATTCAGGTCTGTCCCCAACAGGCACC 

ACACCGGGGTGGACTCCCTGTGTAACTTCTCGCCACTGGCTCGGAGAGTAGACAGAGTTG 

CCATCTATGAGGAATTTCTGCGGATGACCCGGAATGGGTACCTGCCCGGGCCGGCCGCTT 

CGGCTTTAGAACTAGTN 

Sequence 1457 

GGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAGTCCAAATTTTAAGGAAAATGA 

GTCCCGCAATGAGTTTCCTCATGCTTCGCCTGTGCGTGGACCGGNCAGCTTCTGGGTGTG 

ACTGGAGCAGGGCTTGTCTCCTTCTTCAGAATCACTTTGCAGGGGTTGGCAAAGCCGCTC 

CCATCCACGTACTCTCTGGACACMTMTTTTGGCCTATTGCCATCAMTGCCCATTTTC 

CACTGCTGGAAGCAATGTCAAAAAAGGGCTGGCCCAAAAAAAGACCCAGAGCTGTCAATA 

CMCACTGGAGACAGATGCAACTGAATMACCCTGTTTTACCCAATTGCACTATTTGGTC 
CT 

Sequence 1458 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGCCCGGCAGGTACTGGAACAGGGATAA 

GTTCTTGGATAAGGNGCCAACATACCTATAAAAGCTGATTTTTGAGTAAATTATTGATTC 

TAACATATGTAATGGATTTGGTGTGATAATTTTCTGATCTTTMCTATMGTGACTTTT^ 

ATTCTCCACCAGAAAAGATAAATGACTGAGAATGTAAGTCTGCGCTCTGATTAACACAAT 

GGAGAAACGGAAAAACTATCTCTGTTAAAAACTGATTCCTGTCATTCTTCTGATATCAAA 

TAAGAGGAAGGAAMTMACTTTTTGTGTGTAGATAGAAAAACATACCTGAGGCCAGGTG 

CAGTGGATCACGCCTGTAATCCCAGCACTTTGGGAGGCCAAGGCGGGCAGATCAGCTGAG 

GTCAGGAGTTCGAGACCAGCCTGGCCAACATGGNGAAATCACGTCTCTACTAAAAATACA 

A 

Sequence 1459 

NGGGCGAATTGGAGCTCCACCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGCACTC 

AGGGAGCTCAGATTTTGAGACAGTAGCTGGCCGATGCTCCCAGCTGAATAAAGCCCTTCC 

TTCTACAAAAAAAGAAAAAGAAAAAAGAAAACAGGATATCTGAAATTAAGACTGCAGGAT 

GGAGTAGTTTTCTGAAAATGACAGGGTCCAAGGTGTGACNCACCGGGACCAAGTGGCTGA 

ACTGGAATGAAGTTAAGAAGCCAGTAAGAAAACATCNCGGATAATATGGTGGATCAGTTC 

MCAGNAATGACATTATTTCACCCATGGTCCCCAAAGGGAGGGAGATGACTGGAGNATTT 

CAMTTCTTCMGGCMGCCCTCMTGCCAGCCCAGAGGATTTTAANGAGGGGCCTATTG 

TTGTTCCCAGAMGGAGGACTCTGNGGCCAAMCCGCCAAGAATGGGATTTCAAGAATTT 

ACTTCAA AATT CCTGTCAGGATTTCTTTAACCCTGGGTGGGGCTTATACCCAAAACCCCA 

MMAACTTTTAAGCCAGCCTTCNTACTTTTTGGCTTMTTTTTCTCCCTAAAGCCCC^ 

CCCTTGGGCN I I I I 1 1 I 

Sequence 1460 

ATTGGAGCTCCCCGCGGTGGCGGCCGCCGGGGCAGGTACTCAGAAATCCTTCCCCGAATT 
TACAGCACAGGCAGGATGACCTAAGAGGCAGTTTACTTCCCTGAGACCCACAGTTGGGCT 
GTTCTGGAAACACATCTGTGAATCATAGCCAATTGCCACAGAGAAAACAGAACCAAGCCT 

CCGGTGAGGCCACTCCACCCCAGAGAAGTCTGCAGAATTCCAAGGACTCGGATTGGATGT 

TCAGAATTCAGCAACTGGAAAGTCCTTAAAAACAAACAGGCCAAACCAAATCAATATTGC 

TGTTTCTAGATGTCCCTTCTGTGGTTGAGCTAGTTTTACAGAGATAAATATATTAAGACA 

AGGAGGTGGGGGTGTT ATATGATCMTGATAGCCATTTTGAAAGAGAGGGAGGAGTACTT 

I I I 1 1 I I I I I I I I I I 1 1 I I CCAAGCACGTGCCACTTTATTGAATGACACTGTAGACAGGT 

GTGTGGGTATAAACTGCTGTATCTAGGGGCAGGACCAAGGGGGCAGGG 

Sequence 1461 

AGGTACGCGGGGCTCAAGAATAAGCTGAAATATGGCCAGACTATCAGGCCCATTTGTCTC 

CCCTGCACCGAGGGAACAACTCGAGCCTTTGAGGGCTTCCTCCAANCCTACCCACTTTGG 

CCCAGTCANACCAAAAAAGGGGAAAGNAGGCCTGGCTTCCCCTTGNCAACAAGGGNATTA 

ATTCCMAAAGNCCTTCCTGGTTTTTTGGNTGGTNCCTTGAAGGGNAAGGGNAGGAAAAA 

AAAAGNCCTTGNMCCTTCGGGGAAGGGGNAGGGTTCCTTAACAATTCAAAAGGAAATTG 

GGGGGGGGAATAAAAGGAAAAANGGGGCCAAGNCCTTGGTTTGGAAGGAAGGNAGGTATG 

GCTTCAAATTAATGGCCCCCCAANGGCTTAATTGAACCAAAAAAGGGTNCAAAAGGGGAA 

CMTTCTTCAANAAGGGGTGGGGTTCCACCCCCCCNTTCGGGGTTTTCCCC i I I I l GGTT 

ACCCCTTGGCCCCCGGNNNCCGGGCCCNGCCTTCCTAAGAAANCNTAGGGTTGGGGNANT 

NCCCCCCCCNGGG 

Sequence 1462 

GCGGGCAGGTACATGGATGGGAGCAGCTTCACCAACCCCTGCAAAGTGACTCTGAAGAAG 

ACGACAAGCCCTGCTCCAGTCACACCCGGNAAGNCTGACTGNNTNCCACCGCNACAGNCT 

GAA AGGCATT GAGGGNAAAACCTCMTTCNTCGGGGNACCTAMTTTTTT(XCCTTTAAA 

AANAI I I I I I AAGTMCCTTTGGCCAACCAAGGTAAAGGGGGAACTTTTCCAAAACNTNG 

AACCCCTTTCCCTCCAAGNATTTGNAAGGAAACNTGTTNTTTCCCAGGTTMTTAATTAC 

CNATTTCCAAAAGGTTCCAACCTTTNGGAANGGGGTTAAGGGGGAAACCAANAAAAAATT 

TTGGCTTAACCAAGGTTCCCTNMTTAAAAATTTTTAAAANGGGGGG I I i I I I I I I I I I A 

TATAAAAAAGGGNGNTGGTNTNANNCCCCTTTTNGGNGGTCCCCCC I I I I 1 1 1 I I I I NAA 

GAAAAANCCNTAAGGGGNGGGGGAAATTCCCCCCCCCCCCGGGGGNNCCNTTGGGCCAAA 

NGGGAAAAATTTTTCCNGNATTTAATTTCCAAA 

Sequence 1463 

AGGTACGCGGGGAGGCATTGAGGCAGCCAGCGCAGGGGCTTCTGCTGAGGGGGCAGGCGG 

AGCTTGAGGAAACCCGCAGATAAGTTTrTTTCTCTTTTGNAMGANTCCCNCNTTTAAAA 

TMCCAAAACCTTACCTTT AAAAA AMAATTMTTANGTTCCAAAATTAANGGGNTTTTA 

ACCTTNAAA AGGNMTTAATTTTTNGGCCTTTTTAAANNCCGGGTTTTTAA 

I I I l I I I T I AAAAACCC GGTTTAAAAA I I I I I I I 1 1 I AAAAATTAAAGGCCCTTTTTAAA 

AGN NAA'I I I I I I 11 I 1 I AAAGGNGANGGAAAAAAAAA I 1 I I 1 I I I I I GGNANAAGTGAAA 

ANCTTTTTTTrsTTTAMAAMTN^^ 

NMATGG GNGG GGTNNNANANAAGGAATTTAMAMNTGGGNTNTTTTTNTTTAAAAAAA 
M ACCAA ATTTTATCCCTTGNTNAGGGGTTTTTTGGGNAAGNNAATATGGAAAAAANAAN 
CCI I I I I'NCTT 
Sequence 1464 

TTTGAGAAGCCAGCGCTCACCCACCCGGGGTCTCTGTGCATTGACCTTTGGGTGCTGACT 

TGGAGAAAAGCACMACACGACCAGTCCCATCCTGGGCTCCCGNNGGCCGNCCTTTCTTT 

CCTMTNCTTACCGCCATTTTGGTATTCCGGACCTGGCCMTTTTAGGTTTNGGGACCTT 

AAAAGNAATTGGMTTGGAACCTTCCMGT1TTTTAAAAAAGGGGGAAGGGGAAGNAACC 

NMANATTNNCCTTNGAACCCTTGGTTCCTTAAAAGNCCAAANGGAAATTGGGGCCCCCC 

AANAGNCCCTTGNNNGCCAAAAAGGNAAAAAAAAAAAAGNNCCNAACCCAACCTTTGGNN 

CN AATT TAACCCNAATTTAAAGGGGGGAAATTACCAAGGAAAAGGGGGGNCCAAGGAAGG 

CCTTTT CTTGGGCC CCTTGNGCCGGGGGGAATTTCCTTNGGCCAAAAACCCAA I I I I I I A 

ACCCAN I I I I I I I IGGGI I I I I I I I I GGGCCCCCTTTGGCCCAANAAAACCCTTAACCCC 

CCCCGGGCGGGTTNACCCCCTTTCGGGGGGGCCCCNGGCCTTTCCTTAAGGNAAAACCTT 
AAAGGNNTTGGGGGGNAATTTCCCCCCCCCCCCGG 
Sequence 1465 

AGGTACTTTATTTTTC I I I I I 1 1 I I II I I 1 1 GTGGATGGGGACTTGTGAATTTTTCTAAA 

GG TGCTAT TTAACATGGGAGGAGAGCCGTGTGCCGGNATCCAGCCCAAGCCCNCCCGNTT 

CACTTTTTCNCACACNCTTTISTTTT^ 

MGNGGNCCCCTTTNTTTGGNCATTACATTAGCAGCGGMMCANCCTTTTANTTNCTTT 

CCCCNTTNATTGGNGMNANMCCTCCCATTTTNTTTTNTTCAAANAAGGAGCATTNGNC 

AAANANCTNCCCAAATNTNNCCTTNNACCACNGNGGGNCNNTANNCCACNNTNCTANCTT 

NAGG NGNNGCCATTTGGGTTNCNCCCTTTGNNCCNGGMNTTCTNCCCTTTTCCNTTNGN 

TTTTNTCCCCNCNGGGNNGTTTTTTTCCMGNGAAGNGANCCCANAATCCTTTTTTNNNN 

CNCNAANAAAAANGTTNAANCCNAAAAAAAAGNGCCCAAGAG I I I I M I I I I I I I I CCCC 

CCCCCC 

Sequence 1466 

AGGTACACTGAAACATAAATCCGCAAGTCACCACACATACAACACCCGGCAGGAAAAAAC 

AAAAACAGCAAGTTTACATGATCCCTGTAACAGGCCATGGNNCTNCAANNCTTCAGGAAT 

GCCTTNCCNTCNCATTCTGGCCCAAAGNTGGTTGTTTNCNTGGGAATNACCAGGNAGGCC 

ACCMTTCGGTGGGGCCTNTCCTGGGGGGNGGTTCAACCAACCTNCAAGNCTTTTAAGGG 

CTTGGTTGGGGGTTCCCCACCANGAGGCCACCTTNCAATTCTTGGGCTTGGGGGACCTAA 

TTGNGGNTGGGGTTGGGNTTGGGACCTTCCTTAACCTTCCAAAAGTAAGNCAANAAGCNT 

GTTTM CCCMGCCAACAATTTNCAAAAAACCMGGTTGGTAATTTGGNAAACCANTCCT 

mm INAAAAATNATTCCAAAAAAGTTNGAAGGAAAAANCCANAGGAAANGGGNCAAAC 

CATTAAANTTAAAATNGGTTTAATCCAAGNAAAAAAG 

Sequence 1467 

CCGGGCAGGTAC I I I I I I I ) I I I I I I 1 1 I I ATAACTGAAGCTTTATCTGGAGTGGGGGAA 

TGGGGGTGTGGTCAGTTGGGGCACCCAAAGACAAGCCATGCCNCCNNCCCNGGAAANGCC 

GCNCAGAGGG TTCCC TTGGGCAATNTGGTATNTACTGGGGTATCCTTCTTNTGCGGTTNC 

TTTCGGGNCMTTTTTCCGATNNCCACCTTTNNCCTTACAAAGGGNCCCCAANNNTTGNC 

CTTTCTCCANCNCCCCAAAAGGTTNNGTGCCCCTTTTNNNCCCCGNAANNNGGAATTGTT 

NAAGNCCTTNGN AAGNG GTTATTGGGGAGGACCATTTTCCTTTCCTTTNCCCCCCCCCCA 

AGGAAAAAAAAAATTTTCCNGTGTTNAAGGNGGTAATGNNAAGTAACCTCCCCCCAACCA 

MTTGGAAAGCaGGNTTAAAGAANTGGNTTTTTTNACCTTTTTTCCTTA 

CAMNNNCCTTTTTTCCAMGGGGGGGGCCCTTTTTCCTTTCCT^ 

TTNGGAAATTNGGGGGGGGGGGCCCTTGGGMTTTGGAAAAAANCCGGNTTNGGCCCGGN 

CCTTTGGGTCTTTGGTTTTCCCCAAAAAACCCCCTTNGGGCCAAACCCCCCT 

Sequence 1468 

AGGTACGCGGGTGGTGAAAAAAGAAGTAGAAATCGTGGCCCACCTCCCTCTTGGGGTCGT 

CGCCCTCGAGATGATTATCCGTAGGAGGGAGTTCCTCCACCTCCNNCGCCACAATCTTCC 

C MAGAAA GGGGAGMAGTCTTTTCTTCCTTCGGCAGNCNCGNGTAGTCNAGNGGTTCCC 

CCTTTTTTCCTTAGGNAGGCATTTAAGGGGNAAGTAAAGGAAGGNAGGAAGNAATTCCGG 

CC TTGG TTCNTTCGGGGGNAGTAAGNAAAAATTCCACCANAGTCCCGGTCCCCGGAATTC 

CCTTTTCNTTCTTAAGGGGGTTCCTTCCGGTTAAGGTTCCGNMTTCCTTMGGGGTTNC 

AAMTGGAAAAAGNGAAAAATTMGGAMGNMCCMGGTTTTTTGGCCAAAGGAAGAAA 

AGGTTGGGGTGGTTMCCCCTTGGCCCCCCGGGGGGCNGGGCCCCGGCTTTCCTTAAGAA 

AACCTTAGGGTNGGGAAATCCCCCCCCCCGGGGGGCCTTGGC 

Sequence 1469 

CCGGGCAGGTAC I I I I I I I II I I I I I I II I ATAACTGAAGCTTTATCTGGAGTGGGGGAA 

TGGGGGTGTGGTCAGTTGGGGCACCCAAAGNACAACTCATGCCTCCTNCNNNGAAAGGNC 

GGCCC AAGGG TCCCTGGGCCMTTTGGTTTTCNTGGGATTTCTTCTTTTCGGNTACATCG 

GGNCMTTTTCCGNTACACCTNCNCCTNCNAAGGGNCCCNAGGTNTGGCTTTCNCNNCGC 

GCMNAGNT NGTN CCCTTTTCACCCGGAATGGMTGG1TMGACCTTGNMGNGGATTTT 

GGGNGGACCTTTTNCATTANCTTACCCCCCCCAANGTAAAAAAATTTCCGGTNNTAAGGG
GNMGAAMGGMCCCNCCCAACAATTGMGGNGGTTAAGGTTGGGTATTTTCCTTTTCC
TTTCCCCCTTTCCCCAMGGCCNTTTTTCNMG^ 

MCCTT7TTGGNMTTGGGGGGGGGGNCCTTGGGMTTGGGAAAAAACCGGTTTNCCCGG 

CCTTTNTCTTTGNGTTCNCCAAAAACCCCCTTGGGCCAAACC 

Sequence 1470 

AGGTACGCGGGTGGTGAAAAAAGAAGTAGAAATCGTGGCCCACCTCCCTCTTGGGGTCGT 

CGCCCTCGAGATGATTATCCGTAGGAGGGAGTNCCTCCACCTCCNCCNCCNCNATTCTTC 

CCAAAGAAAGNGAAGAAAGNCCTTTCCTTCTTCGGCAAGNCCCNGNAGGCCAGGGGNTGC

NCCCTTTTTTCNTMGGAAGTATTNAGGGGGAGTAAANGTAAGTAAGGAAGGAATTCCGG 

CCTTGGTTCNTTCCGNGGGTAGGAAGGAAAAAATTCCAACCAAAGGCCCCGNTCCCCGGA 

ATNCCCTTTTCCTTCTTNAGGGGTTNCNTNCGGGTTAGNTTCCGGAATTCCTTAAGGGGT 

TCAAMTGGMAMGGGAMATTAGGGMAGTAACCAGGTTTTTTGGCAAAAGGAAGGAA 

AGGTTGGGGGTTGGNTAACCCCTGGCCCCCCGNGGGCGGGGCCCGGCTTCCTAAGAAACC 

TTAGGGTGGGGAATCCCCCCCCGNGGGGCCTTGCCAAGGGGAAANTTTTCCGNAATTAAT 

TCCAAAAGG 

Sequence 1471 

AGGTACAAACGAGTCCTGGCCTTGTCTGTGGAGACGGATTACACCTTCCCACTTGCTGAA 

AAGGTCAAGGCCTTCTTGGCTGATCCATCTGCCTTTGTGGGCTGCNTGCCCCCTGGTGGG 

CTNGCCTGNCCCACCCAACCAGGCNTGGCTTCCCTGCCTTGGCCTTGGCCTTGGCAAGGC 

TCCCCCAGGCCTTAAAGGGGTTTTGGNAAAGGNCNCCAAAGGGGAAAAGGAAGGTTCCGG 

GMGG GGAANTTCCNGGGMCCGAAGGGNATTMTGGGGGGAATTTTTTGGGGTCCTTNC 

TTTTTTGGACCNTAAAAATTCNAACCCCAAAAAAAAAAAAGGNCCNAAANCCCCCCAAAA 

ANCCTTTTTTAAGNCCCCAAGG 1 1 I I I 1 1 I AATTTTTTGGCCAAAAAAAAACCAAAAGGG 

GGGAAAAAAANTTAAAAAAMGGGGGGCCCTTTTAAACCCTTTTTTCCAANTTANANTAT 

AANAATAAAATANNANAANANNNNNTNNGNGGGTTTAACCCCCTTTGGGCCCCCCCGGGG 

GGGNCCCGGGGGCCCCCCGGGNTTCCTTAAAGGAAAAACCCTTAANGGGTNGGGGGGAAA 

TTCCCCCCCCCCCCCCGGGGGGGGCCCTTGGGCCCAAAGGGGGAAAAAA 

Sequence 1472 

AGGTACAGAGTCTTTTGCTTCCTCCCACCCCTAGGGGGAAMACTGCTTTGTGCTTTGGG 

AAGTTGTCTCTGAAACCCGGGGACAGAGGACCGCAGGACAGANCTACCGNCGGGGAGNCN 

CGNGGNAGGGAATGGGGGCATGCAAGTCATGNTGAGAAGNGGAGNGTGATCTTACAAGCA 

NNGGAAGNACGTATGNGGTCCCGNGTAAGNAAGTCCANGTAAGNGNCCCCTTGGAAGNAA 

AGTCCCAAGTANCGACCAGTNTTGNGNAGTAAGTAGGGTGNTGGGNAAATAGGTTGNANC 

CCATNTCCGGGGGTCCTTGGGGGGNCCTTGGGTAAGNCCNCGNCCAACCAACTGCTTCTT 

CCTNCCCCMTGGTTTAAAAMTTAGGCCAC^ 

CAACCAAAAGGGTTCNCNCCCAATTCCCAACCATANAAAAAAAAANATAGATANACACAN 

AAAGGGGAAAAGGTTACCCCTT 

Sequence 1473 

AGGTACTGGTGTGTCCGGAATCCTACCCACTGTGATGACAGTGCCTGATAGTTTCTTCTG 

CCTTTCTATCCCMAACGATTGGTCAGTTTACCCMGGTTTGGCAAATGGCCAGCCTTNA 

GNMATCTTCCCCAGGGGAANCAATNCTTCTTTTCTTAGGTTAAGNTTTGGCCCCTTTAA 

GNCCCAATTCCTTTTNGGTTAAGGTTTTGGNMTNTTTGGMCCTNTNTTTNTTT^ 

NTNTrNTTGTGGCCTTTTCCCCAGGAAAMGGCCTTCTTAATGNCCTTTCMTTAATGGGG 

AACCTTNGGCCANTAACCCCAAAA I I I I I I I I I I I I NGGGTTTTNCAATTTCCTTGGGTT 

TTTGGGGGTTNCMTGGMTGGGGTTTAAGGCCAAGAAAGGCCCTTGGANCCCCTTCCCC 

NTGGGTTTTMCCAAAATTAAGGAAAAATTGGAMTTCCGGGGTTTTTCCCCTTGGGGGG 

CCTTAACCAAGGAACCTTTTGGAAGGGTTCCTNGGTN I I I I I I I I I t M I GGGI I I I I I I 

TTTTAAAAAAANCCCCTNNTNCNCCCCCAAATNGGGTTGGGGGGCCNAAAAA 

Sequence 1474 

CGCGGTGGCGGCCGCCCGGGCAGGTACTTTTTC T I I I I I I I I I I U I I GNGGATGGGGAC 
TTGTGAATTTTTCTAAAGGNGCTATTTAACATGGGAGGAGAGCCGTGTTGCCGGCTCCAG
CCCAGCCCCGCNTGCTTCACNTTTCCACCNCTTTT^^ITTCTCACCTTGCCTTCTTGGGCT

TTTCTNCAAGGGCCCCTNNTGGCCTTTCTTCCCCGNACCCCTTTGGTTCTTCNCTTCTGG 

AAANAACCCNCTTCNCTNCCAACCAAACCTNNGCAGNCTCCCATTNNCTTTCCCCGGGCT 

TCCCCCNTCNCCTTAAGGTTCTTGGTTCCCCTTGGCCGGTTQNCCTTCTTGGTTCCCCCG 

GGGTTTTTCAAGNAGNAACAAANCNTTNCCCCAAAAANGCCAACAAAAAAGGCCAAGGTT

I I I I M l ICCCCCCCCTTAAAAGNGGGNNTGGGGGGGAAGGGGGAAAAGNCCAAAAAAAA 

GGMCCTTCTTGGTACCCCTTTTTNGGGGCCCCGGGCTTCTTAAAGAAAACCTAAGGGTG 

GGGGAAATNCCCCCCCCCCG 

Sequence 1475 

AGGTA CTTCAA TCCTGMTTMTCTTTAAACACTTTCAAATATGGAGATTAATCACCAAC 

TTCTTATTTTTTGGGCCAGTTGGATTCAA 1 1 I I I I ATTTAACCATGGANCTCTTTCNTAT 

TATTCGGTTACCTNGTNCGGAAATGGCCTAGGAAAGNAAAGGGCCTTCTTTTTCNCGGAG 

GGGTTN CGGGGAACCAAGNCNAGGAACCTTTTTGGTNAA I MM I AATTGGTTTCCCCTT 

TTTTTTGGNTGGGGMTTTCCTTTAMTNGAACCCCCAGGTTCCGGGAACAAGGGAATTG 

NAGGCCCAAAGGGTTGAAAAMGGGGGAAMCCNTTTTTNTNAAACCCCCCAAGGGTATT 

GG CAAGTTT GGGACCACCATTAATTAA I M M M M I I CCTTTGGGGAAAAGGGAATGGT 

TAATTTTTTCCCCCCAAAATTCCAAAANCCCCTGGGGTGGGGAAATTAAAGGGAAAATTT 

TCNTGG AAAAAACCAACCCAATTCTTAA I M M 1 1 GGAAAAAGGCCTTAA I M M M M I 

TGGGAATAACCAACCC 

Sequence 1476 

AGGTACAAAATTTTATTAAAGGTCTTTAGAGAGCAACATCCAGACTCCAGAATACAGCTG 

NGNAGGAGACCCTGTTATGCTGTGGGGACTGGGCTGGGGCATTGGAAAAGCNCCTCCNTC 

TGGGCCTTCCCAACNCCCTTTANTTGTTTCTTGNAAGAAATGGGNGGGGNTTTGNTTGGG 

NGNCNATGATNATTANCTTTCAATTCTTTmAGGGGGGATTTTNACCAAC 

NCCTTTCNAAACCGGTTGNGGNNTTCCATGTGNCCAAGGGNGGNGGCCTTTTCTTTTTTA 

NGGGGGNCCCCCAAANTTCI^TTNMCCCCCAANTANTTTTGNTTTTTCC^ 

CCCGGTGTTTNTACCCGGAAAATTCTCCGGNGGGGGNANCAAANGGNGNNTTTNCCTTGG 

TTG NGTATTNCCCCAAACTAATTTAAAA AAAA AAAC 

Sequence 1477 

CGGCCGCCCGGGCAGGTAC I M M M M M M M M M M I I GATGAAATGCCAGGGTGA

AAGGGATAGCCAAATAGGCTAAAGCACAAGTGCCACTCTAGGTTAATTTCGGCAGGACNN 

GCCCCAAATAAAAGGGTCCCACCNGACCANNTTACCCAATNCAACAACCATTTCCCGGNT 

TTCGGGG 

Sequence 1478 

AGGTACCTTAACTTGAGTTACAGGGCTGGTCCCTCTCTTTTCAI MM I ATCCCAGTAGG 

TGAGACCGTCCCTGCTGTGTTCGGTGGCTGTGGAGTTGATGGCATCTTTGCTCCAGGTGA 

CACCTGCATGCTGGCCAGCTGAGCGGCATAGAGCTGCTGCAAAACAGGGAAGACAAACAT 

TGATCTCTTTAAATGCAGCTGAAATTGAGTTTCAAMGAAAMACTTACCATATTGTTAG 

ATTTCTCAGACAGCCTTGTMTTTCTTTTTCACT 

GGCAAATAATAAAACAGAAAGAAAAAGGACAGGTATGGGAGCCATAGTTCTGTTTCTGGT 
CCTTCTAGC MCMCTTTATGATCCTGGACMAGGATTTGATCTCTTTAGAATTCAAGTT 
I I I 1 1 I I MA 
Sequence 1479 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAACACCTGCTAAAGACATAGC 
TTAATAAATATTTGTTGAATGTTGTTGMTGAAGAGCATTTTCCTGCTCATAMTTTCTC 
ATTACTTAACGAATGTAGAATAAATTCCTTACTCTGGCAGACTGGAATATGTAGTTACAG
TTTATTCTAGMTTTTTGCATTCCCCTT^ 

CCTAAGTGACCCAGGTGAGTCCAGTGTCCTGCTGATTTCACTGCAGGGCAGGCATGTGAT 
CCAGGCTGGCCACAGTGGTCAGTGTGGACAACTGACCCTTGTTCATCTGTGGAGTCTAAC 
AAAAGTAGTGCCCCCTTATCCTCGGTGGTTATGTTCCAAGATGCCCAAGTGGATGCCTGA 
AACCGTGGATAGTACCTGCCCGGGCGGNCGNTCTAAAACTAAGTG 







Sequence 1480 

CCGCGGTGGCGGCCGAGGTACTTAGCATTGATCAAAGAAATTTCAAATTACCGATCAATT 

GGGTGGGGAGAGGAATTTTCATTGTCCAAGCACCCTCAGGGAACAGAAGTCAAAGCAATA 

ACATATTCAGCMTGCAGGTCTATAATGMGAGMCCCGGMGTTTTTGTGATCATTGAC 

ATTTMGACACCAAAAAATAAMGACTCCTACGMGMCTGTTTTTGTTTTCCTCTTCCT 

mGAGMGACACTATGMTTAMTTCTACAGCTTTTTm 

MCAGAMTATTTTAGTTAMGTGTGACTTTCAGAAAGGGAAAATCAGGGCACAGCCTTG 

GTCTGTGTTCCCCAAATATTCACACTTTAAAGAATTCTTAAC 

Sequence 1481 

ATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATGATTGTCCTTTATGCATTAAATTCAT 
GCTTTTACATACTTGTGTTTCCTTTTCATTTCGTC^ 

TAGAGAAAAGGGTTGGCCAGGTGCAGTGACTCACACCTGCAATCCCAGCACTTTGGGAGG 

TCGAGGTGGGTGGATCACTTGAGGCTAGGATTTTGAGACCAGCCTGACCAACATGGAGAA 

ACCCCATCTCTACTGAAAATACAAAATTACCCGGGTGTGGTGGCGCATGCCTGTAACCTC 

AGCTACTCGGGAGGCTGAGGCAAAAGAATTGCTTGAATCTGGGAGGCGGAGGTTGTGGTG 

AGCCAAGATCGTGCCATTGCACTCCAGCCTGGGCAACAAGAGCGAAACTCCGTCTCAAAA 

AAATTAAATAAATAAGGCTTATGTGCAGTTATTCTCAATGAGCTAAAAAGCTTCCTGAGG 

CTAGAAATTCTTGTCATTTTTGGCTGGGA 

Sequence 1482 

CCGCGGTGGCGGCCGCCGGGCAGGTACTATCTATTTGGCCTTGGAAAGTTACTCTCTAAG 

TCAGGGCTTCTTTTCTTNMCTTTGGTGTGAGACTCCGCTGGAGAGCCGGTTAGAAAAAC 

ACAAGTCTCGGGCCCCACCCCGCAGAGCCCCTCATTCTCTAGTTCTGGGTTGGGGCCCAG 

GAGTCTGCTTTTCTAGCAAGCGCCCAGATGTCACTGATGCTTACAGCTCTCANACCACAG 

TTGGAGCAGNGATTTTTAAMGTCTTTTCATTTGTAAAGAGTNGTTCTCCATGCTCCAAA 

TGACTGNGACGACTGAGAAAATGCATGTATGTAAAGTCTGCANCTGGTGACATTGTACAC 

ACTNAGCAAATGGCCTTNGGTGTTACTGTNANTTTA I I I I I ACTAATTATTTNTNNCACC 

NACAAATTNGGANCTGCTNCAATCGGTNGGAATTTGGAAATTGGGC 

Sequence 1483 

GCTGCCATCAGCTCCCTAGGAGCTCTCCCTCCAGGAAGGGAATGTGTCCACCGTCAGACA 

CTCAGACCCAGCATGTGGGGACAGAGGCTGATGGCCTGTCTGGCCATTCCTCTCAGTTCC 

TCTCCTCACTAGCTTGTGTCCTTGTGCAAGTCACTTACCCTCTCTGAGGTTCAGTTCCCT 

CCTCTTTGAAGTGGGTTTAATAATAAGTACCTGCCG 

Sequence 1484 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTGCCCTTTCGTTAGAAGGCAGTGACTCCTTTC 

TGTGAAGCCGATTTAGTGAAGTGTCCTGTGCAGAAAAGAGTCCAGGGCTGTCAGTTAATT 

TCTTCCGGCCACTGGAGTTAGGGTTTGAAACTCTGCAGCTGCCTATTGCACTTGTGAAAA 

GGTTTGTATGTTCATCACTGCTGGCTGGCTCAGAGTTGGGAGTGAATCCTCCAAGGGATA 

AGCTTGGAGAACTTTCTGAACAGTCAATCTGTAAAGGTGTCTGCAATCCCAAGGCCAATG 

GACTAGATTCTGAAGGCTCTCGGTGGACCCACTGTTCCTCTCTGTTTATTAAGCI II I IG 

AAGGAGAGAGATGAGGGCAGGACATGTGACAACGGTGCTTTTCCTTATGCTTATATCGCT 

CTCCAACAGCATCCTT 

Sequence 1485 

AGGTACATGCAAGTTGCATGATTATMTGACGTGATCCTGGGATTTAAGTTGATTATGAC 

AGGAACAGAAGGAACTTGGAGATTAGGGACAATGAAAAGGTGGTAGTGAAAGGAGTGTTG 

GAGTTAAGTTACTAGATGTCTGGAAGACAGACTGTGGTGGTCAGATAATGAGATAGATTA 

TGGAGGGGTTACAGTTTTTGGTAATAACGAGAGAGATCTAAGGTATGACTGAGAGTGAAT 

GGATGAGAATAGCAGAGAACAAGGTCAGTGGAAAACTACTCTTCAAAGAACCTATAGTCA 

GGGTGTTGAAAGATTAAAAATTGCTAAGAATTAAATCAGGAAGTAGTGCTGCGGGAAATG 

AAATGAGCAGTGAACTAAACTTACTGAGAAATGAGAGGGGATGACCCAAGGGTTTGTAGA 

TTTTGTAGATGATAGCA 

Sequence 1486

AGGTACATGACATCCGACGAAATGAGACGCCACCTAATTGATTTCCGGGAGTCCGCACCA

GGGGCCCTCAGGGAAGAGACCCCGCAAAGATCCTGGGAGACCAAGGTGGGGACCCTGGTG 

AGMGAGAGAGTTCAGGGGAGTCTCTCTTCATTGCCCTTCTGCTAACCCAAGCATTAATT 

TGCTAAGTATTTACCAGGGGAGTGGGAAAAAGAGTTGAGCGGGATTCTCTTAGGCTATGA 

GAGAGTCAGGCAGCCCCCAAGATAAAATAATGAACTAGAAAATCTGGAACCTTACTTCTC 

TGGGAATNTTACCTATCTGGCACCGTGGGAAGAAAGAAAAAAGGCTACTGAGTACCTGCC 

CG 

Sequence 1487 

GCGGTGGCGGGCGAGGTACTGACCTCTGCCAGTCAGCTGCGTAACTCCAGGCCCTAGGGT 

GCCCCGTCTGTCCAGCCAGGGATTGCATGGATATGCTGTGATCTCCCTTTTGGTTCTGAT 

TGAGTTGGACCTTGTGGGAGGAGAAACATAGATGTTGATACATGAACACATATGTTGGAG 

AGAGAAAGTTTTATCTTGGCATAGGACTTTTAAAACACAAGGTAATTTTTMTCA 

GGGACCAAAAAACACTCAATATGGGAAAAATCCAAATTCTGCCAAAATGTCTAAAGAGGT 

TTATTCTGAACCAGTATAAGTGACTGTGGTCTAGGTTACACAATTTCAAGAGATCCTGAT 

MAGCGTGCATGAGAGAGTTGGGCTACAGCTTGGTTTTACACATTTCAGGGAGACAGGAA 

TTGTANGGTAAAATTATGGCACAGGACTTTTAAAATGAAGCTGTGAAAGTTTACAGTCCA 

TAGAGAATAAAAATCTAGAAGTTT 

Sequence 1488 

CCGCGGTGGCGGCCGAGGTACTTGACACGACATATGGTAAATGAATAAGACAAAGGCTCT 

GATGGCTTCTCACAGCCCTGTGGATCAAACTCAGATTCTTTTCTAGAACCCCAAGGCCCT 

GTCAGTCTCACTAGCCTCTCTCCAGACACACCAGCCTTTTTCTACCTTTCCAACATCCCA 

AGTTCCTTTTCTCTTACAGGACTACATACACTCTCTCTTCTGCCAGAAACCATGTTCTAC 

CAGCTMTTTCCACTCAACTTTTAAGTCTCANCTGAAATGTTACTTCCAAAGAGAGGCCT 

CCACTGAACCCCAAGCCTGGGGTTCACAGCACCTGTCTCCATAACTACATAATAATCTCT 

CTGATGTTTAGGCTGGGCATGGTGGCTCACGCCTGTAATGCCAGCACTTTGGGAGGCCAA 

GGCG 

Sequence 1489 

GGGCGAATTGGAGCTCCCCGNGGTGGCGGCCGGGCAGGTACCCAGAGTTGCGAGGAGTTT 

TTTAACTGATTTAGCCAGGTGGCAANCATNAGTGAATGGATGAAGAAAGGCCCCTTAGAA 

TGGCAAGATTACATTTACAAAGAGGTCCGAGTGACAGCCAGTGAGAAGAATGAGTATAAA

GGATGGGTTTTAACTACAGACCCAGTCTCTGCCMTATTGTCCTTGTGAACTTCCTTGAA 

GATGGCAGCATGTCTGTGACCGGAATTATGGGACATGCTGTGCAGACTGTTGAAACTATG 

A 

Sequence 1490 

GATNAGCTCGATATNGAATNNCNNCANNCNNGGGGANNCANNNGNACAAGAGCGGANNCC 

NNCGCAGAGGAGCTNCAANTTTACACACTGTTTAAATGAGGGAATANGCNGCAGCGCTTG 

GATGTAACTGAAGAAGACAGTNNGAGCNNCAAGGAGGGACAACCACGACCTATGAGGACA 

CCATGCCAGAGAGGCCTGGACCCCACGCTAGGCTCAGTGCCTGTTATACTCTTGGGACCC 

AGCGCTTTCCTTCTCCATCACGTGGCATACTTGGCATTATTTGTTGNTTAAAATATTGCC

CTTAGTTTTNACCTTTCNTMGGAGACACMGGNNGACCTTTGNGACATTAACAGTTGCC 

CCAAGTNGGGGGNANANAAANAATTTTTGGGGGNGNAAAAANCTTTTGGCTTTTTNNAAA 

AAA I I I I I I I I I NAAANTTT 

Sequence 1491 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTGGTGACCCTTGCTGGT

GCTTTCATGTTTTGTGCCGAGGCMTTAGACTTTGTGCTGMTNTGTTTGTGCTGCCACC 

TCAGGGAAGGGGTGGMTGTGCAGCGTGGTTTCCATTTGACATTGTTTTCCCTGAGAGAT 

GGGAGGGCTGAACGTTACCTCTTGACAAGTCTTAGTGGACAGAGGGGCCCGGATACCCAA 

GCGCCTTAGTTCTTAGGGCTGGGTATTAGTTCATTTTCACACTGCTGATAAAGACATACT 

CGATACTGGGAAGAAAAAGAGGTTTAATTGGACTCCCACATGGCT 

Sequence 1492 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTACCCTTTACTTTTTCCC
CAAGACCATCTCAGGGTGGAGCATTCTGTCTAAGAGAAGAAAGATAAGGAGGCTCCCACC

CACCTCTCCCAAGAGCAGACATTAMCATCTTTGTGCTTTGMGAGAGTGMTTTTGGAT 

AGTCTTGTGATTCTCAGACTMCTTCCAGMTTATACTTTAACCCCTTCCAGATATGGTC 

CGCCTTTGGCATTGTGTGTACCTGTGATGGGGCGTGTGGTTTCCGGTTGTCTCACCTTTA 

ATTGTCAACCTCCAGTGTTGACTCTAGAMTATGAGGAAAGCTTTTCAGTTTTTAAMTT 

GCCATTTAMTTTAGTCTATTMMACAMCCTAGAGGTCTTGGTGCAGTTGATTTCAGA 

GTTTATTMTTTAAGTGGTCCCMMGTATTACATCTTTTATATTCTGGAAGAAAAGAAC 

TGTGAACAAATTAGA 

Sequence 1493 

CTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCANGGTACTGTGGAGGT 

CCCAGGAGTGCTGGTGATGGGCACAAAGTTCCGATGGGTGAAACCATTGACATAGAGACT 

GCCTCTGTCCAGGAGGNANGGGCCCAGCTCGATGATGCCATGGGTCAGTTTGCTCAGCTC 

CCAGTATAGCTGCTCTCTGTCCAGTCCAGGGTTTAGAGGGTCAAGGCGGTGAGTGCAGAT 

GGTGTCCACGCCGGTGGCTGNCCCACGTTTTTCAGGCCTGAGCAAGGTCAGTNTGCAGCC 

AGAGTACCTT 

Sequence 1494 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTAATTTGAATTTGTAATGA

GTCTGATGGTATATTTCAATTTTTTGCTTTGAGGGACTGGCTGCTACATTGCAGAATATC 

TTATATCCCTGACTGCTTTCCACTAAATGTCAGTGGTGACCCCAATCCAATTATTATGAC 

AACT GAACAT GCTTATGCATCCCTCATGCCTTTAI I I I I I ATTTTGGGAAATCTTTCAGC 

TTCAGTTTTTGCTGATATTTATGTGATTCTTTGTTCTGCAATTCAAATTTCTGGGA 

MCAGTCTCCTTGGTTCAGATTACTGTTTTTTGACTAGAGCTTCTCGCTTCAGATTCTGT 

CATMGATTATGGCTTAACCTATGGTTGTCCTTTGATTTGGTGCCATATGAAATAAAACA 

TTATTT 

Sequence 1495 

GAGCTCCCCGCGGTGGCGGCCGCCCGGGCAAGGTACGCGGGGAGTTTCACGCGCGTATGC 
TTTGCCCGCCATGGCCGCAGTCAAGGNGCCTTTGGAGTTCCATGCCAAGCGGTCTTGGCG 
CCCCGAGGAGGCAGTAGAAGATCCGGACGAGGAGGATGAGGATAATACTAGTGAAGCCGA 
GAATGGGTTCTCCCTGGAGGAAAGTGTTACGGCTCGGAGGCACCAAGCAAGGATACCTTA 
TGCTGGCTACTTTGGATGAGAATGAGGGAAGTGATAGATGGAGGGCAAAAAAAGGAGCAA 
TCCGATTGACC TTTNA CCAAGGGGGANTTTTGGAAGC I I I II I II I i AAAAAI I I I I NTT 
TTTNGGNGAAATTTTTNNAAAAAC I I I I I 1 1 I NTTTNAAANNANATTACCCCCCTTTTAA 
AAAAAAAAAATTTNCCCCCCAAAAAAAAAANGGGGNAAAANANCCCCCCAAAAANAAAAN 
ATTAAAAAAACCCCCCCCAAANGNTTCAAGGGGCCC I I I 1 1 I I I I I I I NGGGGNAAAAAA 
AAAAMTTNGGCCCCCNCCNCNTTTTTTTGGGAGGAGAGGGGGGNCCCCCNCCCCCTTTA 
AAAAAAAAAAAGGGCCCCCCCGGGGGGGGGGGGNANTTTTANAAAN I I I 1 1 1 I I ICCCCC 
CCCC 

Sequence 1496 

CGGGCAGGACCATGGGAAATAAGAGCNGGCTNNNGGCATTCTGNGTANGGAGCCTGAGCC 

MACTCTAAAGCTGTCTTTATAMGGGAGGTCATGTGATGGCCAGAAATTGCCTTTGCTT 

CATGGTGCACTTGGTGGGGAGTCAGGTGTGGGGTGCTGGGTTTCACATCATCCCATTTTC 

TTTTNNGNNTTCAGACCTGCAATGCTTCTTTTGCAACCCGAGACCGTCTGCGCTCCCACC 

TGGCCTGTCATGAAGACAAGGTGCCCTGCCAGGTGTGTGGGAAGTACCT 

Sequence 1497 

AGGTACTTTTNGAAGTAACTGGACATGNGGGAGGNNAGGGGAANGGAAGTATTGNTATGG 

ACTGAACTGTGCCCCCAAAATTCATATGTNGAAGCCNTGAGCTCTGACATGATTGNATNT 

GAAGTCCTAAAGCCAGGAATGAGGAAGGCTGTGAATGTNCATTGTTCCATGCAAGAATGA 

CTCTGGNGNGGGCTATTTAGAGATCATGAGGGATACTGCCCCAGTTTCCACAGGCCAGAT 

GGNCTCCAACAAAAGCCACGGGGGAGTCACCCCTGCCTGGCAGATCTATCGGGTCAGGAC 

CACCGCCCAGGGGGTCCTGGAGAGGACAGTATAGGACCAAAGAGGAT 

Sequence 1498

CACCGNGGTGGCGCCCGAGGTACTTGGCCAAGCGCTCAGATCGGCAAGGGGCACCAAGTC

TTGATCTGCCCAGTGCACAGNCCCACAACCAGGTCAGCGATGAAGGTATCTTCAGTCTCC 

CCCGAACGATGAGACACCATGACGCCCCAACCATTGGCCTGGGCCAGCTTGCACGCCTGA 

AGAGACTCGGTCACCGGAGCCATTCTGGNTTGACTTTTTAGCAAGAAGGNANTTNAAGGA

NTTTTTTTI I ANCGNCTTNGGNAATCCTTTTGGGGGTGGGGCNCNTGGGNGAAAAACTCC 

CCCCC ANCCN TNGNGATTCNTTNCNNGNGGGNNGMAAATTTTTTGGNNAAAANTNNCCC 

ATAANNTTTT TNGTGM NAANGGANTTTTTTNANAMANCCCCCTGGGGTNTNTNTTTTN 

NNNAAAAANATTTTTTNTCTTCNCNGGGGGGGNCNNTTTTATAAAAAAAANGANNANACC 

CCCCCNCCGTGGNGGAG AAAATTNTTTTANATTA I I I I I I I I CNTCCCCCCCCCGGGCCT 

TGGGGGGGGGGGCCCCCCANCCC I I I I H 1 I I I I I I I 1 

Sequence 1499 

ACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAGTTAA 

TTTGTGTCATCCCATCAGCAATGAAGGTCCCTATCCAGGGTCCTGCTTGGAGCAGCATTT 

CATGTTCTTTTGCTGTTTTGTGCTTTGCCGATTTTGGATTTTATTT^ 

ATTTAAAAAACTCGTCACCTTTTGGAMTGCCCATTGCCGACTTGAATTTTTTTGTATG^ 

AGTCCCTCCTGATTTTGTGTGTGTGTGTCTGTGTTTAAGCAAGCGTTCGGTTGGTATAGN

I I I I 1 1 I I GTTTTTTTMTTTAAATTGAAGGTAGCTGCCTCCTGAAAGCCAGCATTAAGC 

CAGAACACCCAGNTTCAAGCAAAAGACCCACCTCTCTGCAGAGGCAAAGTCTACTTTCTG 

GTACCT 

Sequence 1500

CTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTTAGTAGAGACGGGGTT 

TCACCATGTTTGCCAGGCTGGTCTCGAACCCCTGGCCTCAAGTGATCCACCCGCCGCAGA 

CTCCCAGAGTGCTGGGATTTCAGGTGTGAGCCACTATGCCCGGCCTAATACGTGGATTTT 

TAAAGCTTCAGGTTCTGGTTCAGAAGTTTCCTGGGTCTCATTAAAATAATGAGGCACTCA 

GAATTGGTCTMTAMAATAACGACCATTTCTTTCTACTTCAGCTNTTTCACAAACTTTT 

TTAATGAAAATGACMGNGAGGNCCTTCAGTAGGGGCATTTTCAGGGGANAAAAATAGCG 

GGNA GACCTGAAACCTGGGNTAGGNAGTTTNTTTTTATTTTTTGAACAAGMGANNM 

TT1TCANAAGACCCTAAAAMTTNTTTTCCCAAAAACAAAAGNGNTTTNTTTTTN 

GGNGGGACCCCCTTCCTTTTGGGNNTTTTNCCTTCCCCCCT 

Sequence 1501 

CACTACTATAGGGCGAATTGGAGCTCNCCGCGGTGGCCGGCCGCCCGGGCAGGTACGCGG 

GGGCCACTGACCACAGCTCTTTCTTCAGGGACAGACATGGCTCAGCGGATGACAACACAG 

CTGCTGCTCCTTCTAGTGTGGGTGGCTGTAGTAGGGGAGGCTCAGACAAGGATTGCATGG 

GCCAGGACTGAGCTTNTNAATGTNTGCATGACCCCCAAGCCCCACANGGAAAACCCCGCC

CCCGNGGACMTTTGTTTTNACCANGTTTCCCCCNT^^ 

TTTNCCNNCCCCCCCNCGGNGNNCCCNMGGGGGNNTTTTCCCCNNTTT7TTNANTNNNC 
CCCCCCCCCCCCCGGGGGNGGGGNGGNCNCCCCNCTTTTTNNNAANNAN i I I I I I I I I NT 
TNNAAAAANCCNCNNNTTTTTTANAANGGGGNTCCCCCNNAATNTNGGGGCCCNTTATCC 
CCNGGGGGNTAAMNAAANTNTTNNCCAAMAGGGGGGNCCCCCTCCCCCTTTTTNAAAA 
NAGGGGGCCCCCCCCGNGGGGGGNNGNGAANTNATAAAAN I I I I I I I ICNCCCCCCC 
Sequence 1502 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATTCCAGCG 

CAGTCTGGGCCCCAAAGCAGTTTCACCCTTACCTAACTTACCTAACTCTTAATATAGCCT

MACTCACTGAMMTAAGCTAACTTCATTTCACCTTTNGTAGCATACACGTAGACTCAG 

AGTNTATACTGAAATATAATGGGAGGCTGTTATNAAAAAAAACCNNCCCTCNAAAAAAAA 

AAAATNTNNNCNNCTNAAAAAAAAAAAAAAAAAAAAAAAAANNNGGGGGGGGGGGNNNNG 

GG I ll I M I INATNNTAAAAAMAAAATNNGGGGGGGGGGGGGGGGGGGGGGGTTTTTTN 

TNNTNTNTTCCCCTNNANTCCGGGGGGGGGGGGGGGGGGGGTNGTNGGTTTCCTTNTATT 

NTCTNTNNCTANNNCACNCCTCCNANACCCGCNNNNGANNNNAANATNNNTTNNAATTAT 

NNNNATTTNNNAAAAAGNGGGGGGGGGGGGGGGGGGGGGGTTNCTTTCNNTTTNNANTAT 

AANTANNATAGTNTCTNNNGNANNNTTNTNTNNNNANATNTNGGNNTNCNCNCNTNCNTN
CNTCTTNTTTATAAAANAAAAAAA
Sequence 1503 

CCCTCGCGGTGGCGGGCGAGGTGCATCACCCTGCTGAGGGACATCCAGGACAAGGTCACC 

ACACTCTACAAAGGCAGTCAACTACATGACACATTCCGCTTCTGCCTGGTCACCAACTTG

ACGATGGACTCCGTGTTGGTCACTGTCAAGGCATTGTTCTCCTCCAATTTGGACCCCAGC 

CTGGTGGAGCAAGTCTTTCTAGATAAGACCCTGAATGCCTCATTCCATTGGCTGGGCTCC 

ACCTACCAGTTGGTGGACATCCATGTGACAGAAATGGAGTCATCAGTTTATCAACCAACA 

AGCAGCTCCAGCACCCAGCACTTCTACCTGAATTTCACCATCACCAACCTACCATATTCC 

CAGGACAAAGCCCAGCCAGGCACCACCAATTACCAGAGGAACAAAAGGAATATTGAGGAT 

GCGCTCAACCAACTCTTCGAAACAGC 

Sequence 1504 

CCGGGCAGGTACTCTTGACAAAGCCTCCCCAACAAGGTGAAATCACTCTCGGCTCACCAC 

TGGTGGCAGAAATACTCTGGGCCTGTCTGTTCTGAATGGTCAGTTTCACTGTAATCCTCA 

GGCCCTTCCAGTCACCCGTTGCCTTGGCAATGTCATCACCMCTTTTTTTGGAGACAGAC 

CCAGGGGGCCCGATCTTGGGGGCCAGGGCAGAAGTTGGCACCGACTTTNCCTTCCGGGNN

CCCCTMGGTTA MANA CTTTTNTTTTTTTGGGGGNNNAAAANTTTCGGGCNNTTGGGGG 

GGGGCCCTTTNNTTTTTAAANNAAACCCCCNNTTTTNGGGAAAAACCCNAAAGAAAAGTN 

CCCC CTTGGGCCCCCCCCCNNCCCAAAAGGCCGNNAAAMCCCCCGGTTTTTTNGGNGTT 

TTTTAAAAAAAMGGAAANCCCCCCGCNNCGGGAMANTTTNTNANAANTTTTTTTCCCC 

CCCCCCCCNCGGGGGGGGGGCCCGGCCNCNN 1 I I I I M I I M I U I I I 

Sequence 1505 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATGGTGGTAA 

ATGCTTAACTCCACCTATATAAAAATCAAAAACAGGCAAAATGAATCATCAGACTACAGC 

TTACCTTTAGGGGGTGTGAGTAGTGACTAACAGGGGTCAAGATGAAGGTTTAAGGATGTC 

GCCAGGCGCAGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGCAGGCGGAT 

CACTGAGGTCAGGAGATCGAGACCATCCTGGCTAACACGGTGAAACCCTGTCTCTACTAA 

AAATACAAAAAATTAGCCGGGCGTGGNGGTAGGCGTCTGTNGTCCCAGCTACTCGGGAGG 

CTGAGGCAGGAGAATGACGTGAACCCAGNAGGTGGAACTTGCAGTGAGCCGA 

Sequence 1506 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCATTTCTGCATTTATTTTAGCCCATGGAATAA 

CTGTGCTGAGAAACCACAGAGTCAATCAGATTCAAAATGTTAAAATCCTTCCTGCTTGGA

GTTTTCCGTCTTCACATCAMGCATTTCATGCCCGTCAGCAACTTTTTAATGCATTTGCT 

CCTCGTTTGCACAATTTCCATTTAMGACTTTCCTTGGCTGACTTCTCTGATGAGGTTTC 

CTGCTTGCCAGGAGAGCACGCTAATGCAGAMTTACMAGGGGGCTTCACGTCCCTTTTC 

CGGAGGACCTGATATTTCAGATAATTTCCAGCTTCAGTTTTGGAGAAACGACTGTTCTTT 

GCACCAGGGGAAAATAAACTGATTTTCAGTGTAAGCAACCTTTCTGCAAGTAGAATGGGG 

ACTGTTGGGMTGGAGATGAAGACTTCACTTCATGTTATTCATTTCCTTAC 

Sequence 1507 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTA CI I I I < I I I I I I I I I I I I I G 

GTTAGGTTMCAGAAACAATGGGGTTATTTCCGGTAGTGGCMCAGCACGAACTAGTTTT 

CTTGTGGACTAATGGCCCGAGAGTTCCTGCGATTTGTGGATTATTTCCCTTAGATGCAAT 

CGACCATTTCGTTCCAATAAGAATTAATGCTACCACGGGGCTTACAAAGAACCCCTCAAG 

TCTTCCAAATCTGCCCATGACATCAACCTNTGCTGCGTAATCGGACCTGCACCCAACCCA 

GGTTT 

Sequence 1508 

CCGCGGTGGCGGCCGAGGTACGCGGGAGAACAGCTCAGAAGGAGACCCACAGTGAGCAGC 

TCCCCTGTGTCGGCGGGCAGGTCGTCCCTCAAGTGTTCAGCTCTCAGCAGAGAAAAGGCC 

CTGGAGAGGGTGACTCCTCTCAGCTCTCAGCAGAGAAGCAGCCCTGGAGAAGGTAGCTTC 

TGTTCGCAGGCAGATTGTCCAGAGGTCCTGCTGCTCTCAGACGGGGCCCTGGAGAGGATA 

GCTTCTATCCATAGGCAGG 

Sequence 1509

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATAGCA

GTGGMGCTGCAGAGGGCCGMACMAAATGMGTTTTCTACCAATGTCCAGACCAAATG 

GCTCGAMTCCAGCTGCTATTGACATGTTTATTATAGGTGCTACTTTTACTGACTGGTTT 

ACCTCTTATGTCAMMTGTTGTATCAGGTGGCTTCCCCATCATCAGAGACCAAATTTTC 

AGATATGTTCACGATCCAGAATGTGTAGCAACAACTGGG 

Sequence 1510 

CTCCCGCGGTGGCGGCCGCCGGGCNGGA C I I I I I 1 1 I I I 1 1 1 M I 1 1 M I I 1 1 I AAAGAT 

CCAGACTAAGACACATCAACAAGAAATTTCCAAATACCAGGTCAAGAATACTTCACATGT 

TTCTGGAGGGAAAGAAAACAGTTCATATACAGNGAATCAGGAATTACAATTGCATCAGAC 

TTATCAACAGCATCGGGAAGATAATGNGAAAATACCTTCAAAACTCTNGAGGGACACAAG 

AAGATGCTGCCTGGCCACAGGAGCGAGACTGCTGGCCTCCAATCAGTCTTNTGGGCAGGC 

CTNTGATGCAATTACAGGGCTAAGGAACCACGGTAACCAATGTGCCATTGGCTGTGCTAA 

AACCCAGNGGCCCCAGGAAGAATCCTGTTGACACTGGCTTGGAACATGAGGCTTAA 

Sequence 1511 

CGACTACTATAGGGCNATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAAGGTACTTTT 

II M I n I I I 1 1 If II I I I I I 1 1 GGCATATAATCCAGTTTATTAAATACAGATGATGGGC 

CAGACATGGTGATAGAGAMTACAGATTAAGAMCCAGATCAAATCCTTTTTAAGGAATT 

ATNTAGNGGAAAATATNTCMCTNTNTTISTITTACNCTACTATTCAtTATCTTACACTTCA 

AATCTTCACCTTTCCATTTTGACNGNCGCTNTTTTACTTCAGNNTCCTGAAM 

CCAACAGAAGTTACATAMMTNCTMTCTTCMGGGGCTTTCTAAAATATTTTNATCAC 

GTCATTAAAATCTTCTTTCACTAGGCAANGGTTNTGTCTTTATGGGGGCTG 

Sequence 1512 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGACAGAAATGGAGTCATCAGTTTATCA 

ACCAACAAGCAGCTCCAGCACCCAGCACTTCTACCTGAATTTCACCATCACCAACCTACC 

ATATTCCCAGGACAAAGCCCAGCCAGGCACCACCAATTACCAGAGGAACAAAAGGAATAT 

TGAGGATGCGCTCAACCAACTCTTCCGAAACAGCAGCATCAAGAGTTATTTTTCTGACTG 

TCAAGTTTCAACATTCAGGTCTGTCCCAACAGGCACCACACCCGGGGGTGGACTCCCTGT 

GTAACTTCTCGCCACTGGCTCGGAGAGTAGACAGAGTTGCCATCTATGAGGGAATTTCTG 

CGGATGACCGGAAATGGGTACCTCGGCCGCTCTANAACTAGGTGGGATCCCCCCGGGCTG 

NAAGGAATTTCGATATCNAGCTTATCGATACCCGTCCGACCCTCGAGGGGGGGGGCCCGG 

GTCCCAGCTTTTTGGTCC 

Sequence 1513 

NGGGGCCGCCGCCCCCGGGGCCGGNCAAAGCAANGGGGGGCCAACNAAGGCNNGGNNNCC 

CCNGGGGGGNANGGAAAAACGNGGGNAANCCCCGCGGCAACCAAAAGGGGNCCAACAACA 

AAACAAGAACCGGGAGGCCCCGGGGGAGNCCANAAAAAANGGCGGCANAAAAGNCCCNGG 

GGGGGGGGGGGCCCCNAAAAAAGGAGGGNGGGANGCCAAAAACCAGCNACNAANCAAAAN 

GCGGCCCGGGGGGGCCGCCCCCAACCNGGGCCCCAGCCTTAAAGNCCCAAGAANCCGCGG 

GGAAAAAACCCCNGGGNCCGGGGGGCCCCAAGCCCNGGGCCAANNAAAAAANGGGAAAAA 

NNCGGGGCCCCCAACCCGCCCGCCCGNGNGGGGAAAAGGAAGGGGGGCCGGGNNCNCCGC 

CCGGNAANNCCGGGGGGGCCGGCCCNCCGGNGCCCCGCCNNNCCCCNNCGGCGGGCNAAC 

CNGGGAACCNGCCGGCCNGGCCGGCCCNCCCGGGGGGNCCCGGNNNACCGGGANGNGGNC 

CGGGCCGAAGGCCGGGGNANGCAAGCCGNCNAACCNCCAAAAAAGGGGC 

Sequence 1514 

GTNATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTAAGTCTTNTAAATCTGGTGTGTTT 

TTTACACTTGCAGCACATCTCAAATGAAATAGCAACATTTCACATGCACAAGACTCACAT 

ATCTTCTGATGGCTATCTTGTCGAATAGCACAATTCTAGGAAGTTCAAAAGGGGGAACAA 

TTGCTTCTCAAATTGATGAAGGTAGAGAAGAATGGTATCAGAGAAACTGATTAAGCAGGG 

ACCTGGGMTCTGAGTTGAGTTCCGGGGCATGTTTGTTCCCTAGCACCATGACCTACATG 

GCAGTGTCTCTTCTCATACTATTNTAAGAGTCCCTGG 

Sequence 1515 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGAGGCCAAGC
TGGACTGCATAAAGATTGGTATGGCCTTAGCTCTTAGCCAAACACCTTCCTGACACCATG

AGGGCCAGCAGCTTCTTGATCGTGGTGGTGTTCCTCATCGCTGGGATGCTGGTTCTAGAG 

GCAGCTGTCACGGGAGTTCCTGTTAAAGGTCAAGACCCTGTCAAAGGCCGTGCTCCATTC 

AATGGACAAGATCGAGCGGCCGCGGGCAGGTACTTCATGAGACAAAATGGAAAAAGGAGA 

ATTAMTTAAMTGCACAACTMTATTTATCTACTACAGACATAATATTTCTCAGTTGTG 

MCTMTTACTATGCTTGGAAAATGCTAGCATCCTCATAAATATTTTGGTTCTATTGGGA 

TACAAMTCTGATTTCGCAMCTTTGCAAAGGCACATTTT 

Sequence 1516 

CCGGGCACGGTACGCGGGGGCATTTATCTTCACTCTGATGAGGGCTCAGACTTGATAACG 

CCCGTGGTGCCCCATCCCTATAGGAGCTGGTGAGATTGCAGCCTGCTGCCTCCCCTCCAT 

CAGCCACAGCTATTGGATTTCCCACCCAGAATCTTTAGGTAAATGAGGTAAGTCCTGATT 

TTTAAMCTTCTTTTGAATCTGGAATCCAAACACCTGAGTGGAAAGAGAAGCCTGCTTTA 

AACTGGACAGATGAAACTAGAACAGACTCTTGGAGACGGCTGGCAGGAAGTGAAGCTCAC 

CTTACCTGGGCTTACCTCACTGGGTCAAATCAGAATTTTATTTTGGAGGGCAGGTTGGCT 

ACTTTGGATATTATCTGNGMTTTCCTGCATTGGCTGGACTTCTAATCTCTGNGAATTTA 

AAAGCC 

Sequence 1517 

CCGCGGTGGCGGCCGAGGTACCTTTTAGTTATTGACAAGGTTAAAATAGCACTCTCAGTT 

TTTCAGTATTACAGAGAGCAAATAGTTTCCTCTCCTGCTCTGNGCAGTAGCTTTTCCAGA 

ACTATGGACAAAATTNGATCAGMAGAAGATTGATTATTTCCTCATCTTTTTTTCI I I I I 

TTTGAGACAGAGTCTCTCTNTGTCCCCCAGGCTGAAGTGCAGCGGCATGATCTCAGCTCA 

CTGCAACCTCTGCCTCCCGGGTTCAAGTGATTCTCCTGCCTCAGCCTTCTGAGTAGCTGG 

GATTACAGGCGCCGTGCCANCATGCCCGC 

Sequence 1518 

CCGCGGTGGCGGCCGAGGT AC I I I I I I I I I I I I I I I I I I I I I CTTTCTTTC I I I I I I I I I 

I n I I I I I I I II I I I I I I I I I I n I I I I I I I I I I I I I I I I I I I I I GGCANNAAAAANCNT 

NATTNCCATTNGGCCCAAGGCTTGTTAGGAAAGTAAAAAAAGCCCCCNATTGGCNGGNGG 

GNNNGGCTTAGGCAAAACCCCTANTACTTTGCANGGGGCCCTCNAAAAAGCCCGNGGGCC 

CNAAAAAGNCCTTAAANCCNCCNTGNAANAGGGCCCCTTNAAAAAAAAACCCCCCNNNNN

NMATTTTGNGGCCCCCCCCCGGGGGGGGGGGGGGGGGGGGGGNGGCCCCCCCCCCCNTT 

TNTNNAACCNCCCNCCCCCCCCCCNNAAANGGGGGGGCCCCCCCNAAAAAAAAAANGGGG 

GGGG GGGNNNGGCCCCCCCCCCCCCCCNAAAAAAAAAAMANGGGGGGGNGGGGNTTTTT 

TTTTTNCNCCCCCCCCNCCNNNCNNNNNNNNNGGGGGGG 

Sequence 1519 

CCGGGCNTGGA C \ » II I I I I 1 1 I I I I I I I 1 1 I I NCTTGCTTTTGG I I I II I CCTTCGATA 

TCTTCAAAATCTGTGTCAGAAGAAAAATGTGTTTCTGACTCCTGTAGTAGATTAAAATCA 

GTTGGTATTTCTGGAGCTGCTATCATTTCTTTATCATCTTCTGGAACACCTCAATGTCAG 

AAATATATTCATATATTTACGTGGGTAAMTGGNGGCTTGCCTTCTTTCTTCTAAGACAA 

TTCCTCACGGNGGGCCAAGCGCCCTCCGGGTGCCCCTCCAAACTTNTAGAAGCTTCGCCT 

CCGCCATTTTATAACTTACCTCCCCGCGTACCTN 

Sequence 1520

CCGCGGTGGCGGCCGAGGTACATCACCCTGCTGAGGGACATCCAGGACAAGGTCACCACA 

CTCTACAAAGGCAGTCAACTACATGACACATTCCGCTTCTGCCTGGTCACCAACTTGACG 

ATGGACTCCGTGTTGGTCACTGTCAAGGCATTGTTCTCCTCCAATTTGGACCCCAGCCTG 

GTGGAGCAAGTCTTTCTAGATAAGACCCTGAATGCCTCATTCCATTGGCTGGGCTCCACC 

TACCAGTTGGTGGACATCCATGTGACAGAAATGGAGTCATCAGTTTATCAACCAACAAGC 

AGCTCCAGCACCCAGCACTTCTACCTGAATTTCACCATCACCAACCTACCATATTCCCAG 

GACAAAGCCCAGCCAGGCACCACCAATTACCAGAGGAACAAAAGGAATATTGAGGATGCG 

CTCAACCAACTCTTCCGAAACAGC 

Sequence 1521 

CCGCGGTGGCGGCCGCCCGGGCAGGTACATCACCCTGCTGAGGGACATCCAGGACAAGGT
CACCACACTCTACAAAGGCAGTCAACTACATGACACATTCCGCTTCTGCCTGGTCACCAA

CTTGACGATGGACTCCGTGTCGGTCACTGTCAAGGCATTGTTCTCCTCCMTTTGGACCC 

CAGCCTGGTGGAGCAAGTCTTTCTAGATAAGACCCTGAATGCCTCATTCCATTGGCTGGG 

CTCCACCTACCAGTTGGTGGACATCCATGTGGCAGAAATGGAGTCATCAGTTTATCAACC 

AACAAGCAGCTCCAGCACCCAGCACTTCTACCTGAATTTCACCATCACCAACCTACCATA 

TTCCCAGGACAAAGCCCAGCCAGGCACCACCAATTACCAGAGGAACAAAAGGAATATTGA 

GGATGCGCTCAACCAACTCTTCCGAAACAGCAGCATCAAGAGT 

Sequence 1522 

TACT ATAGGGCGAATTGGANCTCCCCGCGGTGGCGGCCGAGGTA CI rTTTTTTTTTTTTi 

TTTTGGTGGTAATTGAGCTGGGAAAAATTCAGAATTGGGTCATAATTAATGGTAACTAAA 

CAGATTTGNGAATATGGGACATCTGTGGNCTTGAAAACATCAGTATGATTTGNCCCCATA 

TTTCTTCAGCCTGGACAATAGAAACAGACAGGGGAGGGGGGTAAAGTGCANTAAAGTAGG 

CTGAGTGATGTGGTGCTAGCAGCTGGAGTCCAGAGAAGTTCTGACAGTGCAGGGAGCAGC 

CCCTTTGTTCTTTGGAGCACTGGAAGGGCTGAGCTGCATCTGAGGTGTTCAAGCCACCAA 

CAGGACAGGGTAGAGGACTAAGTAGCACATGTCCTCCAGAGCAGCTTCCTGTCTTTGTGT 

GGTCACATCACATCGGGGGAA 

Sequence 1523 

CGACTNCTAT AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTA C I I I 111 i I I 1 1 

I I I I 1 1 I I II I GCTTCT GGTA AAAAGAAGCATGTTACAATTTTCCCCCCCAATTCACATA 

GGACTCTAAGAACATATTTTAAATCAGTGCTTCCATACAGGMCGAMTCCACTATTTTA 

GAATTCTAAATCTTGTTGAAAAGCAACCTTATCTGAAGAGTAAACAAGAAGATTCAAAGT 

TAAGTATCAGTGCAGTCCAGAGCCCCTAAATGAATAAAACTGAATGTATCTTAAAAATAG 

GATTTGCACACCAGTAAGAGACTTGTTTACGATTCTGGGGAGGAGGGAAGAAACTGTAAG 

AGGGAGAGAAAAGGGAAGAAAAACAAGAAGAAAAATAATNGGAAAAAAAAAAAAGAAAGA 

AAG GTTTGTTT AG CTCAAAG 

Sequence 1524 

ACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTAATTGAGCTTAG 

TGATATTMCTTGGTAAATTMGCCAGTTTGTCTCTTTGCTTTATCTGTCTTGTGGAATC 

TGTATCAGTGGAAATTAAATAGTCTTTACATTTGATGMCCCTGTTTAGGTGTTGGAAAT 

ACCCATCTATTJGTTAAAAAGGCAAGGTCCCATGATTTAGTGAATGGGGGATACAGACAG 

CCTTTATTCAAGTAACTGAATAMCAAAAGAATTAGAGAGTGTGATGAGTTTGAATAAAA 

AATATAGNTCATAAAAACCAGAAATGTGATAGAGCATAGTGGCTGGAAGAAAGTTACCCA 

AGTGGCTTGGGTAGTCAATGAAGTTGACTCCAACATGCAGTAGTACGCCGGGTCCTAAGA 

TAGAGATTAAGTCATGGTTTAAATGAGGAACAATCAGTAA 

Sequence 1525 

CCGCGGTGGCGGCCGCCCGGGCAGGGTACACCACTGTGCCTGCTTTGAATCCTTTACGAA 

GAGAMAAAAAATTMAGMAGCCTTTAGATTTATCCAATGTTTACTACTGGGATTGCTT 

AAAGTGAGGCCCCTCCAACACCAGGGGGTTAATTCCTGTGATTGTGAAAGGGGCTACTTC 

CAAGGCATCTTCATGCAGGCAGCCCTTTG 

Sequence 1526 

GAGCNCCCCGCGGTGGCGGCCGNTGTAC i I I I I I I I I I I I I I I I I I GCTTNTTAATTGGA 

TGCCTGGAGACAATTCCATTTCAATTACCTTATTGGCATGACNAGATATACAAGGGCTGC 

CAATGTCAATACATTAAGACTGAGCGTGCTGGAGCAGCAGCCAGGGTTCAGGGCACTGCT 

GTGTCATCTGCGCCACGGTGCACAAAGGCAGNTTCAAAAGCATTTCAGCATGATCGCTTC 

CCTCTCTCCGCTCCTGGGGAGAGAAGGATCCTGCACACCACAGGCAAATCATGCTGAAAT 

TGAGGNGGNGCCTTTGGGACTCCCATCCCATCACAGTCTTGGGAT 

Sequence 1527 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTGTGGATATTGGTTGAACA 
AACAGGTGGGCAAAGTGAGGAAGATAAGAAGTCCATCCGTTCAGTTTCCCCACTGCGGAG 
GGAATAACACTGTCTTTCCACAGGTCACAGACTGGGATGAGCAACGGGCTGAAGGCACGT 
TTCCCGGGAAGATCTGAACTGGCTGCATCTCCCTTTCCTCTGTCCTCCATCCTTCTCCCA
AGATGGTGAAGGGGGACCTGGTNCTG
Sequence 1528 

AGGTACGCGGGATGGCACATANGACATCAGCTAGGCTTTTGGGAATCGTTTGTGTTCTTT 

GTGGAAATGTCCTTTAGAAGCACCCATGAAGTAGTGTGTTCAGACTGTGCACACAGAAAA 

CAGGCTCTGCCTTCACATGTGAGACGGTGGACTTTTCCTNTGGACAAAATGACAGCATNC

TGGCGACTCCACAGTGGAGCTGAGCGCCACTCCCTGTAGCCCGATCTGGGACTGAAACGC 

TTACACCTCTGCCTTAGAAGGAGTCCCCCNTGCC 

Sequence 1529 

CCGCGGTGGCGGCCGCCCGGGCAGGGTACGCGGGGGAATTAGTCCGAGTGGAGAGAGCGA 

GCTGAGTGGTTGTGTGGTCGCGTCTCGGAAACCGGGTAGGCGCTTGCAGCATGGCTGACC 

MCCTGACTGAMGAGCAGATTTGCAGMTTTCAMGAAAGCTTTTTTTCACTATTT^ 

CAAAAGATGGGTGGATGNGAACTATTAACCAACCAAAGGGAATTTGGGGAACCTGTAATG

AGGATCTTTTTGGGCCAGMTCCCCACMGAMGCMGMGTTTACCAGGGMCATTGAT 

TTAATGGAAAGTAAGAATGCCTTGATTGGTTAAATTGGCACCMTTTGGACTTTCCCTTG 

MTTTTCTTGACAAATGGATGGGCCAAGGMAAAATTGGAAANGACCCCAGNACCNGTTG 

GAAGGAAAGGAAAA 

Sequence 1530 

CCGCGGTGGCGGCCGAGGTACGCGGGTGTTCCTTTTTGTTCAAAGTCTATTTTTATTCCT

TG ATA I I I I I CI I I I I I I II 1 1 1 I I I I GNGGATGGGGACTTGTGAATTTTTCTAAAGGTG 

CTATTTAACATGGGAGGAGAGCGTGTGCGGNTNCANCCCAACCCGCTGCTNACTTTTCAN

CCNNTTTTCACCNGNCTNTGGGTTTTTMNACCCTNGGNTTTNTACCCCNNTCCTTTGNA 

AANCCCTNNNCCNAANNGNGGGGCAAANANCCNGNGGGCCCNCTANAAAANANCTTGCGG 

CCCCTGTCCCCCGGGGTTTTGAGGACAANTTTNCCCAAAGCNNMAAANAGANGTTTTTT 

CNCCNTCNNCTCCTGGNGGGGGGGGGGNGAAAAAAAAAAAAAAAAAN I I I I I I I I I CCCN 

GNNNNGGGGNGGNNNCTTNAAAAAAAAAAAANAATGTCCCCCCCCCCCGNNNNGTGGGCG

NNAATTTTTNTTAANAATATTTTTNNCCCCCCCCCCCCCCCCCCGGGGGGGG 

Sequence 1531 

ATAGGGCGAATTGGAGCTCCCCGCGGNGGCGGCCGAGGTACCATTCCGGGTCATCCGCAG 

AAATTCCTCATAGATGGCAACTCTGTCTACTCTCCGAGCCAGTGGCGAGAAGTTACACAG 

GGAGTCCACCOCGGTGTGGTGCCTGTTGGGGACAGACCTGAATGTTGAAACTTGACAGTC 

AGAMAATAACTCTTGATGCTGCTGTTTCGGAAGAGTTGGTTGAGCGCATCCTCAATATT 

CCTTTTGTTCCTCTGGTAATTGGTGGTGCCTGGCTGGGCTTTGTCCTGGGAATATGGTAG 

GTTGGTGATGGTGAAATTCAGGTAGAAGTGCTGGGTGCTGGAGCTGCTTGTTGGTTGATA 

AACTGATGACTCCATTTCTGTCACATGGATGTCCACCAACTGGTAGGTGGAGCCCAGCCA 

ATGGGAATGAGGCATTCAGGGTCTTATCTAGAAAGACTTGCTCCACCAGGCTGGGGTCCA 

AATTGGAG 

Sequence 1532 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCAGAAGGTCCCGGCAGCACGCACGGA 
AGAAGACGGACCCCGCGATGAGGGGGGCGGCAAGGAGCACCTTCATGTTGCGTTCGGAGA 
GGCGCAGCATCCACAGGCCCCGCGTACAGACTGGTTTGGTAAATGCTAAACTTTTGTGTC
TTTTGCCl I I I I AAAGGAATTGTTAACATTGGAATTGAGGGTATGTACCT 
Sequence 1533 

AGGTTTTGGGCGATCGTTTTCATACGAMTATTTGAGATGCTTTAGATGTGTGTGCATGT 

CAGCTGCCACCTGAAAGAMGGCCTCATTAMGATTTTCACTGATTMCTCTTTGATTGT 

TCTTGGGATCTCAGATGGGAATTCACGCTGCTTGCTGCAGAGCTCTTGGGCTAAGTGTAT 

TTTCTTAATTACTGAGAAATGCGTGTTATCAGTAAGCAGTGAAAAGTCTTGAAAAAACTA

AGTAATTTTAAAAAAAAAAAAAAAAAAAAAAANGTCCTGCCCG 

Sequence 1534 

NCGACTNCTATAGG GCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTA Cl M I I i I II I 

ill I I I I I I I I GGNGGNNGTTTTCATTTTCTTCTGGACTGGTAATTTTTACTTCT 

GGCCTTGGTGGTCTTTGAGGGTCTACATCAGCCTNATCTTGTCCACTGCCAGCTCCTTGA
GACTTTCTTTCCTTGTCTAGGACCAGTTGAAGCTCATTATTAACCTGCTGCAGCTCCTCA
CCATTCTCCTCATCGGNGCCTCCATTTACACGAGCAGGGCGTGCCTGCTGCTCACTGTGA 
TCCAAAATCTCCTNGACTTGCTCAGGGGCATfsrTGTAGCCTCC 
Sequence 1535 

CACTTA GGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTTTTGGTG 

M I i I iGGAAMGTTACATTTAGATCTATTCTGAAGCTGTTCATTTTTAACAAATAAAAT 

GTTACAGGTTTCACATGATTTATTCGCAAAAAAAAAAAAAAAAAAAAGTACTGTCTGTGA 

ACAGCAAGGAAAATATGATTACACCTAAGAGATGAAATTGGCATAGGCGAGAAGTCAGAA 

AAATAATCTATACAGCTTGCATGGTTGGGGAGTTAGGAGAGGCCAAGGCCACGTGCACGT 

AGAGCAAGAGGTAGAAGAGGCCCGGGGGCTAGAGCGCACCCTGGTGGATAGTGTGAGAAT 

TTCACACTGGCTCAAGCCTTGAAGACCACCCCAGGGGTGCGCCTTAGCAACGCACTTATG 

CAAGACCCCAACAACTGGCCCTTGAAAGGAGCTTTTTACTGGTGGGATGTG 
Sequence 1536 

AGGTACTAATATCCCTTAATGGCAGAATGTGATAATCATGGAATTAATTATTGCTAAAGT 

AGTTTTCAAATAAAAAAAAGAAAAAAGAAAACAAAACAAATTAAACTTGACACAATCTGA 

CCAMCACGTGTCAATTTACMTTTCMGGTTATTTTACAAAATACCTACATTTACACAA 

TAGGCTCCCGGCAGCATTTCCAGACAAATGTTCTTTTAATTTATCCTGACATGCTATAAA 

TGAATAMTTACACTATTTTAAAAATTATCATCAGTMGTTTTTCCTTTCTCATGGGGGT 

CAANAGCAAAAAAGAAAATNNAGGCNTGCCAAAGGAAGGATTTGGAGAGGGGAAAGCCGC 

CACGCACCACCAGTATAACCTTAAAATAAACAGGGAGGGGGCTGGGGGGAGGGGGTCAAC 

AAATTCACTTGAAACCCTCACAACAAGTGGTGGATGCCTCTTCCAAAGGGGGGNGGAAAA

GGTTCAANCTTGGCTTTTATTAGGCCCANTTTCAAAAAAAGGTATTTTTAAAAGNAAAAA 

ATGGTTTTTGGTANGGGGAAAAA 

Sequence 1537 

NCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTCAAACACCTCATGA 

GGCTGGCCAGAGATGACACCACTGCCCCGGATGCAAGGGGGCAGCC7TGCTTTGGACCCC 

GTTCTGMTCTCAGGCTTCATCTATTAGGCACACTGAGCAGAAACACTCCTCCCTGCCAC 

TTCCGAGTTGCAGCAAAAAGGGTCTMCTATGATTTAMGCAMTAATTTTAAACTTCAA 

ATTTTCATTACTGTCATACTTATGAGGACCAAATAATGTAATGACTAAGACATCAAATTA 

CATGTAATATGAATAAAACCAGTAAGAAAATGATAGTTACTTATAATTGGATCAACATAC 

AAAAAGAACGTCATTTGGCCAAAGTAAAACGGTTAAAAAAATAGTGTCTATAAAATCACC 

ATATAAAATCTTAGTATGTACCTGCCCG 

Sequence 1538 

CCGCGGTGGCGGCCGCCCGGGCATGGTACAAAACAAATTCAAAGGGTTAAGAATTTCAAG 

TTGGTAGCTTCAGAGCATTAAGCCCCAAATGTGGGGTCCCTTCTGAGCCCAGGATTATGT 

GTGACTGCACTGGTCACATTGCCATGAAGCCAGCCCTGGTGGTTGGTAGAGGGGTGGGCA 

CTGGACCTAAGCAGGGCCAATCAGAGCCATTCCTGGAAATGGATATATAAATATTAGGAA

TGCAAAGTTATTTATTTTGTTAGAGTTTGAACAACNGTGATCTGTAGAAAACCAGTCACA 

TTTCCTTMGGATGCATNMTAAATATGAGCTAMTGTMTMTTAGMCmGGTTTCC 

AAGCAGATTCCATGCTCTAATTCCCTGGATAGGTGAATATAATAAGATGATNTATCCNTG

MTMGNTNCTTTTNCTTTGCCCAAGGGGACANGATGGTAAATAAGGGCTACTAAATCAA 

GTTGNCCTTTAAAATAA

Sequence 1539 

CCGCGGTGGCGGCCGCCCGGGCAGGTACAAAACAAATTCAAAGGGTTAAGAATTTCAAGT 

TGGTAGCTTCAGAGCATTAAGCCCCAAATGTGGGGTCCCTTCTGAGCCCAGGATTATGTG 

TGACTGCACTGGTCACATTGCCATGAAGCCAGCCCTGGTGGTTGGTAGAGGGGTGGGCAC 

TGGACCTAAGCAGGGCCAATCAGAGCCATTCCTGGAAATGGATATATAAATATTAGGAAT 
GCAAAGT 

Sequence 1540 

GTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCCATGGCCCTTCCTT
TGAAATCA I inn CTTCCAGGCCCTTGCGCTCCGGGCCTGGGAGACAGAATGAGAGACA 
GCCCCAAGGATCTGTGATACACTTTCAGGGTCATTTTTTCCTTGTCCCTGCCAGTCCAGG
CACACCTTTTGTGTTCTCTCCT^ 

ATAA1 1 I 1 TCAAACCTTTAAGTTCTCTTTCCTTTTGATTGACCATTTTGTCTTTAAATCA 

"Mil ICTTCTCACATTTTATTATTATAMCATTCMGGGAAACCMGCCATTATTTTGCT 

TAGAMTTTATTCAGCCAGCTGGGGCGGTGGTGGCTCACACCTGTAATCCCAGCACTTTG 

GGAGGCCCGNGGTGGGGTAAGATCACCCTGAGGTCAAGGAGTTTCGAGACCAGCCTGACC 

MTMTNGGTGAMCACTTGNCTTNTACTTAAAAAATACCAAAATTT 

Sequence 1541 

GGAGCTAAACCCCGGNGGCGGCCGAGGTACTTACCCTCAATTTCMTGTTAACAATTTCT 

TTAAAAAGGCAAAAGACACMAAGNrTTAGCATTTACCAAACCAGTCTGTACGCGGGGCCT 

GTGGATGCTGCGCCTCTCCGAACGCAGCATGAAGGTGCTCCTTGCCGCCGCCCTCATCGC 

GGGGTCCGNCTTCTTCCTGCTGCTGCCGGGACCTTCT 

Sequence 1542 

CCGCGGTGGCGGCCGAGGTACACMACGAGATGCTACCTAGGAGAAGGGTATTCTTTTCA 

CTATTCTTTCAMTTTTCTGTATGTTCGMCATTTTCATAGTAGAAAGTTGGGGGGAAAA 

TCTGTTTCATAAACATTTCCTCAGCAGCAGTCCAGTCTATTGCATTTTMTTGGTTGTGA 

TATCATTGTTTTATGCAATACGTTCTCAACAAGTATATCCTCCGGCAAACTGAACAAGGA 

CCAAGTCTGTTCTGCCTACAGCTCTGCTTCCTCATAGCTGCTTTCCAGAACGTGACTCTT 

GCAAATTATCAAGAAAGGGGAACTAATCTAAGGGATCCCAGATCAACAGCCTNATGAAGA 

CCTTAAT1TATGNTTCTAANATAMAGATAGGMGTTTTCAAAAAAAGCCCCTGCTTCAC 

ANAGGATCAANANCAGGGGGTGGGCCCTGCTGGGCTTNCACTGGGATTTTTTGAGCATTN 

CTTTCCCNGGNGGCNCGGNAAGGGGNGTGGNGTGAGCCCNAGGGNGGAAAAAAATTT 

Sequence 1543 

CCGCGGTGGCGGCCGAGGTACTCCTTCGTAAACCATGGAGAGCCAGCCCAATGCACAGCA 

GTGGATATCATCTTTCTCAGAGTCCAGTATCACAGAATCACGACTTTGTCCAGCTGCAGG 

TGCCTGCAGGTCACACTGGCTAACTACTTCTGTGATGGGCTCTTCTTTCTGAGGTTCTGC 

CMCTTGTCTACTACATAGGGTTGATCATCCTGTTCAGGMATATTTCTTTCATTTGCTC 

TGAGCTTMTATTGTAATTTGTATTTGATCTGCTGGGTCTTTGGAGTCAGGACTGGTTTT 

ATCAAGCAGTTTGATCTTCTGAGGTCTGGTATTGTAGTTTGCTGGCCCACAGAACCTTCA 

CGTGTATTCACAGCCTCAATGCCATAAGGAAACTCT7T 

Sequence 1544 

CCGCGGTGGCGGCCGAGGTACTCCTTCGTAAACCATGGAGAGCCAGCCCAATGCACAGCA 

GTGGATATCATCTTTCTCAGAGTCCAGTATCACAGAATCACCGACTTTGTCCAGCTGCAG 

GTGCCTGCAGGTCACACTGGCTMCTACTTCTGTGATGGGCTCTTCTTTCTGAGGTTCTG 

CCAACTTGTCTACTACATAGGGTTGATCATCCTGTTCAGGAAATATTTCTTTCATTTGCT 

CTGAGCTTAATATTGTAATTTGTATTTGATCTGCTGGGTCTTTGGAGTCAGGACTGGTTT 

TATCMGCAGTTTGATCTTCTGAGGTCTGGTATTGTAGTTTGCTGGCCCACAGAACCTTC 

ACGTGTATTCACAGCCTCMTGCCATAAGGMACTCTTTTAAGAAGTTCTGACAGCTGGT 

CATGTTAGGTATAAGAACAGGGTGCCTTATCACTGGTGGATTTCATTTCT 

Sequence 1545

CGAGGTAC I I I I I I I I I I I I I I I I I I I I I GGGGAGTTTGTTGTTTTTAACCAAATTATNA 
TAGATGGAAGCATTAGGCAGCTGMTGTTCATTTGCCTTCANACATCATNTCCTATTTCA 
TTTGCTNGNCTCGCATTAAAMGAATCATTTATCAGCAAAAGCATCATTTATTTGTTTAA 
ATGACMGGTTTAGCTAGCAGAGNAGAGT1TGCNATGCTTTTAMNAMTMCNTTTGAC. 
TTTTC TTCAA GACACTACAAMCCATTTGTTCAAAAAAGGCTGCCCMNGTCATTTANAA ' 
G MTATT TTTCAAAANGTCTAmCNTATTmAAAMACTTGCCC 
GGTTTTGAATTCCACCTGGGCCTTTTCTTTTTAACCTTGAMGGGCTTAAA 
Sequence 1546 

GGCGGCGGCCGAGGNACAAGNANCNNTTGGNGGAGGGGGGNGAAACCCAANACCCGAACN 

NGGGACTGNGCAGACAAGCTATATCTTAANCCCNCNCCGGGCCAGACCNCNAGCAAGGGN 

GAGGAAGCAAAAGNCCACAGNNACNGGGGCAGGNAANNGGNANAAANGAGGGNGNGGGGC 
NGGNTGCGGTGNTTNACAGGGGGAANCCCAACACCCCGGGAGGNCGAGGCAGGAGGANCG

CCCGAGCCCAGGAGGNNGAGACCAGCCNGGGCAACANGGNGAAACCCNGGCNCNACAAAA 

AANACAAAANNAGCNGGGCANGGNGGNNNGGGCCAGNGANCCCAGCNANNNGGGGAGGCN 

GAGGNGGGAGAAANGCNNGAGCCCAGGAGGCAGAGGNNGCAAGNGAGCCCAAGAANGCGC 

CACCGGAACNCCAGCCCAGGCAACAGAANGAGAACCCGGNCNCACANNAAAAAAAAAAAG 

AAAANGAAAAAGAAAAAANGNCCCNGGCCGGGCCGGGCCGGNNCNAGAACCANGGGGAAC 

CCCCCGGGCNGCAGGAANNCCGAAANCAAAGCCAACNGANACCGGNCAACCCCGNGGGGG 

GGGCCCNGGNACCCANNNNNNNGGNNCCCNNNAGGGGAGGGGNAAANGGCNCGCCNGGNC 

GNAACAAGGGGCAAAAGCNGGNNNCCNGGGGGGAAANNGGNANNCCNGNCNNCAANNCCC 

NNNAAAAAACNAANCCGGAAGCAAAAGGGNAANNCCGGGGGG 

Sequence 1547 

AACACCACCGCGGNGGCGGCCGNCCGGGCAGGNACTTTNNTTNGGGGGNTAAAAAACCCC 

CCNAAAAACNGGGCCCTAGNAGAAGGCAACNTTCATTNCAAACGAGGGGGCCCCNGCCCC 

GNGAAAAAACGGGGAACACGACGNCNAAGGCAGANCCCCGNAAGNACCTACNNNGGACAG 

CCGGGGCAGGCGNGCAAANNNTTTGGGCNNGGCCNCGCAAAGCACAAAGGGGGACACANA 

ACCCACTGCCCACGGCGCAGGAGAAAAAAAAGAAACAGCAAANCACGAGGGGACAN 

Sequence 1548 

CCGGGCAGGTACATCANTTNTGCTGAGGGACATCCAGGACAAGGTCACCACACTCTACAA 

AGGCAGTCAACTACATGACACATTCCGCTTCTGCCTGGTCACCAACTTGACGATGGACTC 

CGTGTTGGTCACTGTCAAGGCATTGTTCTCCTCCAATTTGGACCCCAGCCTGGTGGAGCA 

AGTCTTTCTAGATAAGACCCTGAATGCCTCATTCCATTGGCTGGGCTCCACCTACCAGTT 

GGTGGACATCCATGTGACAGAAATGGAGTCATCAGTTTATCAACCAACAAGCAGCTCCAG 

CACCCAGCACTTCTACCTGAATTTCACCATCACCAACCTACCATATTCCCAGGACAAAGC 

CCAGCCAGGCACCACCAATTACCAGAGGAACAAAAGGAATATTGAGGATGCGGTGAGAAG 

GGGGTGCTCAACCAACTCTTCCGAAACAGCAGCATCAAGAGTTATTTTTCTGACTGTCAA 

GTTTCAACATTCAGGTCTGTCCCCAACAGGCACCACACCGGGGTGGACTCCCTGTGTAAC 

TTCTCGCCACTGGCTCGGAGAGTAGACAGAGTTGCCATTCTATGAGGAATTTCTGCGGAT 

GACCCGGAATGGTACCTTNGGCCGNTTCTAGAACTAGGNGGATCCCCCGGG 

Sequence 1549 

CCGCGGTGGCPGCCGAGGTACGCGGGGCTTGCATCTCTGGGGCCAAGGAGTGGTGGGTGA 

GATCTTCCATGGCCCTGGCATGGGTGATATAAGCGGGACCGGTAAGGTGGTGGAGCTCTT 

ACCAGACCCTGCANAACCCTCTCCGTGGTGTTGAACTTCCTGGAACCAGGGTGTTGCATG 

TTTTCCTCATAATGCAGGTTGGTGATGGTGAAGTTGAGGGTGAACGGCACCAGGAGAGGG 

CCAGCAGTTGTGGGGCTGGGGAGGGAGGATGGAGTCCCTGACCCAAGGTCCACTGTGGAG 

GTCCCAGGAGCTGAAAAAAGT 

Sequence 1550 

AGGTACTTTACAAACAAGTCTGAAAAAGGAGGGAGTAAAGTATGGAAGAATGATCTCTGG 

ATGTTGCTACTGGCCTCAAAAAAGCAGTGCTACAGATTTCTGTGTGAAGAGAATACGCTG

TTCACACATTTTCCTATTTCCAGGCATGAAAATATTCTATTGGGTAGAAGAAATAGGAAA 

ATCTCTTATGACAAATGAAAGACAGGTGCAAACACACCAATCCCTGTCTAGCAGTATAAA 

GCATATTGGGCTCAGAATTTGTCTGTTGCTAGCACCTGGCTTTCATACTATATCCTTATC 

AAATAATCAGATTGAAAGTCCAAATCATTCTTAAGCAAGCAAAAATCCTCAGTGGCCATA 

CCTCA 

Sequence 1551

GGCGGCCGCTCTAGAACTAGNGGANCCNTTTTGCGGGGGGAAAAAAACCCCAAGCCCANC 

GANACCGNCGACCNCGAGGGGGNNCCCGGNACCCAGCGNNNGCCCCCNAAAGAGAGGGNN 

AANNGCGCGCNNGGCGNAANCANGGNCANAGCNGNNNCCNGNGNGAAANNGNNANCCGCN 

CACNTTTTNNTTTNNCNGACGAGCCGGGNGCANAACCCCCAAANANA 

Sequence 1552 

CCGGGCAGGTACGCGGGCTGCCTGGGGATGGCAGCCGCGTGCGTTCCCTGTGCTGTCCTG 
TAGGGTTGTGGTGGCTTTCTGATATTCAGGTTGGTCCCTTTTCCTTCTTTTTGGATGGGC 
TTTCGAGGTTGGGAGAGGAAGCTCAGGGATGGAACTGACTAAACATATGTGAGATGTTTC

CTTGTTTGCTGCTCATGGAGGTTTCATTTCTGMGTCTTTTCTTGGGAGGGAAAAGGATG 

TGTGGATATATGAGGTGACTCTAGAACCCTCATTTTATGGATGAAGAAACTGTGATTCAC 

AGAGGGAGAGTGATTTGTCCAGTGTCACGTGGGGAGCTGGCCGGGGAGCCAGACCTCCTG 

TACCTCGGCC 

Sequence 1553 

TACTATAGGGCNATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATTTCCATG 

GGCCCTGTTCCCATTGATGTATACTGCTTCCTTACTAACAGTGAGGGATGACTTTCATCA 

GTCTTTTATCACCTGAACAGTCTTCCGGCCATAATGATAGTAACTATAAGCTGATGCAGC 

TGTGGTGAAAGCTGTAMACACCTTTTATGGAAGAAAAGAAATAAAATGTAGTTGTCAAG 

TCTAAAAAATAGTAGCAACGGGAATCATAATGAATACATGCAATGAATTTAAAATGTAAA 

MTGMTTTAAAAAGTAAAAAGGGCTCTGTGGTGTAATTTTTCTTAACTACAAGAGTCTA

AATACACTGCTTTTCTTTAAGAGTTCATTTTMTTAGTMCGTCAMCAAAATTATTCTA 

GATAATGAGCCCTACAAA 

Sequence 1554 

ATCGACTNCTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACGC 

GGGTGGCCAGATTMTTTTCCAGCCMTAGAGGAMGTTTTTATATATTTTTTGCTAAGG 

TGCTGGAGAGGGGAAAGGATGCCACACTTCAGAAGCAGGAGGACGTTGCTGTGGCTGCTG 

CAGTCTTGGAGCCCCTGCTCMGCTGGCCCTGCTGGCCGGCCTGACCATCACTGTTTTTG 

GCTTTGCCT ATTCf CAGCTGGCTCTGGATATCTACGGAGGGACCATGCTTAGCTCAGGAT 

CCGGTCCTGTTTTGCTGCGTTCCTACTGTCTCTATGTTCTCCTGCTTGCCATCAATGGAG 

TGACAGAAGTGTTTCACATTTGCTGCCATGAGCAAAGAGGAGGTCGACAGGTACCT 

Sequence 1555 

CCGCGGTGGCCCGCCCGGGCAGGTACAGCAAAAAAGAAACTGAGAAGCCCAAACTGCTTT 

CTTGTTAACATCCACTTATCCMCCMTGTGGAMCTTCTTATACTTGGTTCCATTATGA 

AGTTGGACAATTGCTGCTATCACACCTGGCAGGTAAACCAATGCCAAGAGAGTGATGGAA 

ACCATTGGCAAGACTTTGTTGATGACCAGGATTGGAATTTTATAAAAATATTGTTGATGG 

GAAGTTGCTAAAGGGTGAATTACTTCCCTCAGAAGAGTGTAAAGAAAAGTCAGAGATGCT 

ATMTAGCAGGTATTTTAATTGGCAAGTGCCACTGTGGAAAGAGTTCCTGTGTGTGCTGA 

AGTTCTGAAGGACAGTCAAATTCATCAGCATGGGGCTGTTTGGTGCAAATGCAAAAGCAC 

AGGTCI i I 1 1 AG 

Sequence 1556 

ATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGTCCACATCCCGAAGGCCATAGCGTC 

GAGCAAGGTCACATGGATGGCAGCACCTTACCACTCAGGCTCAGGATATTGGGATCTGTT 

GCCMAGCCACCACACATTTGCCACTCAATTCTGTGGTTTCCGCAGATGAGAAGGCTGAT 

TTGAACTGCTTCAACACAGGATCCTGCAGGACCTCCTCCTTTGCCATATGCTCCTTCAGC 

AGTTCTGTCTGCACAATCCCCGGCCACAGAGACACACAGCTGACCCCATGGCGCCGCAGC 

TCGTGGGCACAGTCAGCAGCCAGCTTGTCACACGCAGCTTTGCCCACACCATAGGGGACA 

TTGAACATATACTGCAGGCTTCCTGGGGAGGAGATGACCACGATNAGCCCCTGGCCAGCT 

GGTACCT 

Sequence 1557 

CTACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGCAGGTACGCGGGGGGT 

CCTGGTGCTGCGGCCACACCACCCTGGGGTGCTCATTGACAGAGCTGCCATAATGAACTT 

GAAAGGACGGGAATCACCGAGGGAAGCTGGGGCTCCCCTGCCCACAGGAGAGGATCCCCG 

TTCTTCAAGCTTCTCTGCTCAGTGTCTACTAACGACCGACATTTGCTAATGTAAATAATA 

GTAMTTATTGAGAATTCTMTTCTTTTACACAGTCTGTTTTTMTCTATTT^ 

TAAAATCTATGACT 

Sequence 1558 

CCGCGGTGGCGGCCGGGGGCCATTGAGACTGCCATGGAAGACTTGAAAGGTCACGTAGCT 

GAGACTTCTGGAGAGACCATTCAAGGCTTCTGGCTCTTGACAAAGATAGACCACTGGAAC 

CAATGAGGAAGGAGAGAATTCTACTGGTCACAGACAAGACTCTCTTGATCTGCAAATACG 
ACTTCATCATGCTGAGTTGTGTGCAGCTGCAGCGGATTCCTCTGAGCGCTGTCTATCGCA

TCTGCCTGGGCAAGTTCACCTTCCCTGGGATGTCCCTGGACAAGAGACAAGGAGAAGGCC 

TTAGGATCTACTGGGGGAGTCCGGAGGAGCAGTCCCTTCTGTCCCGCTGGAACCCATGGT 

CCACTGAAGTTCCTTATGCTACTTTCACTGAGCATCCTATGAAATACACCAGTGAGAAAT 

TCCTTGAAATTTGCAAGTTGTCTGGGTTCATG 

Sequence 1559 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTCCAGGCACAGCTAA

TTTTACTTTTTATTTTCTTGAGACAGGTCTCACTCTGTTGCCCAGGCTGGTCTTGAACTC 

TTGGCCTCAAGTGATCCTCCAGCCATGAGCCACTGTAAATGTCTGGGAATGCCAACTTGA 

AGCCCAGCTGGTCAGAAGTTCTAGAGGCCCAGACTTGCAACTGGTGTCTGAAGGGATGGC 

AGTCTTGGGGACTGAGCCCTCAACCTGAGGGATCTGATGCTATTTCCAGGCAGATAGTGT 

CAGMTTAAATTGGAGGATACCGCAGTTAGTGTCTGCTGCAGAACTGATTGCTTGCTTGC 

TGGTAGGGGGGAAATCCCCACATATTGGGGGGGTATTTGAAGTCTTTTGGTGTTGACTCT 

TACTGNGTTTNGTTGGTTTTTCATGTGAAGAGCAGAGGGAAAAGCAAGGGA 

Sequence 1560 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTACTGACCCAACAACTGTGACTA 

GCTGGCCACGTCATTCAGGGCTGGTGTGGCATTTATGTGTGTGTGTGTGTGTGTGTGTTT 

TTCCTGTTTGCCCAGCAGTGCATTGTGGGTTCCAAGAGTGGGTAGTGTGTGTATGTGTGT 

GTGTCAGAGGGAGACCTGGCAGGCACCTNTTTGAGAGTAGCTGTGGTCAGAGCTGTTTGG 

TCAGTGCATTATGTTGAATGAGGTCCAGGAACCCAGAGCCACCCAGCAGACACCACTGTG 

GCTTGCCAGCTGCCAAGATGGAGAAGCATGTGCCCCTGTAGAGCGTCTTCCCAGAACCAG 

ACCCCGAGCCACTCGCTTCCTCTGTGCTGNGACAACATTGGTGCCAGGGGAGATNGTTNT

TTTTTCAAAGGGGACCTACTGTAGCCCCTTTTAAT 

Sequence 1561 

CCACTCACTATAGGGCTGAATTGGAGCTCCCCGCGGTGGCGGCCGGAAGAGCAACCGAGA 

TGAAGGTGAAGATGCTGAGCCGGAATCCGGACAATTATGTCCGCGAAACCAAGTTGGACT 

TACAGAGAGTTCCAAGMACTATGATCCTGCTTTACATCCTTTTGAGGTCCCACTGAGAA 

TATGTMGAGCTTTAAATGCTACCAAACTGGMCGAGTATTTGCAAAACCATTCCTTGCT 

TCGCTGGATGGTCACCGTGATGGAGTCAATTGCTTGGCAAAGCATCCAGAGAAGCTGGCT 

ACTGTCCTTTCTGGGGCGTGTGATGGAGAGGTTAGAATTTGGAATCTAACTCAGCGGAAT 

TGTATCCGTTACCT 

Sequence 1562 

GGGCGNGTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCTTCCTGCCTTCCCCCATATA 

CTGAAGTTTGAGAGGCTGGGAAGGTGCGGAATGGGAAAAGGAGCAGCTGCTTATGTTCAG 

TTTAACATTCTCTGGGTTTCTCCATCTAGGTCTTGAGTTCATTCTCTTCCTGTCCTTTTG 

GCTTCCTTGTTTAACCTGGTCCCTGTTTCAGGAGAGAAGCCTCATCAGTGCCAAGTCTGT 

GGGGAAGACCTTCTCTCAGAGTGGAAGCAGGAATGTGCATATGAGAAAGCATCACCTGCA 

GCTGGGAGCAGCTTGGGAGTCAAGAGCAGGAGCAAACTGCTGAGCCACTAATGGGGCAGT 

AGTTTGCTTG 

Sequence 1563 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCCTTCGTAAACCATGGAGAGC 

CAGCCCAATGCACAGCAGTGGATATCATCTTTCTCAGAGTCCAGTATCACAGAATCACGA 

CTTTGTCCAGCTGCAGGTGCCTGCAGGTCACACTGGCTAACTACTTTGTGATGGGCTCTT 

CmCTGAGGTTCTGCCAACTTGTCTACTACATAGGGTTGATCATCCTGTTCAGGAAATA 

TTTCTTTCATTTGCTCTGAGCTTMTATTGTMTTTGTATTTGATCTGCTGGGTCTTTGG 

AGTCAGGACTGGTTTTATCAGCAGTTTGATCTTCTGAGGTCTGGTATGTAGTTTGCTGGC 

CCACAGAACCTTCACGTGTATTCACAGCCTCAATGCCATAAGGAAACTCTTTTAGAAGTT 

CTGACAGCTGGTCATGTAGGTATAAGACAGGTGCCTTATCACTGTGGATTTCATTTCTTG 
Sequence 1564 

CCGCGGTGGCGGCCGAGGTACAMTTGTCGTTTTTATTCCTCTTATTGGGATATCATTTT
MAAACTTTATTGGGTTTTTATTGTTGTTGTTTGATCCCTAACCCTACAAAGAGCCTTCC 
TATTCCCCTCGCTGTTGGAGCAMCCTTATACCTTACTTCCAGCAAGCAAAGTGCTTTGA

CTTCTTGCTTCAGTCATCAGCCAGCAAGAGGGAACAAMCTGTTCTTTTGCATTTTGCCG 

CTGAGATATGGCATTGCACTGCTTATATGCCAAGCTAATTTATAGCAAGATATTGATCAA 

ATATAGAAAGTTGATATTCAACCTCACAAGGGCTCTCAAAGTATAATCTTTCTATAGCCA 

ACTGCTMTGCAAATTAAMCATATTTCATTTTMCATGATTTCAAAATCAGTTTTTCAT 

ACTACCCTTTGCTGGAAGAAACTAAAAATATAGCAAATGCAGAACCACAAACAATTCGAA 

TGGGGTAGAMCATTGTAAATATTTACTCTTTGCAAACCCTGGTGGGTATTTTATTTTGG 

CTTCA7TTCAATCATTGAAGGTATATTCTTATTGGAAATGTACCTGCCCNGGCCGGCCGC 

TCTAGAACTAGTGGAATCCCCCNGGC 

Sequence 1565 

ACTACTATAGGGGCGAATTGGAGCTCCACCCGCGGTGGCGGCCGCCCGGGCAGGTACTTT 
TTGGTGTTTTTTGGAAAAGTTACAT1TAGATCTATTCT 

TAAMTGTTACAGGTTTCACATGATTTATTCGCAAAAAAAAAAAAAAAAAAAAGTACTGT 

CTGTGAACAGCAAGGAAAATATGATTACACCTAAGAGATGAAATTGGCATAGGCGAGAAG 

TCAGAAAAATAATCTATACAGCTTGCATGGTTGGGGAGTTAGGAGAGGCCAAGGCCACGT 

GCACGTAGAGCAAGAGGTAGAAGAGGCCCGGGGGCTAGAGCGCACCCTGGTGGATAGTGT 

GAGAATTTCACACTGGCTCAAGCCTTGAAGACCACCCCAGGGGTGCGCCTTAGCAACGCA 

CTTATGCAAGACCCCAACAACTGGCCCTTGAAAGGAGCTTTTCACTGGTGGGATGTGGCC 

CTGCTTGTATTCAGMCCATAGTTTTAACAAGCCANCATTAAATCCACAAGTCTTTGCCA 

AAGCACTTTAAGCCTNTTGACATTTATTGGMTTAATTTACCTGCAAGGAAAGTTCATAT 

ATCTAGCTTTGGTAACCCTACATTCGGGAAAAATGTTTCCATGANATAACTAAAANCCCC 

ATGAATGATACAATCTTGGACAAAACCCAAAGNNGGCATAATTAGCATAAAACTCCAAAT 

Sequence 1566 

CCCGCGTCCGGCATCTCCCAACGTGACTGACCCGCCGACCACGACCCGCAAAGTGGTCCC 

GACGACGCTCACCACCACCAAGCCGCCAGAAACCTGTGAGAGCTTCAACAGCTGTGTTTC 

CTGTGTCAACGCCACCTTGACTAATAATATTACCTGCGTCTGGCTAGATTGCCATGAAGC 

AAATAAGACCTATTGTTCAAGTGAATTAGTAAGTAATTGTACCCAGAAGACCAGTACTGA 

CTCCTGTTCTGTAATACCTACCACCCCAGTGCCAACCAATTCTACAGCTAAGCCTACAAC 

TCGGCCTTCCTCTCCTACACCTACTCCCTCAGTTGTCACATCAGCAGGTGCAACAAATAC 

CACTGTGACTCCAACCTCACAGCCTGAGCGGAAGTCCACCTTTGATGCAGCCAGCTTCAT 

TGGAGGAATCGTNCTTGTCTTGGGTGTGCAGGCTGTAATTTTCTTTCTCTATAAATTCTG 

CAAATCTAAAAGAACG 

Sequence 1567 

TCGCCNCGCGTCCGGGCAACTGCAGTTGGAAAAAAAGATTCAACTTCAAAGCAGAGGATT 

TTTGATGAAGAACCAGCTAATGGAGTGAAGATAGAAAGGTTTACAAGGGATGATCCTTGG 

TTATCTTCATGTGAAGAAGTGGATGATTGTAAAGACCAGTTGGAGAAGCAACAGGAAAAA

CMGAGATACTTTTGCAGGAAGTGGCATTCACTCAAAGGAAAGCAGTTATTCATGAGAGA 

GTCTGCAAMGTGATGAMCTGGGGAGMGAGTGGTCTGMTTCCAGTCTATTTTCATCC 

CCAGTTATACCCATAAGAAACCATTTTCATAAACATGTATCACATGCTAAAAAATGGCAT 

CTTAATGCTGCTGTAAACAGTCATCAGAAGATTAATGAGAATGAGACACTATATGAAAAT 

AATGGMTGTGGAMACCCCCTCAGAGCATTCACCTTATTCAGTTTACAAGAACCTCAAA 

CA 

Sequence 1568 

GCTCCATGCCCTTCTCTGAGACGGGGACCAGGGGATGGCAGNCATGCACCTGACAGCCTG 

GCCCNAGAAGTCGGTGACCTTTGAGGACGTGGCTGTGTACTTCACCCAGGCGGAATGGGA 

TGGCCTGTCCCCTGCACANAGGACCCTGTACAGGGATGNGATGCTGGAGAATTATGGGAA 

TGTGGCCTCCCTGGGATTTCCACTTCTCAAACCTGCTGTGATCTCACAACTGGAGGGAGG 

AAGTGAGCTGGG 

Sequence 1569

CGCGTCCGTTTCTCCTGGCACCCTGTATTCATGGCCTTGGCGTTCTGCCTCTGCATGGCT 
GAAGCCATCCTACTCTTCTCACCTGAACACTCCCTGTTCTTCTTCTGCTCCCGAAAAGCA 
CGGATCCGGCTCCACTGGGCAGGGCAGACCCTAGCCATCCTCTGTGCAGCTCTGGGCCTG

GGCTTCATCATCTCCAGCAGGACCCGCAGTGAGCTGCCTCATCTGGTGTCCTGGCACAGC 

TGGGTGGGAGCCCTGACACTGCTGGCCACTGCTGTCCAGGCACTGTGTGGGCTCTGCCTC 

CTTTGTCCCCGGGCAGCCAGGGTCTCAAGGGTGGCTCGCCTCAAGCTCTACCATCTGACA 

TGTGGACTGGGTGGTCTACCTGATGGCTACAGTAACGGTGCTTCTGGGCATGTACTCAGT 

ATGGTTCCAGGCCCAGATCAAAGGTGCGGCCTGGTACCTGTGCCTG 

Sequence 1570 

CGTCCGCTAAGTTCCAATATTGAAAAATCTGTAAAAGACCTCCAGCGCTGCACAGTGTCT 

CTTGCACGGTATCGAGTTGTAGTTAAAGAAGAGATGGATGCCTCCATTAAGAAAATGAAA 

CMGCCTTTGCTGAATTGGAGAGCTGTTTAATGGATCGAGAAGTGGCGTTGCTTGCTGAA 

ATGGACAAAGTGAAAGCTGAAGCAATGGAAATTTTGCTCAGCCGACAAAAGAAGGCTGAA 

CTTCTAAAGAAGATGACTCATGTGGCTGTTCAAATGTCAGAGCAGCAA 

Sequence 1571 

GCAGCCGGCCATGCAGGCCGTCATCCGAACTGACATGGACCAAATTATCACCTCCAAGGA 

GCACCTTGCCAGCAAGATCCGAGCCTTCATCCTCCCCAAGGCAGAGGTGTGCGTGCGGAA 

CCATGTCCAGCCCTACATCCCATCCATCCTGGAGGCCCTGATGGTCCCCACCAGCCAGGG 

CTTCACTGAGGTGCGAGATGTCTTCTTCAAGGAGGTCACGGACATGAACCTGAACGTCAT 

CAACGAGGGCCGGCATTGACAAGCTGGGCGAGTACATGGAGAAGCTGTCCCGGCTTGGCG 

TACCACCCCCTGAAGATGCAGAGCTGCTATGAGAAGATTGGAGTTCGCTGCGACTGGACG 

GGCTGCAGC 

Sequence 1572 

CCGMCAANGTGGCCACCCAGGTTTTTAACCCAAGTCTAGTGGTCATCCTATTCTTTCCA 

CACCAACATGCCAAAAGCCTTACCTNGAAAGAAAATATAATTTGCAAGAAGCATCACAGT 

GCCGGGGTCTATATTCTCGATCAGGTTGNTAATTTTCCCATGGGTTTTTTGACTGATAAA 

GNCATTGATCTGCTTCTGAGCCATTTCCAAATTCTGAAAGTTGGTMGGATGGTTTCGGN 

ACTGTAAAAGTTCTTGGCATCTTCC 

Sequence 1573 

CGCGTCCNGGNCGGAGAAGACAGTAGGGATACTGGATATGGGAGGAGCCTCTCTCCAAAT 

TGCTTATGAAGTTCCTACCTCAACCTCTGTCCTTCCTGCAAAGCAGGAAGAAGCTGCCAA 

GATCCTGCTGGCTGAGTTCAACCTGGGCTGTGATGTGCAACACACTGAACACGTGTACAG 

GGTTTATGTCACMCTTTTCTGGGTTTCGGAGGCAACTTTGCCCGGCAGCGCTACGAAGA 

CCTTGTTCTGAATGAAACTCTTAACAAAAACAGATTGCTTGGTCAGAAGACAGGTCTGAG 

TCCCGACAATCCATTTCTGGATCCCTGCCTGCCAGTGGGACTCACAGATGTGGT 

Sequence 1574 

CGCCGTCCNGTTTACTTGGAGTGTCCAAAACTGCAAGCAGTAGAGAAATAAGACAAGCTT 

TCAAGANNNTGGCATTGAAGTTACATCCTGATAAAAACCCGAATAACCCAAATGCACATG

GCGATTTTTTAAAAATAAATAGAGCATATGAAGTACTCAAAGATGAAGATCTACGGAAAA 

AGTATGACAAATATGGAGAAAAGGGACTTGAGGATAATCAAGGTGGCCNGTATGAAAGCT 

GGAACTATTATCGTTATGATTTTGGTATTTATGATGATGATCCTGAAATCATAACATTGG 

AAAGMGAGAATTTGATGCTGCTGTTAATTCTGGAGAACTGTGGTTTGTAMTTTTTAC 

Sequence 1575

GAGGCGCTCAACCTACCGAGGCGCCCACAACCTGTCCGGCCTGCTGGGCTTGTCCCTGCG 

CTACAACAGCCTCTCGGAGCTGCGCGCCGGCCAGTTCACGGGGTTAATGCAGCTCACGTG 

GCTCTATCTGGATCACAATCACATCTGCTCCGTGCAGGGGGACGCCTTTCAGAAACTGCG 

CCGAGTTAAGGAACTCACGCTGAGTTCCAACCAGATCACCCAACTGCCCAACACCACCTT 

CCGGCCCATGCCCAACCTGCGCAGCGTGGACCTCTCGTACAACAAGCTGCAGGCGCTCGC 

GCCCGACCTCTTCCACGGGCTGCGGAAGCTCACCACGCTGCATATGCGGGCCAACGCCAT 

CCAGTTTGTGCCCGTGCGCATCTTCCAGGACTGCCGCAGCCTTCAAGTTTCTCGACATCG 

GATACAATCAAGC 

Sequence 1576 

GACCACGCGTCCGCGCACCGCTTCATTGAGGCTGCAAGAGCACACGGGCACCCACGTGCT 
GCAGTACGAATGCAGCCTGGAGCAGGCCCTGCGCCACGTGCAGGAGCTCCGGCCCATCGC

CCGCCCCAACCCTGGCTTCCTGCGCCAGCTGCAGATCTACCAGGGCATCCTGACGGCCAG 

CCGCCAGAGCCATGTCTGGGAGCAGAAAGTGGGTGGGGGTCTCCCCAGAGGAGCACCCAG 

CCCCTGAAGTCTCTACACCA7TCCCACCTCTTCCGCCAGAACCTGAGGG 

Sequence 1577 

CTACACTCAACTTCACCATCTCCAATCTCCAGTATTCACCAGATATGGGCAAGGGCTCAG 

CTACATTCAACTCCACCGAGGGGGTCCTTCAGCACCTGCTCAGACCCTTGTTCCAGAAGA 

GCAGCATGGGCCCCTTCTACTTGGGTTGCCAACTGATCTCCCTCAGGCCTGAGAAGGATG 

GGGCAGCCACTGGTGTGGACACCACCTGCACCTACCACCCTGACCCTGTGGGCCCCGGGC 

TGGACATACAGCAGCTTTACTGGGAGCTGAGTCAGCTGACCCATGGTGTCACCCAACTGG 

GCTTCTATGTCCTGGACAGGGATAGCCTCTTCATCAATGGCTATGCACCCCAGAATTTAT 

CAATCCGGGGCGAGTACCAGATAAATTTCCACATTGTCAACTGGAACCTCAGTAATCCAG 

ACCCCACATNCTCAGAGTACATCACCCTGCTGAGGGACATCCAGGACAAGGTCACCACAC 

TTTTACAAAGGCAGTCAAACTACATGACACATTCCGCTTCTGCCTGGTCACCAACTTGAC 

GATGGACTTCCGTGTTGGTCACTGTCAANGCATTGGTCTTCTTCAATTTG 

Sequence 1578 

GCGGCCGCCGGGCAGGTACCTAACCTACCTTTAAGACTGGGATAACTATTGNNNTNCAAT 

AGNTTTATACCGGATATAGTTATTTATCGCATGATGAGTAATAGAAAGGAGCTTCACAGC 

TTCACTTAAAAATGGGGGTGCGGMCATTAGTTAGTTGGTAGGGTAATGGCCTACCAAGA 

CGATGATGTTTAGCCGGGCCGAGAGGCTGTACCT 

Sequence 1579 

CTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCTAACCTACCTTTAAGACTGGGATNTCT 

ATTGNTAACAATAGCTAATACCGGATATAGTTATTTATCGCATGATGAGTAATAGAAAGG 

AGCTTCACAGCTTCACTTAAAAATGGGGGTGCGGAACATTAGTTAGTTGGTAGGGTAATG 

GCCTACCAAGACGATGATGTTTAGCCGGGCCGAGAGGCTGTACCT 

Sequence 1580 

CTCCCCGCGGTGGCGGCCGAGGTACAGCCTCTCGGCCCGGCTAAACATCATCGTCTTGGT 

AGGCCNTTNCCCTACCMCTMCTMTGTTCCGCACCCCCATTTTTAAGTGAAGCTGTGA 

AGCTCCTTTCTATTACTCATCATGCGATAMTMCTATATCCGGTATTAGCTATTGTTTC 

CAATAGTTATCCCAGTCTTAAAGGTAGGTTAGGTACCTGCCCG 

Sequence 1581 

TTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTAACCTACCTTTAAGACTGGGATAACTA 
TTGTTNAACAATNNCTAATACCGGATATAGTTATTTATCGCATGATGAGTAATAGAAAGG 
AGCTTCACAGCTTCACTTAAAAATGGGGGTGCGGAACATTAGTTAGTTGGTAGGGTAATG 
GCCTACCAAGACGATGATGTTTAGCCGGGCCGAGAGGCTGTACCTGCCCG 
Sequence 1582

AGGTACAAGCCTCTCGGCCCGGCTAAACATCATCGTCTTGNTAGGCCNTTACCCTACCAA 

CTAACTAATGTTCCGCACCCCCATTTTTAAGTGMGCTGTGMGCTCCTTTCTATTACTC 

ATCATGCGATAAATAACTATATCCGGTATTAGCTATTGTTTCCAATAGTTATCCCAGTCT 

TAAAGGTAGGTTAGGTACCTGCCCG 

Sequence 1583 

CCGCGGTGGCGGCCGAGGTACAGCCTCTCGGCCCGGCTAMCATCATCGTCTTTTTNGGC 

CATTACCCTACCMCTAACTMTGTTCCGCACCCCCATTTTTAAGTGAAGCTGTGAAGCT 

CCTTTCTATTACTCATCATGCGATAAATAACTATATCCGGTATTAGCTATTGTTTCCAAT 

AGTTATCCCAGTCTTAAAGGTAGGTTAGGTACCTGCCCG 

Sequence 1584 

TCTTCGANCACGNTTCGGGCGGCTTTTCCCCGGGCAAGGCTTCTAAATCGGGGGGGCTTC
CTTTTAGGGGGTTCCGAATTTAAGTGGCNTATAACGGGCANCCTTCGAACCCCCAAAAAA 
AACTTTG 
Sequence 1585 



AGGTACAGCCTCTCGGCCCGGCTAAACATCATCGTCTNGGTAGGCCATTACCCTACCAAC
TMCTMTGTTCCGCACCCCCATTT^ 

TCATGCGATAMTAACTATATCCGGTATTAGCTATTGTTTCCAATAGTTATCCCAGTCTT 

AAAGGTAGGTTAGGTACCTGCCCGG 

Sequence 1586 

CGGGCAGGTACCTMCCTACCTTTAAGACTGGGATAACTATTGGAAACAATAGCTAATAC 

CGGATATAGTTATTTATCGCATGATGAGTAATAGAAAGGAGCTTCACAGCTTCACTTAAA 

AATGGGGGTGCGGAACATTAGTTAGTTGGTAGGGTAATGGCCTACCAAGACGATGATGTT 

TAGCCGGGCCGAGAGGCTGTACCTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTCGCnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCGCCCAACGT 

GAACCGGTTACACTTTGCCMGCGCCCTANCGCCCGGTCCTTTTCGCTTTTCTTTCCTTT 

CCTTTTCTTCGGCCACGGTTCNNCCCGGCTTTTTCCCCGTCMGGCTTNTAAATTCGGGG 

GGCTTTCCTTTTAAGGGNTCCCGATTTMGTGGCTTTANCNGGNAACCTTNGAACCCCCA 

AAA 

Sequence 1587 

AGGTACAGCCTCTCGGCCCGGCTAAACATCATCGTCTTGGTAGGCCATTACCCTACCAAC 

TAACTMTGTTCCGCACCCCCATTTTTAAGTGAAGCTGTGMGCTCCTTTCTATTACTCA 

TCATGCGATAAATAACTATATCCGGTATTAGCTATTGT7TCCAATAGTTATCCCAGTCTT 

AAAGGTAGGTTAGGTACCTGCCCGG 

Sequence 1588 

CGGGCAGGTACCTAACCTACCTTTAAGACTGGGATAACTATTGGAAACAATAGCTAATAC 

CGGATATAGTTATTTATCGCATGATGAGTAATAGAAAGGAGCTTCACAGCTTCACTTAAA 

AATGGGGGTGCGGAACATTAGTTAGTTGGTAGGGTAATGGCCTACCAAGACGATGATGTT 

TAGCCGGGCCGAGAGGCTGTACCT 

Sequence 1589 

TACCNCGCGTCCGGGGCCCGGATGCTGGGGGCCACCAGGGCCCCGGGGATGTGCTGGTCT 

TCATGGATGCCCACTGCGAGTGCCACCCAGGTTGGCTGGAGCCCCTCCTCAGCAGAATAG 

CTGGTGACAGGAGCCGGGTGGTATCTCCGGTGATAGATGTGATTGACTGGAAGACTTTCC 

GGTATTACCCCTCGAAGGACCTGCAGCGTGGGGTGTTGGACTGGAAGCTGGATTTCCATT 

GGGAACCTTTGCCGGAGCATGTGAGGAAGGCCCTCCAGTCCCCAATAAGCCCCATCAGGA 

GCCCTGTGGTGCCCGGAGAGGTGGTGGCCATGGACAGACATTACTTCCAAAACACTGGAG 

CGTATGACCCTCTTATGTCGCTGCGGGGTGGTGAAAACCTCGAACTGTCTTTCAAGGCCT 

GGCTCTGCGGTGGCTCCGTTGAAATCCTTCCCTGCTCTCGGGTAGGGCACATCTACCGAA 

ATCAGGATGCCCCGTCCCCGTTTGACCAGGAGGCCACCTTGAGGAACAAGGTTCGCATTG 

CTGAAGACCTGGCTTGGGGTCA 

Sequence 1590 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGTGATATCCACATATTTTTGAG

AAAAATTCCCAAGCCAGGCGAATGTGGATTGGAATAAAGACATAGGCAGTGTATACCACC 

ATAGCAATAATGGTTAGTAAGATGGTGTTAAACATAGATCGCTCCCAGGGCTCTAAAACA 

GCACAGCAGCTMTGATTTGGTATTGATAGTAGAGCCAGGAGAAATATTCCTTCACACGC 

CTCAAATCCATGGTTGGCTCCTTCAAGCTGCAGTAAGTTTGTCCTAAGAAAGTCCAGGTC 

TGGTTCTTCAGCCTTGCTCCTTC 

Sequence 1591 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACATTTTGAATATCAATTTCTAAATATTTACCC 
AAAAATGTATATTTAAAG I I I I I AACAACTTCTTTTCAATCTTGAAAAAGTTCCCTCATT 
TTTTCTTTTAAAATCCCA1TMCAATCAATGGTTCTTTTAAMTAGM 
TGTCCATTTAAAGAAGAATATGGATCTTGGAAAATCATTTGAAT AI I I I I T CTTAAAAAT 
AGATTTTGTTTTTTTGTTAATTTGTTTCCAGAMTAACTTCA 
CCACCGTAAGATTCATATAATCTAATCAAAGATCTACCAACTGGTTGTTTTACCCTGGAT
CCAGATTCACCMTTAMCCTAAMTTTTTCCTTGTTGGATCTAAMTTTTACCGTTATC 
AACAGCTTTGACAACCCAAANTTATTGGAAAAATA I 1 1 I I I I AAGCTTTGAACTTTTAAG 
GGTGGGTTAACTTGCATTA 
Sequence 1592 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGG 

GTTTCCCTGCTATGTTTTGTCACACATCATTGCTGGGAGAGTTCAGTGCTATCCAAGTGA 

CTCCCCGGGGAGAGGACAACTGGAAATTTATGCCTGGTGTCTTCCTGGACTCCGACCTAT 

GCACCTTTTTCGTTGGATGATTTTMCCTGTCTCCTTTTGCTGCAATGAACCATAACCAT 

GAGTATATCAGCTTTTCTAATTTTTGTGMGTCCCTCAAGTGAATTATCAAACCTAAGGG 

TAGTTTTGGGGACCCCCATCACAGAAGAGGTCAAACAGGGGGCAGG 

Sequence 1593 

CCGCGGTGGCGGCCCGCCCGGGCAGGTCTCTTGTCTAGTATACTCAAGGCAGCCTAGTAA 
ATTATTATTTATCTATACAATACTGGAAAAACTTGTAGACAAAAACATGACTTGAATTGC 
TAAAAAAAAAAAAAAAAAAAGAGGGAGAATGAAAACTTCCGGACGCGTGGGTCGAAGCTT 
GACCT 

Sequence 1594 

CCGCGGTGGCGGCCGCCCGGGCAGGTAGGCTGTCTACACTGACATCATCCAGGGCAAGCT 

GGACCAGCGAAACCAGCTGCTGGAAGTGGATTTCTGCATTGGCCGTGACATCCGAAAGAA 

GGATATCAATAATATTGTCAAGACCCTGCATGAATGGTGTGATGGCTGTGAAGCAGTTCT 

ACTGGGCATNGAGCAGCAAGTTCTGAGAGCCAACCAGTACCTT 

Sequence 1595 

CCGCGGTGGCGGCCGCCCGGGCAGG I I I I I I II I I I II I I I CTTTCTGTTCCTTGGACTA 

GATMTCTGMATCAACTGTCTTCAGTTTTGCAGACTCTTGTGCCAGCTAAAATGTTCTG 

TTGAGCCCCAGAAGCTAATTTTCTTTTCAGTTA 

TTTTTAATATAATTTCTACCTC I I 1 1 I I ATATTCTCCATTTGGTGAGACATTCACATACT 
TTCTTCCAG I I I I I I I I AGACGTAGTTTCCTTG AGTTCTTTGAGCATATTTAAAATAGTT 
GATTTAAAGTATTTGTCTAGTTACTCCACCATCTGAGTTTCCTCAGGGAAMTTTCTATT 
GCCTCCTTTTTTCCTGTGTGTGGTCGGNCCATACGGACGCGTGGGTCGAAGACCT 
Sequence 1596 

ACTTNNTN I I I I I I I I I I I I I I I I I I I I AAGCGCCCGGCATTTTCTAAATAAAATCATTT 
TATTTGGNAAAAGGGTTTTMCAGNTATACCTTTCTAGCTAAAAGAAAAGAAATAGCGGG 
ATGTACCT 
Sequence 1597

AGGTACGAAGAGAAAGGAATCAAAGCCTACTANCTCAAAAAATTGTCAAATTGCAAATGA 

GGACATCTAGAGAGGAAGAAAGGAAAAAAGGAACTAAAAAACAGAAACAATTAACAGTAA 

GTTCTTAACTATCAATAATTATTTTAAAAGTAAATAGATTAAATTATCTAATCAAAAGAC 

ATTGAATGGCTGAATGGATTAAAAAACAAGATCAACTATACATTGCCCATCAGAGATTCA

TTTTAGCTTTAAGGATAAACACTGGTTGAAAGTGAAAAAAGTCAAGGCTGGGCATGGTGG 

CTCATGTCTATAATTCCAGCACTTTGGGAGGCCAAGGTGGGCAGATAATCTGAGGTCAGG 

AGTTTGA 

Sequence 1598 

CCGGGCAGGTACCACCTGAAGACCCTCACACTCAACTTCACCATCTCCAATCTCCAGTAT 

TCACCAGATATGGGCAAGGGCTCAGCTACATTCAACTCCACCGAGGGGGTCCTTCAGCAC 

CTGCTCAGACCCTTGTTCCAGAAGAGCAGCATGGGCCCCTTCTACTTGGGTTGCCAACTG 

ATCTCCCTCAGGCCTGAGAAGGATGGGGCAGCCACTGGTGTGGACACCACCGGCACCTAC 

CACCGTGACCCTGTGGGCCCCGGGCTGGACATACAGCAGCTTTACTGGGAGCTGAGTCAG 

CTGACCCATGGTGTCACCCAACTGGGCTTCTATGTCCTGGACAGGGATAGCCTCTTCATC 

AATGGCTATGCACCCCAGAATTTAT 

Sequence 1599 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTACACAGGACCAATGCTGCC 
CATCCACATGGAATTTACAAACATTCTACAGCGCAAAAGGCTCCAGACTTTGATGTCAGT

GGATGATTCTGTGGAGAGGCTGTATAACATGCTCGTGGAGACGGGGGAGCTGGAGAATAC 

TTACATCATTTACACCGCCGACCATGGTTACCATATTGGGCAGTTTGGACTGGTCAAGGG 

GAMTCCATGCCATATGACTTTGATATTCGTGTGCCTTTTTTTATTCGTGGTCCAAGTGT 

AGAACCAGGATCAATAGTCCCACAGATCGTTCTCAACATTGACTTGGCCCCCACGATCCT 

GGATATTGCTGGGCTCGACACACCTCCTGATGTGGACGGCAAGTCTGTCCTCAAACTTCT 

GGACCCAGAAAAGCCAG 

Sequence 1600 

TCNCCGCGGTGGCGGCCGCCCGGGCAGGTACGTTCACTGTCTCATATAATCNCAGCCTCC 

TGTGTGATAGCTGGTGTCATCTCCACTTACAGATGAGGAAACTGAGGATAAGCAGGGTTG 

AATAACTTGCTCGAGATCACAGAGCCACGGGTGGNGAAACAGGATACAAACCTGGTTCTG 

TTTGACTCTAAGACCATTCATNTTTCCTCTGAAACTCAGTATTGCACAGTGTAGAAATGC

AG I I I I I AAGACCTCCCAAAGTGACGTGCTGNGTCACTGCCCATCATTAGCTANATTGAG 

TMATTGCTGCTTAGCCCCANTTGTTTTGACAGAATCAATAGCCCTTGCTGAGGGGCCAN 

CAGCCTACGGACACAGGAGCATGCTTCATGGGCAAGACCACCATGCACACTCAGAGGGGA 

AGCCACAAGGCAACCTCCACGCCACTTAAGATTTGTAGGGCTCTGAACACATCACCAGAT 

ACAGACCACCTACTTATTTTTTNCACTGTAATANCAAAGGCAGGMTCTTTTTNCTGTAG 

GGTAAAGTTTGGGGG 

Sequence 1601 

GGCAGGTACAAGGCCCCAAAGAGGAGGAATTCCTTGTAGAGGAGCTTGTAGATGCTTCCC 

CTCCAGCGGAGAAGCAGGCCAGAGAAACCTCCGAAGCGGGCCTCCGCCACTTTGAGAGTG 

TATGAAACCGTCATGGTGCTGGGAGCCTGGGGCAGGAGGTCACAAGAGTTGCCCCCAGGG 

CTGTCGTTTAGTTCTCCAGACAACCTCCCTTCCACTCTGGTCTCCACACCCCAGCCTTCA 

CCCTGCGTCAAGTGGACAAG 

Sequence 1602 

AGGTACCAGTGGGGACTTCTGAAAGAACNNTACTNGTGTCAGTGGAAAAGCTGGCATTTT 

GGGAAATGCTGGTCTCTCTCAGTCCAGGAGTCAAGGAATATGTTGACTCTCTCTTAATTT 

TTGTAGTCTCAGAGGAAACAGACATTGATGTGGAAACAGTTGTATGCCCCATGGTGGAGG 

TGGTATCCATNGGAGCTGTGGCCTTGGTTTTTTCTGAGTCAGCTAGGACAGAGGATTGTG 

ACCCATGTCCAGAACTGGTGGTTTCCACATTAGTCGCTGCTGTGCTTGTGGAAGGATGCA 

TGGCTTCTATAGCTGTGGTGTCTTCATCTGTTGTCAGTATCTCATGTGAGGNACCTGCCC 
G 

Sequence 1603 

CCGGGCAGGTACTGTGATATCCACATATTTTTGAGAAAAATTCCCAAGCCAGGCGAATGT 

GGATTGGAATAAAGACATAGGCAGTGTATACCACCATAGCAATAATGGTTAGTAAGATGG 

TGTTAAACATAGATCGCTCCCAGGGCTCTAAAACAGCACAGCAGCTAATGATTTGGTATT 

GATAGTAGAGCCAGGAGAAATATTCCTTCACACGCCTCAAATCCATGGTTGGCTCCTTCA 

GGCTGCAGTAAGTTTGTCTTMGAAAGTCCAGGTCTGGTTCTTCAGCCTTGCTCCTTCGC 

GAAATGATCCTGTGTGGGTTAGTTCTCCTCTCTGGGTTGCTGTTTCCTCA 

Sequence 1604 

AGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACCACCTGAAGGCCCTCACACTC 

AACTTCACCATCTCCAATCTCCAGTATTCACCAGATATGGGCAAGGGCTCAGCTACATTC 

AACTCCACCGAGGGGGTCCTTCAGCACCTGCTCAGACCCTTGTTCCAGAAGAGCAGCATG 

GGCCCCTTCTACTTGGGTTGCCAACTGATCTCCCTCAGGCCTGAGAAGGATGGGGCAGCC 

ACTGGTGTGGACACCACCTGCACCTACCACCCTGACCCTGTGGGCCCCGGGCTGGACATA 

CAGCAGCTTTACTGGGAGCTGAGTCAGCTGACCCATGGGTGTCACCCAACTGGGCTTCTA 

TTGTCCTGGACAGGGATAGCCTCTTCATCAATGGCTATGCACCCCAAAATTTATCAATCC

GGGGGCGAGGTACCTGCCCCGGGCGGGCCGCTTAAAACTAGGNGGGATCCCCCNGGCTTG 

CAGGAATTTCGATATTCAAGCTTATCGATACCCGTCCNACCTTCGAGGGGGGGG 
Sequence 1605 

CCGGGCAGGTACCACNTGAAGACCCTCACACTCAACTTCACCATCTCCAATCTCCAGTAT 
TCACCAGATATGGGCAAGGGCTCAGCTACATTCAACTCCACCGAGGGGGTCCTTCAGCAC

CTGCTCAGACCCTTGTTCCAGAAGAGCAGCATGGGCCCCTTCTACTTGGGTTGCCAACTG 

ATCTCCCTCAGGCCTGAGAAGGATGGGGCAGCCACTGGTGTGGACACCACCTGCACCTAC 

CACCCTGACCCTGTGGGCCCCGGGCTGGACATACAACAGCTTTACTGGGAGCTGAGTCAG 

CTGACCCATGGTGTCACCCAACTGGGCTTCTATGTCCTGGACAGGGATAGCCTCTTCATC 

AATGGCTATGCACCCCAGAATTTATCAATCCGGGGCGAGTACCT 

Sequence 1606

CGGCCGCCCGGGCAGGAACNNNNTTTTTTGGGGGGGGGAAAACCNAGACGGAGCCNCGCN 

CAANGGCCCAGGCGGGAGTGNAAGGGCACCAGGGGGGGCNCACCACAAANACCGCCGCCC 

GGGNGAAAGCCACNCNCCGGCCNNAGCCNCCGGAGNAACGGGGGGAACAGGGGCAGGCCA

TN ii i 1 1 M i I IGNGGGGGGNGNANGGGGNGGANNCNCCAGGNAAAAANCANGCNGGCCA 

GGGGGGGGGGGAACNCCNGACCTNATGANGCACCCGCCNNGGNCNCCCAAAANGCGGGGA 

NNANAGGGGNGAGCCACCGNGCCNAGCNGACGG 

Sequence 1607 

CGAGTTACCAGAAGGAGAGATCACCACCATCGAGATCCACCGCACTAACCCGTACATCCA 

GTTAGGAATCAGCATCGTTGGCGGCAATGAGACGCCACTGATCAACATCGTNATTCAGGA 

AGTNTACCGGGATGGGGCCATCGCCAGAGATGGAAGGCTCCTTGCCGGAGACCAGATTCT 

TNAGGTCAACAACTGTGATATCATGCAACGTGTCCCATAACTACGCCCGGGCTGNCCTTT 

CCCAGCCCTGCAGNACCCTGCACCTGACAGNGCTTCGGGAGCGGCNGCTTNGGCAGTCGT 

GCAAA 

Sequence 1608 

CGAGCCTTTAGATGGCGTCTCCTCAGGGGGGCCAGATTGCGATCGCGATGAGGCTTNGGA 

ACCAGCTCCAGTCAGTGTACAAGATGGACCCGCTACGGAACGAGGAGGAGGTTCGAGTGA 

AGATCMAGACTTGMTGMCACATTTGTTTGCTGCCTATGCGCCGGCTACTTNGNGGAT 

GCCACCACCATCACAGAGTGTCTTCATACTTTCTGCAAGAGTTGTATTGTGAAGTACCTN 

CAAACTAGCAAGTACTGCCCCATGTGCAACATTAAGATCCACGAGACACAGCCACTGCTC 

AACCTNAAACTGGACCGGGTCATGCNGGACATCGTGTATAAGCTGGTGCCTGGCTT 

Sequence 1609 

GCGTCCGACGTCCCCCAGGAGAATGGTAGACACAGATGAGGAAATTGTGGAGATGGGCAC 

AAACCGCAAGGTGAAGAAAACGAACAAACACCGAGTTGATACGGATAGTCCCCGTTCCCC 

TGAGGGCCGACCCCGTGACTCCCGATGAGCGTCCAGTTGCGCCGGGCATCCCTGGGCCTC 

CCAGCGTCTTTCCCGGAGGTTCATCGCCGCACGGCGGAAAGCGCTCTCGGTTCCGCTTTC 

CGGCCCCAGCCTCCCGGGCGCCCTCGCGCGGCGGCTAACGCTGGTCCTCGGCCGGGCGCG 

CTGACGTCATCGTGCGTCAGAGTGAGCCCGGATGGGGCGGCGGGCTTCGGGAGCGCCCGG 

GCTGATCCGAGCCGAGCGGGCCGTATCTNCTTGTCGGCGCCGCTGATTCCCGGCTCTGCG 

GAGGCCTCTAGGCAGCCGCGCAGCTTNCGTGTTTGCTGCGCCCGCACTGCGATTTACAAC 

CCTGAAGAATCTTCCTATCCCTAT 

Sequence 1610 

CGCGTCCGGCGGCGGCGGCTGAGGAGGGCCCGGCCTGCGAGAGCCTCAGTGGGAGCCGGC 

TCAGCCCTCGGCCACCATGTCGGCGCCGTCGGAGGAGGAGGAGTACTGCGCGGCTGGTGA 

TGGAGGCGCAGCCGGAGTGGCTGCGCGCCNAGGTGAAGCGGNTGTCCCACGAGCTGGCCG 

AGACCACNCGTGAGAAGATCCAGGCGGCCGAGTACGGGCTGGCGGTGCTCGAGGAGAAGC 

ACCAGCTCAAGCTGCAGTTCGAGGAGCTCGAGGTGGACTATGAGGCTATCCGCAGCGAGA 

TGGAGCANCTCAAGGAGGCCTTTGGACAAGCACACACAAACCACAAGAAGGTGG 

Sequence 1611 

CGCGTTCGAGTCTGGAGACGACGTTNCGAAATGGCACCTCGCAAAGGGGAACGGAAAA6A 

AGGAATGAACAGGTCATCAGCCTTGGACCTCAGGTGGCTGAAGGAGAGAATGTATTTGGN 

GTCTGCCACATCTTTGCATTCTTCMTGATACCTTTGTCCATGTTANTGMCTTTCTGGC 

NAGTGAGTACTTCAGAAAGGCATNAAACANGCCTCAAAGGGAC 

Sequence 1612 

CCCCGCGTCCGCCCACGCGTCCGGGCTCGGCTGCACCGGGGGGATCGCGCCTGGCAGACC 



CCAGACCGAGCAGAGGCGACCCAGCQCGCTCGGGAGAGGCTGCACCGCCGCGCCCCCGCC 

TAGCCCTTCCGGATCCTGCGCGCANMMGTTTCATTTGCTGTATGCCATCCTCGAGAGC 

TGTCTAGGTTAACCGTTCGCACTCTGTGTATATAACCTCGACAGTCTTGGCACCTAACGT 

GCTGTGCGTAGCTGCTCCTTTGGTTGAATCCCCAGGCCCTTGTTGGGGCACAAGGTGGCA 

GGATGTCTCAGTGGTACGAACTTCAGCAGCTTGACTCAAAATTCCTGGAGCAGGTTCACC

AGCTTTATGATGACAGTTTTCNCATGGAAATNNGACAGTACCTGGCACAGTGGTTAGAAA 

AGC 

Sequence 1613 

GTTNAGTNGMGTTCTCTACCATTGAATCAGTGAACTAGAAAGATCTGATTTGGCCTGGG 

ACCAGTGTTCAAGTTGGTTTGGTCTTTATTAAAAATCACAATATTCCGAAAACAAAAAAA 

CCTAGGAGATAAATGTAGAGGTATTGACTTTTCGTATCTTTTATCTTCACACTGAAACAA 

GAGCTATCCTATTTGATTATTAAAGTGAGCTATGTGTTAAGTGCCAGGACATTTCTAGCT 

TTTGTGAGAATGTGTCTACATATGAGTATAATAAACCCACATGTATACACAATTGTCTCT 

TATGTACTCCTACCTGGCAGGAGTCTTTG 

Sequence 1614 

CGCGCCGGTGGTGCGATCTCGGCTACTGCAACTTCAGCCTCCTGGATTCAGGCAACACTC 

CTGCCTCAGCCTCCCACGTGGCTGGGATTACAGGTGCCTGCCCCCATGGCTMTTTTTTG

TAi n 1 1 IGTAGAGATGGGGTTTCACCATGTTGGCTGGGCTGGTCTCACTCTCCTGACCT 

CAAGCAATCTGCCTGTCTCAGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACCGCC

CCCAGCCTGAGCCTTTTTTTTTTTCTAATGCATCCAAGGTTAAGGGGAAGACGCAAATAA

CAGGACTATTCTAAAAGGAAACCTGTTTGAACTCTGTGAGATCAAGTCATCAGTCTCAGT 

ATTNCACAGGCACACCTTMTTTCATTGGTAAMGATATATATATTTTGNCTATTTTTGN

GCTTTTGGGGGCCTATTTTGNGCTTTTTTACCTTTMTG^ 

GTGGA I (Ml! IACCA 

Sequence 1615 

TCGCCNCGCGTCCGTAGAAACTCACACTAGACACACGCAGTAGTCATACGTCTTCACACG 
GTTTAGGAGCTACTGGACCMCATTCTTGTTTTTGCTTT^ 

TCTGGAGCTAACTGTGGAGCAGCCAAATAGTAGCTGGCATGTTGATTCAAACCATGGGCT 

GAATTTGCTCATAGGCTGTGCATCAGACAAAAGCTTGAATATTTGTGTTGTATGCTTGTT 

CCAACCACCGQTTGTGTGAGCATTTTTGTGGCTTGTACAGAMGTACACTTTTAAATTGT

CTCTTGCATCACTAAAATTTTTTTAAAATGAGCATAACAACGAAAGGCATCCAGCTGACT

TTTTGATTCCMGATTATTGATTGGATTGACTTllTrGCATTAM 

TAAATCATATGGCGAGTCAGGGAATAAAAAAGTCAAAMGGAMCAAATAGMGCTTTTT 
TTTTTTAAAAA
Sequence 1616 

GGNCGAGCGCGCCTTGCGGGGGCGGTATCCCGGCGCCCTAAGACCCCACGACCNCNNGCA 
CCGGCCGNTGCTGCNAGACCCCGGCCCGNGTCGGTCCGATGTCGCCCCCCGGNCCCGGCG 
GAAACGCCTCCCTNCTGGCCAGGCTGTTCNAGCACCCGCTT 
Sequence 1617 

TCGCCACGCGTCCGTTNAGATGCAGMATGMMMAMCACCTTTGTTTTATAAATATC 

AAAGTACATGCTTAAAGCCMGTTTTTATCTAGTTTATTCTAGTACTTAGCTTGCCTGGA 

ATAGCTAATAMTTATTCATGTATGTGCTTTTGAAMTCCAGAGCCCTATTTTTACACAC 

TTGTGTGMGTTGGCAAACATTTTGMAAATGGAMAAAGTTTCTAATAATTGGGAACAA 

TTACATTMTTMTATTTTGTAAMTATTGMGCTTTTAGCCCTATGTCAATTTGTAGAT 

TAAMTMATTAATTATAGGAAAGGAAGATAACAGTGAGAAACCAAACATTAC 

Sequence 1618 

CACGCGTCCGCCCACGCGTCCGGCGCGGCGGGAGCGGGCGGNGCGAGCGGGAGGCGGCGG 

CTCACAACTGAAGCGCTGNGGCATGGNGCGCGCCTGCCTCCAGGCCGANNNGTACCTNAT 

GTTCGCCTTCAACCTGCTNTTCTGGCTGGGAGGNNGTGTGNGGGCTGGNTGTCTGGCNTC 

CTGGCTTGGGC 

Sequence 1619 
TCGCCACGCGTCCGGACCTGGATGTGAGGGTGAANGGGCTGTQCTGCTGGGAGCCACATT

CCTCATTGACTACATGTTCTTTGAGAAGCGAGGAGGCAGCTGGGCCCTCTGCCATCACCA 

GTTAGAGGCCACCATGGTGTGAGGAGACCATCACCTCGACCAGAACTCCAGATGGTCACC 

TGCCCTGGCCCCTCCTCTGGGCAGCCCCTTTCCTCCATGTACACTGCAGGGGACAGAAGG 

GGGGCCCCATCCCTACCCTACTCCCTGGCCGCCTGCCCCTGTGGTTCCCAAGGAAGGGGG 

TATTGTATTGAGAGCCGCTCTCCTGCTACCTCCCACCACTGTCCAACAGTCCCTCGGCAC 

ACAGGCATATTAAGCTTTCACACTTTTCCCATGCACTTTTTTCCACCCC 

Sequence 1620 

GGAGTCGACCNCGCGTCCGGGGGCTTGCTGGGATCATGGCGGAGAATCACTGCGAGCTCC 

TGTCGCCGGCCCGGGGCGGCATCGGGGCGGGGCTGGGGGGCGGCCTGTGCCGCCGCTGCA 

GCGCTGGGCTCGGCGCCCTGGCCCAGCGCCCTGGCAGCGTGTCCAAGTGGGTCCGACTCA 

ACGTCGGCGGCACCTACTTCCTCACCACTCGGCAGACCCTGTGCCGGGACCCGAAATCCT 

TCCTGTACCGCTTATGCCAGGCCGATCCCGACCTGGACTCAGACAAGGATGAAACAGGCG 

CCTATTTAATCGACAGAGACCCCACCTACTTTGGGCCTGTGCTGAACTACCTGAGACACG 

GCAAGCTGGTGATTAACAAAGACCTCNCGGAGGAAGGG 

Sequence 1621 

GTCGCCCCGCGTCCGGGGCCCGCGGGCCTCGCCTCCGCCCTCCGCCACCTCGAGCTGCGG 

TAGCAGCGACTCATGAGAGCGCGGCCGGAGGACAGATTTGATAATGGGCTGCATTAAAAG 

TAAAGAAAACAAAAGTCCAGCCATTAAATACAGACCTGAAAATACTCCAGAGCCTGTCAG 

TACAAGTGTGAGCCATTATGGAGCAGAACCCACTACAGTGTCACCATGTCCGTCATCTTC 

AGCAMGGGAACAGCAGTTMTTTCAGCAGTCTTTCCATGACACCATTTGGAGGATCCTC 

AGGGGTAACGCCTTTTGGGAGGTGCATCTTCCTCATTTTCAGTGGTGCCAAGTTCATATC 

CTGCTGGTTTAACAGGGNGGNGGTACTATATTTTGNGGCCTTATATGATTATGAAGCTAG 

MCTCCAGAAAGACCTTTCATTTAAGAAGGGTGAMGATTTCAAATAATTAACAATACNG 

AAGGAGATTGGTGG 

Sequence 1622 

TTCGGGAGTCGCCCCGCGTCCGCTTTTAGAAAAGGCCAATATACCTATCACACTTTGGAA 

GTAAAAATACACACTTTCGTGTGTACCTAAAAAAAAAATCGTTGAAAATCAAGGCCAAAG

GTAGTGCAATTTTTTCATTAAGATTTAAAAAAAAGGGAATGATAGTCTTTGAAAGAAAAC

AGTAGGCATCCAGCACTGGACAAMCATGGGTATCAAAGATGAATAATCTTTGGAGATTC

TGGCAGTGTTTTCCCAGMCAAGTCAAGTGGAAAGTGGAGAAATTATCTGTATAATTTTG 

GACACATACAATGGCAGTTTATCAMGGGTTTTGTTCTGTGGCCTGAATTTACTGGGGTC 

CTACCTACACATTGAACATGTTTTGGCTGGC I i I II I M I f II i I I CAACTTGCCAGTTT 

CACTTTACATGGTTAGTAATAAATGGTTTCCACGGGGTGAGTTGGGATMAATTNTTNAA 

AACATNTTAAATTCCA 

Sequence 1623 

GGAGTCGACCNCGCGTCCGAGCCGGGCCGGGGCGATGTGGAGCGCGGGCCGCGGCGGGGC 

TGCCTGGCCGGTGCTGTTGGGGCTGCTGCTGGCGCTGTTAGTGCCGGGCGGTGGTGCCGC 

CAAGACCGGTGCGGAGCTCGTGACCTGCGGGTCGGTGCTGAAGCTGCTCAATACGCACCA 

CCGCGTGCGGCTGCACTCGCACGACATCAAATACGGATCCGGCAGCGGCCAGCAATCGGT 

GACCGGCGTAGAGGCCGTCGGACGACGCCAATAGCTACTGGCGGATCCGCGGCGGCTCGG 

AGGGCCGGGTGCCCGCGCGGGTCC 

Sequence 1624 

CGCGTCCGGGGCAGCCGCGCCCGCGGAGTTTTCCGCCCGGCGCTGACGGCTGCTGCGCCC 

GCGGCTCCCCAGTGCCCCGAGTGCCCCGCGGGCCCCGCGAGCGGGAGTGGGACCCAGCCC 

CTAGGCAGAACCCAGGCGCCGCGCCCGGGACGCCCGCGGAGAGAGCCACTCCCGCCCACG 

TCCCATTTCGCCCCTCGCGTCCGGAGTCCCCGTGGCCAGATCTAACCATGAGCTACCCTG 

GCTATCCCCCGCCCCCAGGTGGCTACCCACCAGCTGCACCAGGTGGGTGGTCCCTGGGGG 

AGGTGCTGCCTACCCTCCTCCGCCCAGCATGCCCCCCATCGGGCTGGATTAACGTGGCCA 

CCTATGCGGGGGCAAGTTCAACCAGGGACTATCTTCTCGGGAATGGCGGCCAACATTGTC 

TGGGGACATTTGGAGGGAGCCAACATGCCCAAACCTGGACCCTGGGGCCCCTGGGGGCTG 
Sequence 1625

CACGCGTCCGGCGCCGCTCCCGCATCTGCACCCGCAGCCCGGCGGCCTCCCGGCGGGAGC 

GAGCAGATCCAGTCCGGCCCGCAGCGCAACTCGGTCCAGTCGGGGCGGCGGCTGCGGGCG 

CAGAGCGGAGATGCAGCGGCTTGGGGCCACCCTGCTGTGCCTGCTGCTGGCGGCGGCGGT 

CCCCACGGCCCCCGCGCCCGCTCCGACGGCGACCTCGGCTCCAGTCAAGCCCGGCCCGGC 

TCTCAGCTACCCGCAGGAGGAGGCCACCCTCAATGAAGATGTTCCGCGAGGTTGAGGAAC 

TGATGGAGGACACGCAGCACAAATTGCGCGCGCGGTGGAAGAGATGGAGGCAGAAGAAGC 

TGCTGCTAAAGCATCATCAAGAAGTGGAACCTGGCAAACTTAC 

Sequence 1626 

CCACGCGTCCGGCCGGGGGGTGCCCCCGGGACGTAGCGCCGCGGAGAGGAAGCGGCAAAG 

GGGACCATGCGGCGCCTGACTCGTCGGCTGGTTCTGCCAGTCTTCGGGGTGCTCTGGATC 

ACGGTGCTGCTGTTCTTCTGGGTAACCAAGAGGAAGTTGGAGGTGCCCGACGGGACCTGA 

AGTGCAGACCCCTAAGCCTTCGGACGCTGACTGGGACGACCTGTGGGACCAGTTTGATGA 

GCGGCGGTATCTGAATGCCAAAAAGTGGCGCGTTGGTGACGACCCCTATAAAGCTGTATG 

CTTTCAACCAGCGGGAGAGTGAGGCGGGATCTCCAGCAATCGGGCCATCCCGGACACTCG 

CCATCTGGAGATGCACATGGCTTGGTGTATTGGACGGGACCTTCCACCCACTT 

Sequence 1627 

GCCACGCGTCCGCCGCCCGCTTGCCCGTCGGTCGCTAGCTCGCTCGGTGCGCGTCGTCCC 

GCTCCATGGCGCTCTTCGTGCGGCTGCTGGCTCTCGCCCTGGCTCTGGCCCTGGGCCCCG 

CCGCGACCCTGGCGGGTCCCGCCAAGTCGCCCTACCAGCTGGTGCTGCAGCACAAGCAGG 

CTCCGGGGCCGCCAGCACGGCCCCAACGTGTGTGCTGTGCAGAAGGTTATTGGCACTAAT 

AGGAAGTACTTCACCAACTGCAAGCAAGTGGTACCAAAGGAAAATCTGTGGCAAATCAAC 

AGTCATCAGCTACGAGTGCTGTCCTGGATATGAAAAGGTCCCTGGGAAGGANGGGGGCTT 

GTCCAAGCAAGCCCTACCACTCTCAAACCTTTACGAGACCCTGGGNAGTCGNTTGGATCC 

ACCACCACTCAAGCTGTACACCGACCGCACGGAGAAGCTGAGGCTGAATGGGGAGGGGCC 

Sequence 1628 

CCTAAGGGCAACAAGGGCGGTCCTGGCCAGCCGGGCTTTGAGGGAGAGCAGGGGACCAGA 

GGTGCACAGGGCCCAGCTGGTCCTGCTGGTCCTCCAGGGCTGATAGGAGAACAAGGCATT 

TCTGGACCTCGGGGAAGCGGAGGTGCCGCTGGTGCTCCTGGAGAACGAGGCAGAACCCGG 

TCCACTGGGAAGAAAGGGTGAGCCCGGAGAGCCAGGACCAAAAGGAGGAATCGGCAACCG 

GGGCCCTCGTGGGGAGACGGGAGATGACGGGAGAGACCGGAGTTGGCAGTGAAGGACGCA 

GAGGCAAAAAAGGAGAAAGAGGATTCCCTGGATACCCAGGACCAAAGGGTAACCCAGGTN 

AACCTGGGCTAAATGGAACAACAGGGACCCAAAGGCATTNAGAGGCCCGAAGGGGA 

Sequence 1629 

AGTCGCCCCGCGTCCGCTGTGCCTGAAGGAGACTGGTTTTGTCCAGAATGTCGACCAAAG 

CAACGTTCTAGAAGACTCTCCTCTAGACAGAGACCATCCTTGGAAAGTGATGAAGATGTG 

GAAGACAGTATGGGAGGTGAGGATGATGAAGTTGATGGCGATGAAGAAGAAGGTCAAAGT 

GAGGAGGAAGAGTATGAGGTAGAACAAGATGAAGATGACTCTCAAGAAGAGGAAGAAGTC 

AGCCTACCCAAACGAGGAAGACCACAAGTTAGATTGCCAGTTAAAACAAGAGGGAAACTT 

AGCTCTTCTTTCTCAAGTCGTGGCCAACAACAAGGAACCTGGAAGATACCCTTCAAGGAG 

TCAGCAGAGCACACCCAMACAACTGTTTTCTTCTAAAACTGGGTAGAAGCCTAAGAAAG 

ATAAACTCTGCTCCTCCTACAGAAACAAAATCTT 

Sequence 1630 

TNCGGGCCTGGTGAGCACCGCCGAGGCGCGGGCCAGCTCTTCGAGGTTGTGCGCGGGAGT 

GGCACGGCGGGCCGGGCCGAGCGAGGGGCTAACTTCAGCGGTGGCACCGGGATCGGTTGC 

CTTGAGCCTGAAATCATGACCACCCCAGGAAAAGAGAACTTTCGCCTGAAAAGTTACAAG 

AACAAATCTCTGAATCCCGATGAGATGCGCAGGAGGAGGGAGGAAGAAGGACTGCAGTTA 

CGAAAGCAGAAAAGAGAAGAGCAGTTATTCAAGCGGAGAAATGTTGCTACAGCAGAAGAA 

GAAACAGAAGAAGAAGTTATGTCAGATGGAGGCTTTCATGAGGCTCAAGATTAATAACAT 

GGGAGATGGCCCAGGGTGGGTGTCATCACTTCTGACATGATTGAAATGATATTTTCCAAA
AGCCCAGAGCAACAAGCTTTCAGCAACACAAGAAATTCAGGAAGCTGCTTTCAAAAAGAA
CCTAA 

Sequence 1631 

CGGAGTCACCACGCGTCCGGGGCTGCCAAGGGAGGAGGAAGATGGCGGCGGGGGCGAGGT 

GAGGTGTTGGCAGTGGAAAGGGGTTCGGGCTCGGGGGGCGGGGGGACGCGGAGCGATGGC 

CCGCGCCGGCCGCAGGGGCGGATAAAAAGCCGTCGCGCTGCGGGAGTGGGCGGGAGGGAG 

AGGGGGTGTCCGAGGGCCACAAGAGTATGACGGGGCTGTACGAGCTGGTGTGGCGGGTGC 

TGCACGCGCTGCTCTGTCTGCACCGCACGCTCACCTCCTGGCTCCGCGTTCGGTTCGGCA 

CCTGGAACTGGGATCTGGCGGCGCTGCTGCCGCGCCGCCTCTGCGCGGTCCTAGCGCCGC 

TCGGCTTCACGCTCCGCAAGCCCCCGGCAGTCGGCAGGAACCGCCGTCACCACCGGCACC 

CGCGCGGGGGGGTCGTGCCTGG 

Sequence 1632 

CGTCCGTTTGTTTAATATTTTTTTTCTCTCTTGAACAAAACTGAGATAATTTAGAAAACA

GGTGCTTAATTGCAATAAAATTACTATGAAGTATATTAAAAATCACGACATTGTAAAATC 

TCACTTTAGATCATCAAAGAAAACCATTGTTACTATCTGCTTTGAGCTTAGGAAAATGTA 

CMGAGAACAAATTAAAATTGAAAMTTGATTTCACTTAGAAAAACTTCTAGGAACAGGG 

TGAACCACTGATTTTMTTTGCCTAATTATCTTATGACAAGTATCAAATTAAGATGACAC 

TTAMGGATCCTTAGCATTTAACTTAATGATGGAGAAAGAGTGCTCAATAGGACAGTTCC 

CCAGTTAAGGGGTMTGGAGATGCCCATTTTCAGGAGGACCATTCTAAGMGATATTTTT 

GGATTCATTAAAAACATTTAAATAAAAAGCCCTTCTTCAAGATTGGGAAC 

Sequence 1633 

CGCGTCCGCCGGCCTGGTGCACGCGGCGCACCGAGGCCTCCCGCAACGCCGCCGACAAGG 

AGCGGGCGGCGGGCGGCGGCGCCGGCAGCAGCGAGGACGACGCGCAGAGCCGCCGCGACG 

AGCAGGACGACGACGACAAGGGCGACTCCAAGGAAACGCGGCTGACCCTGATGGAGGAAG 

TGCTCCTGCTGGGCCTCAAGGACCGCGAGGGTTACACATCATTTTGGAATGACTGTATAT 

CATCTGGATTACGTGGCTGTATGTTAATTGAATTAGCATTGAGAGGAAGGTTACAACTAG 

AGGCTTGTGGAATGAGACGTAAAAGTCTATTAACAAG 

Sequence 1634 

CCCCGCGTCCCGGTTGGCCGGGCGGAGGTCTTCGCTGAGGCCCGGGGCGGGGTGGCGCCA 

CCCCTGATTGCGGTGCCACGGACTGCTCCTGCTGGGCGGAGAGGACAGATTTTGCAAAGC 

GGAGGCTTGCGACGGGTCCTGCAGGGGGACAGTGAGGAAAGGGCCCGCCTCGTNTCCGCT 

CCTGGGGGACCCGCAGAAATAAGAATCAAACTCCACAATGACAACCTATTTGGAATTCAT 

TCAACAAAAATGAAGAACGAGAATGGGAGTCCCGATTTAGT 

Sequence 1635 

CCACGCGTCCGGGCGGGGCCATCCAAGCAACGCTGAAGGCCTTTTCCAGCAGCTGGGAGC 

TCCCGGATTGCGTGGCACAGCTGAGGGGCCTCTGTGATGGCTGAGCTCTCTTATGTCCTA 

TACTCACATCAGACATGTGATCATAGTCCCAGAGACAGAGTTGAGGTCTCGAAGAAAAGA 

TCCATGATCGGCTTTCTCCTGGGGCCCCTCCAATTGTTTACTGTTAGAAAGTGGGAATGG 

GGTCCCTAGCAGACTTGCCTGGMGGAGCCTATTATAGAGGGGTTGGTTTATGTTGGGGA 

GAATTGGGCCTGAATTTCTCCACAGAAATMGTTGCCATCCTCAGGTTGGCCCTTTCCCA 

AGCACTGTAAGTGAGTGGGGTCAGGCAAAGCCCCAAATGAGGGGTTGGTTTAGATTCTGA 

CAGTTTGCCAGCCAGGCCCCACCTCAGCGTCTGTCGAACAAACAAAGNTNGGGNGGGTTT 

N 

Sequence 1636 

CCNCGCGTCCGCGGACGCGTGGGCGGACGCGTGGGCTTCTGCAGCAAGCTCAGGAGAGCT 

GCTGTCTTCCCTCCCGCCCACCAGCAACGCACCCTCTGACCCTGCCACAACTACTGCAAA 

GGCAGACGCTGCCTCCTCACTCACTGTGGATGTGACGCCCCCCACTGCCAAGGCCCCCAC 

CACCGTTGAGGACAGAGTCGGCGACTCCACCCCAGTCAGCGAGAAGCCTGTTTCTGCGGC 

TGTGGATGCCAATGCTTCTGAGTCACCTTMCTTTGAACCATTCTTTGGAATTGGCGTGG 

TATATTTAACCACGGGAGGCGTGTCTGGAAACGCAAACTATCATTAATTTCATACTAGGT 

TTGTACCGTATCTGTAGGCATTCCTGTAAATAATTCCAAGGGGGAAAACTAAACNGGGAC 
GTGGGGTTGTATCCTGCCAGGTTTGAGTGGGGGCTCACACCTAGGGTGAGAAGTCAGAAA

GCGCTTGTATTTTAAACAACCAAAAAGAATTGAAAGGGTG 

Sequence 1637 

CGTCCNATAGGCTTGCACACTTTTCTAACTACATGTTTAAGTGGCAGAGTCCAGGCTGTC 

GAGTCACGGTTGGGTTTGMTCTGACTCCACCAGTMCTTTGGTTGGAAAAAATCACTTA

TCCTCTTTAAG CTTGA TTTTATTTATTTTATTTTATGTMGAGTGAGACAGTAGTA 

MTAGGGTTGCTTTTAAATTAGAGTGAACATGAGGCATTTATTCGGTGCCAGACAGATAA 

CTGCCTATAACAGGATGTGATCAGCACAAGTAACAGAAAATTAGCCTGGACGGTGGCTTA 

AGCAATGGGGAATGTTTATCTCACATAGCMAMGGTCTGTAMTAGGATGGTTTTAGAG 

TTGGGGTGGGGAAGCCAAAAATGTCATCAGGATTTCCTTGGAACCCGT 

Sequence 1638 

CGCGTCCGGATTAATACAACTCTTAAAAAATATAGTCAATAGGTTACTAAGATATTGCTT

AGCGTTMGTTTTTAACCGTMTTTTMTAGCTTMGATTTTMGAGAAAATATGAAG 

TTAGAAGAGTAGCATGAGGAAGGGGAAGAAAAGAAGGGGAAGAAGATCAAAGAAGGAAAG 

AAGAAGGGGAAGAAAAGAAGGGGAAGAAGATCAAAACCCACCATGCCCCAGGCTCAGCAG 

GGAGCTGCTGGATGAGAAAGGGCCTGAAGTCTTGCAGGACTCACTGGATAGAAGTTATTC 

MCTCCTTCAGGTTGTCTTGAACTGACTGACTCATGCCAGCCCTACAGAAGTGCCTTTTA 

CGTATTGGAGCAACAGCCGTGTTGGCTTGGCTGTTGACATGGATGAAATTG 

Sequence 1639 

CGCGTCCGGCTCCCCGCACCCCCTCGCACTCNCTCTGGCGGGNCCAGGGCGGCCTTCAGC 

CCAACCTTGCCCAGCCCCACGGGCGCCACGGAACCCGCTNGATCTCGCCGCCAACTGGTA 

GACA 

Sequence 1640 

GTCGCCACGCGTCCGGCGGCCGGGCCGGGGCAGCCGGGAAGCGGGTGGGGTGGTGTGTTA 
CCCAGTAGCTNCTGGGACATCGNTCGGGTACGCTCCACGCCGTCNCAGCCACTGCTGTGG 
TCGCCGGTC 
Sequence 1641 

CGTCCGCTCCTCCCGCCTGAGGCGAGTCTGGGCTCAGCCTAGAGCTCTCCGGCGGCGGCG 

CAGCTTCAGGGCAGCGCGGGCTGCAGCGGCGGCGGCGGTTAGGGCTGTGTAGGGCGAGGC 

CTCCCCCTTCCTCCTCGCCATCCTACTCCTCCCTCCTCGTCATCCTCCCCCTTCGTCCTC 

CTCGCCTTCCTCCTCCTCGTCAGGCTCGACCCAGCTGTGAGCGGCAAAGATGGGCGGCGC 

CCAGGCCGCCGCCTGCCAGGCTGTCGGGGCGTCATGGTGCCGGCGCCCATCCAAGACCTG 

GAGGCCCTGCGCGCGCTCACGGCGCTCTTCAAAGGAGCAGCGGAACCGAGAAACAGCACC 

CAGGACTATCTTCAAAAGGAGTTCTGGATATCCTAAAGAAATCTTTCTATGCTGTTGAGC 

TTGCCTGCANGAGATCCATCCCAAGTGGAAAACCT 

Sequence 1642 

ACATTTATCATGGATGCTGACCGGGAGAAAGAAAGAAAGAAACGGGAGGAGCGGGAGCGT 

AAGCGGCGGAAGGAGGAGGAGGTGCAACAGCCAAAGTTGGCAGAGGAGAGACGGCGGCAG 

AATTTACAGGAGGAAAAGGAAAGGAAGTTGGAATGCCTGCCCCCTGAACCTTCCCCTGAT 

GACCCTGAAAGTGTCAAGATCATCTTCAAATTACCTAATGATTCTCGAGTAGAGAGACGA 

TTCCACTTTTCACAGTCTCTMCAGTMTCCACGACTTCTTATTCTCCTTGAAGGAAAGC 

CCAGAAMGTTTCMGATTGMGCCMTTTTCCCAGGCGAGTGCTGCCCTGCATCCCTTC 

AGAGGAGTGGCCCAATCCCCCTACGCTACAGGAGGCCGGACTTAGCCACCAGAAGNTCTT 

TTTGTTCAGGACCTMCTGACGAATGACATTTTTTTCTTTCTGTCCCCTCCTACCCCAGT 

CCCTAAAAGAAATGGGGNAAAAAGGAAACAACAGCAGTCNTAAAAAAAAAA ^ 

Sequence 1643 

CGCGTCCGGAGGGGCTAAGAAGGTTGTCCTTGCCTAATGCTCTGATCTGTAAGTGAATAG 

GGCAGMCAGTTCAGCCTTGAGGTTAGAATTTAGCAGGAGCTATCCTGACTTAATATCCA 

GTTGTGGGGTTTGCAAAACAAAACAGCTGTATGTAATCATTGCCACTAGTTCCATCTAGA 

ACTCCTTTCTAGTTTGTTATTTTTAAAATGTTTATACATAAAACCACCAAAATACATAGC 

TTCGACAAGATGGAAGTTTATTTCTCTCTCCCATAACAGTGCAGTGATAGTCAGCTGGTC 
CAGGCCAGGCAAGGGGCTGGTCCATGATGTCATCAGGCACCCAGGTTCCTACTGGCTTTG

CATGTGGCCACAGTTAGCAACAAANGGAGGCTGTAAATTT 

Sequence 1644 

CGCCACGCGTCCGGTGATGCGGACCCCGGGCGGGCGCAGGGCGCGGGCTCCGGCGCCGCC 

GCTGCGTCCTCCCCGGCCGCGGGCGAGCCGCTGCAGAGGGAGCGTCGCGCCGGGGCGGAG 

TGCGGGCTTGCGCGGCAAGTGCGCGCCGAGGTCACGAAATGGATTGGAGTGAACCGGAGA 

CCCCGAAAACGGAAGCGCAGGGAGAAGGAAGAGGTGTTTGAAAAGCTTCTTCCAGACCAG 

CTGGTCTTGCTTCTGGAGCATCTCTTGGAGCAGAAGACTCTGAGCCCCCGAACTCTGCAA 

AGCCTCCAGAGGACATACCACCTCCAGGATCAGGATGCAAGAGGTTCGCCATCGGTGGTG 

TGAACTCATTGTTAAGCACAAGTTCACGAAAGCCTACAAAAGTGTGGAGAGGTTCCTTCA 

GGGAGGATCAGGCCATGGGGTGTGTACCTCTACGGGGAGCTGATGGTTGAGTGAGGACCC 
CAGAC 

Sequence 1645 

TCGCCACGCGTCCGGGACATCGAGTNCGGGCTGGCTACGAACTCCTCGGGGGCGAAGGTG 

GCGGAGAGGGATGGGTTCCAGGACGTCCTGGCGCCCGGGGAAGGCTCGGCGGGACGGATT 

TGCGGTGCGCAGCCAGTGCCGTTCGTCCCTCAGGTGCTTGGCGTGATGATCGGGGCCGGA 

GTGGCGGTGGTGGTCACGGCCGTGCTCATCCTCCTGGTGGTGCGGAGGCTGCGAGTGCCA 

AAAACCCCAGCCCCGGATGGCCCCCGGTATCGGTTCCGGAAGAGGGACAAAGTGCTCTTC 

TATGGCCGGAAGATTATGCGGAAGGTGTCACAATCCACCTCCTCCCTCGTGGATACCTCT 

GTCTCCGCCACCTCCCGGCCACGCATGAGGAAGAAACTGAAGATGCTCAACATTGCCAAG 

AAGATCCTGCGCATCCAGAAAGAGACGCCCACGCTGCAGCGGAAGGAGCCCCCGCCCGCA 

GTGCTAGGAAGCTGAC 

Sequence 1646 

TCCGCCAAGTCCTGCGATGATGGACTCAACACCTTCCGCGACGAGGGGCCGGGTTCTGCG 

GCGCCTGCCAAACCGCATACCCAGCCTGCGGATGCTCCGGAGCTTCTTCACCGACGGGTC 

CTTGGATAGCTGGGGCACCTCTGAAGATGCTGACGCTCCTTCTAAGCGACACTCAACCTC 

TGACCTCTCAGATGCGACCTTCAGCGATATCAGGAGAGAAGGCTGGTTGTATTATAAGCA 

GATTCTCACCAAGAAGGGGAAGGCTGAGGACCGGGATGACATGCTGGGCTGGATCAGAGC 

GATCCGGGAGAACAGCAGGGCCGAGGGCGAGGACCCCGGCTGTGCCAACCAAGCTCTGAT 

CAGCAAGAAGCTTAATGATTATCGCAAAGTGAGCCATAGCTCTGGGCCCAAAGCTGATTC 

CTCCCCCAAAGGCTCTCGCGGCCTGGGGGGCCTCAAGTCTGAGTTCCTCAAGCAGAGTGC 

GGCCACGTGGCCTCANGACTCAAGACCTGCCCGCAGGGAGCAAGGATGACAGTGCTGCAG 

CCCCCAAAACCCC 

Sequence 1647 

GGTGTCGCCCCGCGTCCGGTTTCTTCCTAATTTATATTTCCGATACATANGTGTAGAACA 

GGAATTTGCAGMGCCATTTAAGTTATCTTTTGAGGTMNGCTCTGATTTAGCATTTATT 

CTGATAAAATCTAATACATCATGGGATATATATAAAGCAACTTAATTCTTGTGGTGTAGT

CTTAATAGTTTTGAATGTTGACTGMTGTCTATAAMTTGTGAGTTTGTCTTTGTTACAT 

TCCAGTGTTTCTGCCTCTTGGCATGCTTAAAGCACGGCTTACTTCATCTGCTCCTTACAC 

ACTAAAATGCTGTTAGTGTGCTCAACTACAGAAATAGCCGCTGCTAAGTTGATGTAGATT

TTCTACTTGMTATTTTTATGGTTGTAGGAACCTCAGGAGGGTCAGTGTTTACTGGTTTA

TATATGCCTTCTTTTTCCTGTTTGAGCTTCTCTCTTTGAAGGGATTCTAACAGAACAAAA

GCTGCTGATCACCCTAAGTTGGAAACAGNAAAGGNGTAATTAATATAACTTAATGC 

Sequence 1648 

TCACCACGCGTCCGAAAGTCCGTGACATGGTTCCCCGTGGTGGCCCGTGGCAGCCCGTGG 

CATGGCGTGGCTCAGCTGTCTGTTGAAGTTGTTGCAAGGAAAAGAGGAAACATCTCGGGC 

CTAGTTCAAACCTTTGCCTCAAAGCCATCCCCCACCAGACTGCTTAGCGTCTGAGATCCG

CGTGAAAAGTCCTCTGCCCACGAGAGCAGGGAGTTGGGGCCACGCAGAAATGGCCTCAAG 

GGGACTCTGCTCCACGTGGGGCCAGGCGTGTGACTGACGCTGTCCGACGAAGGCGGCCAC 

GGACGGACGCCAGCACACCGAAGTCACGTGCCAAGTGCCTTTGATTCGTTCCTTCTTTCT 

AAAGACGACAGTCTTTGTTGTTAGCACTGAATTATTGAAAATGTCAACCAGATTCTAGAA
ACTGCGGTATTCCAGTTCTTCTGACACCGGATGGGTGCTTGGGAACCGTTTGAGCCTTAT

AGATCATTTACATTCAATTT 

Sequence 1649 

CNCCACGCGTCCGGGGATCCCTGGGGAGAGTAACAGTGGCCCCACATCCCTCCTCCTGGG 

AGACGCTGGTGCAGGGCCTCAGTGGCTTGACTCTCAGCCTAGGCACCAACCAGCCCGGGC 

CTCTGCCTGAAGCGGCACTCCAGCCACAGGAGACAGAGGAGAAGCGCCAGCGAGAGAGGC 

AGCAGGAGAGCAAAATAATGTTTCAGAGGCTGCTCAAGCAGTGGTTAGAGGAAAACTGAG 

ACGTGCACCCCCATGGGATGGAGACCCGAAGGGACTCAGACGGAGCCGCCGTGTTGGCAG 

CGCCTGGGTGTGGGCCCATTTTGGGGACCAAACAGCAAGCTGTGGTCGGATGAGTGCCAG 

GACCTGTGTACCGGGACACGTGGGGAGTCCTCCCAGCATGATGCTTGACTGACCCGAGGA 

AGGTCCTCATGTTTCGTGCCTGTCATTCTCGGATGGCTGTGAGGCATTCCTTGGCAAGGG 

ATGCTTGCGTACCAAGCGGTCCTACCGCATCTACATGGCTTCTGTGATGCATGTTGTCGC 

TTTCCCACCCNGGAT 

Sequence 1650 

CGCGTCCGAGCTTTGCAGGGAAGAACAGAGTATGGGTCATCTCAGCCCCTCATGCCTCGG 

AAGGCTACTACCGCCTCATGATGAGCCTGCTGAAGGACGATGTGTACTGTGAGCTGGCGG 

AGAGGCACATCCAACAGATTGTGCTCTTCCACCAGGCAGGTGAGGAAGGAGGCAAGGTGA 

GAAGGATCACCAGCGAGGGCCAGATCCTGGAGCAGCCCCTGGACCCTAGCCTCATCCCTA 

AGCTGATGAGCTTCCTGAAGCTGGAGAAGGGCAAGTTTGGCATGGTGCTGCTGAAGAAGA 

CGCTGCAGGTGGAGGAGCGCTATCCATATCCCGTTAGGCTGGAAGCCATGTACCGAGGTC 

ATNGACCAAGGCCCCATCCGTAGGATCGAGAAGATCAGGCAGAAGGGCTTTGTCCAGAAA 

ATGTAAGGCCTCTTGGTGTAGAGGGCCANNGTTGGTTGNCTGAGGGGGAATTGACCCGTT 

GGAAGGGGGAAGCAATGAAAGGGCCAAAG 

Sequence 1651 

CGCGTCCGGGATGCGTTGGGTCTGAAAGTCGATGAAGGACGCGATTACCTGCGATAAGCT 

TCGTGGAGTTGGAAATAAACTATGATACGGAGATTTCCGAATGGGGTAACCTAACTGAGC

MACCTCAGTTGCATTTTGATGAATCCATAGTCAAATTAGCGAGACACGTTGCGAATTGA 

AACATCTTAGTAGCAACAGGAAAAGAAAATAAATAATGATTTCGTCAGTAGTGGCGAGCG 

AAAGCGAAAGAGCCCAAACCTGTAAAAAGGGGTTGTAGGACATNTTACATTGAGTTACAA

MTTTTATGATAGTAGAAGAAGTTGGAAAGCTTCAACATAGAAGGTGATATTCCTGTATA 

CCGAAATCATAAAATCTCATAGATGTATCCTGAGTAGGGCGGG 

Sequence 1652 

GTCGCCNCGCGTCCGCAACATTATTGAGATTGTCGTGTATAGTCATCGAATATCAGCCAG

TTCCTGTMTTTTGTGACACGCTCTCTGCCAAGCCCACCAAGTATTTCCTTTATAGCTAA 

AAGTTCCATAGTACTAAGGAAATAAAGCAATAAAGACAGTCTCAGCAGCCAGGATTCTGG 

CTGAAGGAAATGATCCGCCACCCTGAGGGTGGTGATGGTAGTTTCTACCCATACCTCAGC 

CTCAGGCGAGTGGCTTATAGCCTCCATTCATGGTGCACTTTATTTATGGTACTAAGATAA 

AGACTGTCAATCCATTGATTTATCTCCTCCTGTCCCCCATCTAAAATACCCATGCTGCTT 

TTCTGGAGTGTTGTGGGGGGGTTACCAGCTTGATCCACTGGTGCTCTTTAAGAAGGCCCA 

AGAMGGTCTTTGGGGCATTGCCAAAGAAAATCCCGGATTTATGTGGGAAAACCCTCACT 

TTTCTCTTTACNGGCTGGTACCAAGA 

Sequence 1653 

CCGTCCGTTTTTTTTTTGAACTTACCGTAAAATTTTTTTTTAAAAAGTGCTTGTAAAAAT

TAAAGAGGAATAAAAGGGGGGTGAACAGCCAGTACGATAGTGCATGCCTGAAATTCCAGT 

GCTTTGGGAGGCCGAGGCAGGAGGATCGTTTGAGGCCAGTAGTTGGAGAGCAGTGTGGGC 

AACGTAGCAAGACCCCATCTCTACAAAAAATTTAAAAGTTAGCCGGGCATGGTGATTCAC 

ACCTGGAGTTCCAGCTGCTGTGGAGGCTGAGGTGGGAGGATCGCTTGAGCCCAGGAATTT 

GAGGCTGCAGTGAGCCATGATTGCACCACCGCACTTCAGCCTAGGTGACAGAGCAAGGGT 

CTACCTCAGAAAAAAAAAAAAAGGAGGAGCAAGCACGTGTTGATGGGTGGAAATCCAGCC 

AGAAATGCTGAGGCTGAAAAGATTGTTCCCAGTTTNCTGGTAGCCAGGGGAAAAGGGGAA 

ATT 
Sequence 1654

CGCGTCCGCTGACATTTCCTAGGAAGCTNGGAAAAGGAAAGTGAAGGAATGGTCTAAAGA 
AATGACCATTCACACTGATTTTGTCTGGACAGTCTG 

CAGCCTTTGCAGGACTTGGATTCAGGGTTTTACTCAGTCCCTTTACCTTAGGTGGAATCT

TCTTAAATTGAMTTTTTGGTMGGAATTCATTTCACAGGTAGTGTTTCAGACTCTGAAA 

GCCCTGACTTGGTTCTTGGCTTCTACTGTACTAGTTACTAGTTACTGGTACTGTTGCCAA 

GCAATCTGCTTGGAATTTGTGGATCCCTGCTGTTCCCTAATCCCACCCCCTGCCCTGAGA 

CAGTGAATGTAGTCCGTGAAGGGAGTGCCTCTCTGGGACCCCCTGTGTTGTTACAGGCTG 

TGCAGTGCAACAATTCCAGCAAAAATACCCTATCCCCGCACTTAGTCATTCGTGGTAACT 

AACAATTTTGAAATACTCATATAAAATGAACAGGAAAGTGGTTAGTG 

Sequence 1655 

GACCACGCGTCCGGACCCAGACCCGGCTGACCCACCTACCCGCGATCCTGCCCATGGCTG 

ACGGGCTCTTTCGGCGCAGACCCTGGGGTCTCGAGCAGATTCGCCCGGACCCCGAGTCCG 

AAGGCCTGTTTGACAAGCCTCCCCCGGAAGACCCTCCCGCTGCCCGCGGGCCCAGGTCGG 

CGTCGGCCGCGGGCAAGAAGGCTGGTCGGCGCGCGGGCGGGAGGGCGCAGGGGGGCCGCG 

CCGGGCAGCCCCCGAAGGCCGCATCGCGCCCCCCGCCCAAGAAGGAGGCGCCTCCACTGG 

ACGAGGGCTGCTATCTCGACCATTTTCCGCACCTCTCCATCTTCATCTACGCAGCCATCG 

CCTTCTCCATCACCTCCTGCATCTTTACCTATATCCATTTACAGCTTGCCTGAGTGGCCA 

GCGCGGGACGGGGTGGGCGCAGGACCGAGCGGGGAGGGAAAGGGGAAAACGGGGGCTNGG 

CATTTTGTGTTTTAG 

Sequence 1656 

CNCTAACCCCGAACTCTAGATCGTCTTGCTTGTTTGTCTGAAGAAGGGAATGAAATAGAA

AGTGGAAMATMTATTTTCAGAGCATCTTCCCTTAAGTAAGCTACAGCAAGGCATAAAA 

TCTGGTACATACCTTCMGGAACATTTAGAGCTAGCAGGGAAAATTACTTGGAAGCTACA 

GTATGGATTCATGGCGACAGTGAAGAAAATAAAGAGATAATCTTACAGGGACTTAAACAT 

TTAAACAGAGCTGTTCACGAAGATATTGTGGCTGTGGAGCTTCTCCCCAAGAGTCAGTGG

GTAGCACCATCTTCTGTGGTTTTACATGATGAAGGTCAAAATGAAGAAGATGTGGAGAAA 

GAAGAAGAGACAGAACGAATGCTTAAGACTGCTGTAAGCGAGAAAATGTTGAAGCCTACA 

GGTAGGAGTTGTAGGAATAATAAAAAGGAATTGGA 

Sequence 1657

CGTCCGCGGACGCGTGGGCGGACGCGTGGGCTGGCTGTATCTATACTTTCCTTGAGAAAA 

ATCCCATAAAGTGGATGGACCTGTGAAGAAAATGTATGCTTATGGCCTAGCCTTCATGTC 

TGGCTGATGTATCCTATAAGGCAGTAAGCCCCTTTTCTAGTCTCTGGTAAGATGCAAGAG 

CTCATATCCCCATCACTGACATTTTAGTTTGGAAATAATATTGAGACTGTGCTATGACCA

ACCCCTGATGTTGTTTTTTCTTTTCAAACTTTTGCATATGAGTAGAGGAAAAGCCTAAAA 

GTTAAGTATTTATGTCTGGGGGGATACCTTCAGGTGTCTTATCTGTTTTATGCAAGAATT 

TATGTGTTCATCTTTATTCAGTGCAAAGATTTTTTTTTAAATTTTGTTTATAATTGGAGG

TAACATTAAGACAACTCTTNCTCCACAAGAAAACCTCTAAAATTAATATTCCTTAAGATT

TGGTTTTCCTTTGCCTTATMTATTACCTTTTAATTGCATGCAAGATTGTCATACTTT^ 

AAAAG 

Sequence 1658 

GTCGCCCCGCGTCCGTTTGATATACCACTCTGATAACTCATATAAAAATATCATCATAAA 
MGCTTAATTTCATCCCTTTTATGTTGGTTTTAAM 

ATTGAGMCTCTTACATAGTAGMTCCATTCTATAATACATGTGTTGACAAAGCTTTAGA 

GAMGTTTCCTATTCTCTTCCATTTCCCCTGCCCAAAGTGCTGACATAGGCAGTGATGAA

GMTCTTTACCMGATTTTCAGGGTGTACCTATGAAATTGCTTTAAATGCACTGCTGGTG 

TAMTMTTAGCMGCAAAAGCGTTTCTGTGACTTCAGGTACCAGCTTAAAGAGCACTAG 

GGATGGGGAACGAATGCCAAATCAGACTCCACCTAGAGCACCAGGAAACAGCTTGTCCCT 

GGTAGGGAAATGGTGTTGCTGAAAG 

Sequence 1659 

CGACCNCGCGTCCGGCTGNTGACCCCATGCTGAGTGGCCNGTGGGGAGCGGCGCCCGGCA 
GGCTCTTCTGGGGTCGTCTGTCCTATCCGTGGATTGTATATACTCTTCTCTGTTAAGGAG

TTTTTCCCMGAAGMMGTATTTAAAAGAAATACCAGTGAGTGCCTTAAAGTTGGAGAA 

GTAACTGCCCATGCCCAGAAATAAGGATGCCAGTGCCCAGAAGCAGTGAGATTAGTCTGT 

GTCCACAAGCAGAGGCCCCCTCGATGGGAGGGAGTGGCAGGCAGGAGAAGGTGGCGCTGC 

CAGGTGCCCGGGTCTATTGGAGGCGCCCCATCTCAGACTTCCTAACACAGCCTGTGTGGA 

AGGCAGAACAAAGAATGCATGCCCAGTCAGAAATCTGTTCTATTCTGCTCCAGGAAAATC 

GGAAACCTGTGAGTCANAGTCAGAGAAACTTACCCAGCCACGTATTCCTGTTTCATGGGT 

NCTGTAGATGTTTTGAGTCAAGGAAGGTA 

Sequence 1660 

TCGACCNCGCGTCCGGTGGGTCCCTGCCGGCCGGCGGCGGGCGCAGACAGCGGCGGGCGC 

AGGACGTGCACTATGGCTCGGGGCTCGCTGCGCCGGTTGCTGCGGCTCCTCGTGCTGGGG 

CTCTGGCTGGCGTTGCTGCGCTCCGTGGCCGGGGAGCAAGCGCCAGGCACCGCCCCCTGC 

TCCCGCGGCAGCTCCTGGAGCGCGGACCTGGACAAGTGCATGGACTGCGCGTCTTGCAGG 

GCGCGACCGCACAGCGACTTCTGCCTGGGCTGAGCTGCAGCACCTCCTGCCCCCTTCCGG 

CTGCTTTGGCCCATCCTTGGGGGCGCTCTGAGCCTGACCTTCGTGCTGGGGCTGCTTTCT

GGCTTTTTGGTCTGGAGACGATGCCGCAGGAGAGAAGAAGTTCACCACCCCCATAGAGGA 

GACCGGCGGANAGGGCTGC 

Sequence 1661 

GGTGTCGACCNCGCGTCCGGCGCCCCGCTCGCATTGTTCGGGCGACTCTCGGAGCGCGCA 

CAGTCGGCTCGCAGCGCGGCACTACAGCGGCCCCGGCCCGGCCCCCGCCCGGCCCCGGCG 

CAGGCAGTTCAGATTAAAGAAGCTAATTGATCAAGAAATCAAGTCTCAGGAGGAGAAGGA 

GCAAGAAAAGGAGAAAAGGGTCACCACCCTGAAAGAGGAGCTGACCAAGCTGAAGTCTTT 

TGCTTTGATGGTGGTGGATGAACAGCAAAGGCTGACGGCACAGCTCACCCTTCAAAGACA 

GAAAATCCAAGAGCTGACCACAAATGCAAAGGAAACACATACCAAACTAGCCCTTGCTGA 

AGCCAGAGTTCAGGAGGAAGAGCAGAAGGCAACCAGACTAGAGAAGGAACTGNNAACGCA 

GACCACAAAAGTTTCACCAAGACCAAGACACAATTATGGCGAA 

Sequence 1662 

GACCACGCGTCCGGAAGGAAGGGACGGGCTGAGTTCCCCGACGAGAGACACACCCAGATT 

TTCCTGCAGCTTGGGGAGAGGTCCTCCCAGGAGCCTTGGTCCCTCCTGGCCTGCCGGAGT 

CCTTAGCCAGGATGGAGGCTGTTGTGAACTTGTACCAAGAGGTGATGAAGCACGCAGATC 

CCCGGATCCAGGGCTACCCTCTGATGGGGTCCCCCTTGCTAATGACCTCCATTCTNCTGA 

CCTACGTGTACTTCGTTCTCTCACTTGGGCCTCGNATCATGGCTAATCGGAAGCCCTT 

Sequence 1663 

GTCGACCACGCGTCCGGGCTCCATCCGGGCTATCCTGCCGCCTTAGCGGCTGCTTCTCCC 

CAGGATGCGGGCAGGGGGCCTCTCTCCCACTCCCCACACACCGATTTCTGAGTAGCGATA 

GGGGCTGGAGGCTTATTTTATGGGGTAGGGGGCCGCTGGTAGGCGAAGATTGTCCGAGGG 

AGAGGGGGAGGATGAAGCCAGTGCGTGGCGGAGACTTGCCAGATGTTGATGCCTAAGAAG 

MCCGGATTGCCATTTATGMCTCCTTTTTAAGGAGGGAGTCATGGTGGCCAAGAAGGAT 

GTCCACATGCCTAAGCACCCGGAGCTGGCAGACAAGAATGTGCCCAACCTTCATGTCATG 

AAGGCCATGCAGTCTCTCAAGTCCCGAGGCTACGTGAAGGAACAGTTTGCCTGGAGACAT 

TTCTACTGGTACCTTACCAATGAGGGTATCCAGTATCTCCGTGATTACCTTCATCTG 

Sequence 1664 

CCGCGTCCGGGGGTTGGTACCCGAGCGCCTTCCCCTCACCTCAACCAGAGAAGAGCATCC 

GGTTGCTTTTTAAAGCTTTTAGCCTGCCCTAGCMGGACAAAGCATGTTAGATTAGAGAT 

GCTTCTGCTGATCGCAGGGGTTCTTATTTGAAAACATCTATGATGGGGGTGGGGTGGGAG 

GAGACAGGTTGTGGTTATGCAGGAAAATCTTGTCCTAAAAATATATGAGTTTGGGGGTAA 

GGGGTGGGATAGCCMGCAAMTCAGTMTTATTTTAAAATGAACATATGTATTTTTATT 

AACTTTTAGTTAAATACAGATTTTACAACGAGGTCAGCATAAGCCTAAATCTATATAGAG 

GGCTMCTCAGGCATTGTCTTGTTTATTTGTAGACTGGATTAAAAACAACCTGTCCTGTT 

TTGTNAGTTCCCAGCTTCTTTCGTTTAGAATAAATTAGACCAAAAGAA 

Sequence 1665 
GCGTNCGCTTAATGTCAATGTGGCCTGGGCTGGAGGTCTGGACCCCCCCATGGGGGATCC

TGAGTACCTGGCTGCTTTCAGGATAGTCGTGATGCCCATCGCCCGAGAGTTCTCTCCAGA 

CCTAGTCCTGGTGTCTGCTGGATTTGATGCTGCTGAGGGTCACCCGGCCCCACTGGGTGG 

CTACCATGTTTCTGCCAAATGTTTTGGATACATGACGCAGCAACTGATGAACCTGGCAGG 

AGGCGCAGTGGTGCTGGCCTTGGAGGGTGGCCATGACCTCACAGCCATCTGTGACGCCTC 

TGAGGCCTGTGTGGCTGCTCTTCTGGGTAACAGGGTGGATCCCCTTTCAGAAGAAGGCTG 

GAAACAGAAACCCAACCTCAATGCCATCCGCTCTCTGGAGGCCGTGATCCGGGTGCACAG 

TAAGTGTGGAGATGGGACACTCGCTGAGCTCAGACTGAAGGATCTTGGT 

Sequence 1666 

CGACCNCGCGTCCGGTGTGATGATCGCTACTGCTGGAGACCGCACAGAGGAGTTCCACGG 

CCACNGCAGTGAACCTCCTGGGGAACTTGCCCCTCAAGTGTCTGGATGTTCTCCTCACCC 

TGGAGCCACATGGAGACTCCACGGAGTTCATGGGAGTGAATATGGATGTGATTCGTGCCC 

TCCTCATCTTCCTAGAGAAGCGTTTGCACAAGACACACAGGCTGAAGGAGAGTGTAGCTC 

CCGTGCTGAGCGTGCTGACTGAATGTGCCCGGATGCACCGCCCAGCCAGGAAGTTCCTGA 

AGGCCCAGGTGCTGCCCCCTCTGCGGGATGTGAGGACACGGCCTGAGGTTGGGGGAGATG 

CTGCGGAACAAGCTTGTCCGCCTCATGACACACCTGGACACAAGATGTGAAGAGGGTGGC 

TGCCGAGTTCTTGTTTG 

Sequence 1667 

NCGCGTCCGACACTATTTAGAGAGCTCCCTTCCCACCTCTCTGCCCAGCCTTGTTACCTC 

ACTTCTGCTCTGGCCATGGCTGTGAAGGGCCCAGCCAGCTCCCTGTTTTGATGTTCTGTG 

CAACAGCTCCGGGGTCTTGTGACTGGAGATCCTCAACAGGCCCTGGAGCCAGGACTGGAG 

TCTTGGCAGCTGATGAGCAGCACCTTGCCGGCCAGGAGGAGCTGATGCTGACGATCTCCC 

CAACATCTGAAGGCTTAAAGAACATTGTCGTTCTTCAGCCCTCCTTGCTTCTCTCAATAC 

AATAAGACATTGCAGAAGCAAAAGGGTGGCCTCTGCTCCAGGCAAGGCAGCTGGCTCTGT 

CTGGGGGCGTCGGCCTGGGGCTTGGGTGCCACGTGCTGAGATTGCATAGTCAAAACAAGC 

CATTTTTGCCMCAATAGCTTGTGGCTCCCACATTTTTCTACCCTTGCACTNAANGGCCA

GACCACTCTNTGCATGGACCAANACCATNTTTCCAMCCCATGGGGCTTTTTTTNCC 

Sequence 1668 

CANGAATACTGAAAAATGAAGCCTAAAATGAAGTATTCAACCAACAAAATTTCCACAGCA 

AAGTGGAAGAACACAGCAAGCAAAGCCTTGTGTTTCAAGCTGGGAAAATCCCAACAGAAG 

GCCAAAGAAGTTTGCCCCATGTACTTTATGAAGCTCCGCTCTGGCCTTATGATAAAAAAG 

GAGGCCTGTTACTTTAGGAGAGAAACCACCAAAAGGCCTTCACTGAAAACAGGTAGAAAG 

CACAAAAGACATCTGGTACTCGCTGCCTGTCAACAGCAGTCTACTGTGGAGTGCTTTGCC 

TTTGGTATATCAAGGGGTCCAGAAATATACTAGAGCACTTCATGATTCAAGTATCACAGG 

MTTTCACCTATTACAGAGTATCTTGCTTCTCTAAGCACATACAATGGATCAATCCATTA

CTTTTGCTTTGGAGGATGGAAAGTTATGAGATATATGTTGAAGACTTGAAAAAAG 

Sequence 1669 

GTCGACCNCGCGTCCGCCCGCCCATCACTGCTGTTCCTCCAGGGCCAGCACTCGGGCGAG 

GCAGGGGAGCTGCCTTCGGTACATAATTTGAAGGGGCACTCCCTCTTGGGCACATGCCGG 

CCCTGAGTGCCTCCCTTGCCTCACTCTGATCCTGGCCCCATAATGTCCTCAGTGGAAGGT 

GATGGGGGCCGGTGCTGTGGGGAGAGTAGAAAGAGGGGTTGGCATGACTAAAAATACCAG

TATGTGTATTAAGTATTTTGAGAATGAAATGCCAAGGAGTGCCTACTATATGCCAGCTCT 

AGGAATGGAGTAGACAGTGGACACAAGAAGGACTTACGCCCTGAGCACAGGTGCCAATGG 

TGACAAGACTGGCAAGACGTGAGGGCATGAATGGTTCATTCAGGCAGCTGCTGCAGATGT 

GGTCACCTGGTGCCATCTGCTGCTCCCTTTTCCACTTTTCTATGTCCTCCTTCCACCCCA 

A 

Sequence 1670 

CGACCNCGCGTCCGGTCTGAAGGGTCTGGCTGGTGAGCCAGGTTTTAAAGGCAGCCGAGG 
GGACCCTGGGCCCCCAGGACCACCTCCTGTCATCCTGCCAGGAATGAAAGACATTAAAGG 
AGAGAAAGGAGATGAAGGGCCTATGGGGCTGAAAGGATACCTGGGCGCAAAAGGTATCCA 
AGGAATGCCAGGCATCCCAGGGCTGTCAGGAATCCCTGGGCTGCCTGGGAGGCCCGGCCA 
CATCAAAGGAGTCAAGGGAGACATCGGAGTCCCCGGCATCCCCGGTTTGCCAGGATTCCC

TGGGGTGGCTGGCCCCCCTGGAATTACGGGATTCCCAGGATTCATAGGAAGCCGGGGTGA 

CAAAGGTGCCCCAGGGAGAGCAGGCCTGTATGGCGAGATTGGCNCGACTGGTGATTTCGG 

TGACATCGGGGACACTATAAATTTACCAGGAAGACCAGGCCTGAAGGGGGAGCGGNGCAC 

CACTGGAATACCAGGTCTGAAGGGATTCTTTGGAGAGAAG 

Sequence 1671 

GAGTCGACCNCGCGTCCGCAACTGGTGTCCAGCTCGGTGCACTCCAAGCGCCGTTCCCGA 

GCGGACCTCACGGCCGAGATGATCAGCGCCCCGCTGGGCGACTTCCGCCACACCATGCAC 

GTTGGCCGGGCCGGAGACGCCTTTGGGGACACCTCCTTCCTCAATAGCAAGGCTGGCGAG 

CCCGACGGCGAGTCCTTGGACGAACAGCCCTCTTCTTCATCTTCCAAACGCAGTCTCCTG 

TCCAGGAAGTTCCGGGGCAGCAAGCGGTCACAGTCGGTGACCAGGGGGGAGCGGGAGCAG 

CGTGACATGCTGGGCTCCCTGCGGGACTCGGCCCTGTTTGTCAAGAATGCCATGTCCCTG 

CCCCAGCTCAATGAGAAGGAGGCCGCGGAGAAGGGCACCAGTAAGCTGCCCAAGAGCCTG 

TCATCCAGCCCCGTGAAGAAGGCCAATGACGGGGGAGGGCGGCGATGAGGAGGCGGGCAC 

GGAGGAAGGCAGTGCCCCGTCGGAAT 

Sequence 1672 

CGCGTCCGCTCGCGGCNGGGCATCGNGTACATCCTCAGCAACCATGGGCTACGTGCGCCA 

GCTCTCCCAGGCCCTGGACACATCCAACGTGATGGTGAAGAAGCAGGTGTTTGAGCTACT 

GGCTGCCCTGTGCATCTACTCTCCCGAGGGCCACGTGCTGACCCTGGACGCCCTGGACCA 

CTACMGACGGTGTGCAGCCAGCAGTACCGCTTCAGCATTGTCATGAACGAGCTCTCCGG 

CAGCGACAACGTGCCCTACGTGGTCACCCTGCTTAGCGTGATCAACGCCGTCATCTTGGG 

CCCCGAGGACCTGCGCGCGCGCACCCAGCTGCGGAACGAGTTTATCGGGCTGCAGCTGCT 

GGACGTCCTGGCTCGCCTGCGGTGAGTCCCCACTGTAGCGGTCCTGCCGGNTTNCCCCTC 

CTGCTCCCAAGGCCAGGCCCACCTGCCCTTTGGCTCCCAGCCACCTCACCTAAGCAGCAC 

CTTCCAGATGGCAGGGGAGGTGGC 

Sequence 1673 

GTCGACCACGCGTCCGGCCAGAGCTGAGTGGCAGCCGCCTCCCTTATGCAGGACATGTGC 

TCTCGGCTTCACCAGGGTTCTGACCGGGTCTGCTTCTGCATTCACAGCGCCTCCTGGACC 

TGAAGGCATCTGAGTGTGAGACCCTGTTCTAACTCTTAGAAGTGACATTGTAAGAGGTGG 

TGGGGACCAGCTAATTGGTCCAACCCAGCCTGAGTGCACCACCCTTTGAACAAATGTATC 

AGTGATGAAAATTTGCCTTTGCCCCGGCTTGCCTGTAATCCCAGCACTTTGGGAGGCCGA 

GGTGGGCGGATCACTTGAGGTCGGGAGTTCAGGACCAGCCTGGCCAGCGTGGCGAAACCC 

CGTCTCTACTAAACATAAAAAAATTAGTCAGGTGTGGCGGTGCGTGCCTGTGGTCCCAGC 

TATTCAGGAGGCTGAGGCACCAGAATTGCTTGA 

Sequence 1674 

TGACGGCGGCCCGGCCGACGGGAGCCGGGGCGGGGCGGCGGNCCANCGAAGGAGCGCGCG 

GGCGGTCTGGCCCCGCCCCCTCCCCGCCCGCCTTCCCGGTGACCTTCAGGGGCCCGGGTG 

GCGGGCGCAGGCCCCTGCGGCGGCGGCGGGATGTTCGTGCAGGAGGAGAAGATCTTCGCG 

GGCAAGGTGCTGCGGCTGCACATTTGCGCGTNCGACGGCGCCTAGTGGCTGGAGGAGGCC 

ACCCNGGACACCTACNGTGGANAAANCTCAAGGAGCGCTTGCCTCAAAGCACTGTGCTCA 

TGGGGAGCTTANAAGATCCCCAAAAGTATAACCCATCATTAAATTTAATCCCACGCTGCC 

TNAANANAAGGGGTGCTTGNGTGATTGCCATGNACCATNCTTGGGAAGGAAGAAACCATT 

CCCAGGACCCAAAAGATGGGCCCTATTTNTTGGATTA 

Sequence 1675 

CACGCGTCCGGGATCCCGTACCCGGGACAGACTCGGCGCCGCTGGCTGGCCTGGCCTGGT 

CGTCGGCCTCTGCACCCCCGCCGCGGGGGTTCAGCGCGATCTCCTGCACCGTCGAGGGGG 

CACCCGCCAGCTTTGGCAAGAGCTTCGCGCAGAAATCTGGCTACTTCCTGTGCCTTAGTT

CTCTGGGCAGCCTAGAGAACCCGCAGGAGAACGTGGTGGCCGATATCCAGATCGTGGTGG 

ACAAGAGCCCCCTGCCGCTGGGCTTCTCCCCCGTCTGCGACCCCATGGATTCCAAGGCCT 

CTGTGTCCAAGAAGAAACGCATGTGTGTGAAGCTGTTGCCCCTGGGAGCCACGGACACGG 

CTGTGTTTGATGTCCGGCTGAGTGGGAAGACCAAGACAGTGCCTGGATACCTTCGAATAG 
GGGACATGGGCGGCTTTGCCATTCTGGTGCAAGAAGGCCA
Sequence 1676 

TCCCTCTGCTGATGATGGATGCCCCTAACACCTGTGCCTAACACCCCTACTGAACCCCAC 

AGCTCCAGCCTTAGTTTTTGGAGTCMGTGTTAAAGGTTTCTGGCCAGAGGAATTGGGGT

CTTGCCATCCCTGCMTAGCCCTTTTATGGGCTCTGGGAGACAGCTTTAGGGAATAAATG

GGGATTTTCCCCTTTTTCTACCCACTCCTTTGCTTCCTCCAAGACTTACCCMCTCCTTC 

CCCCTCAGAGAACCAAATAGCCTGAGGAAGCAGGAGAGTTCCTGGTTATGGCAGATTCTT 

GGTGATTTGGGGCTTCAAGACAGTAGGTGAGAGATGCTGTCAGGGACGTATCTTCTTCAT 

ACCAMGTCACTGGTCCTTTCTCAGCCTCTCTCGTGCTTTTCTCCTMTGACCATATTTT 

TGCCAAAAATTGGGAATATGTTATCTGACAGACCAGAATATTTGAAGGTTTGGGCTG 
Sequence 1677 

GCGCCGCGGATATNCGGATCAACCTATGGTNTCAATATTGTNAGTTATTCAGCATAAACA 

GMTTATTTCCCMNACTTGATCTGAMTATTNlsn"AATGGTCNTACTNGAMCTTATATT 

CTTNCTGGGAGNGANGTNTTTATCATTTTTCCATGGAGACAGGTTCTAACTCTGTTGCCC 

AGGCTGCANTGCAGTGATGTGATCATAGCTCACTGCAGCCTGAAACTCCTGGGTGTCAAG 

TGATCTCTGGCCTCAGCCTCCCAAGTAGTTGGAACTTCAGATACGTGCCACCACAACCAG

CTMTTTATTTTTTAGAGATGAGGTNTCGCTATGTNGCCCAGTCTGGCCNNCTAGCCNCA 

AGTGATCTGGCCATCTNAGCCTTCAGTTGGAGATGTCTGATTTATGTTAATATAAGAAAG 

CTGTTGATCGTTTATCATAAANGCATTT 

Sequence 1678 

GTCNCCNCGCGTCCGCTCCTCCGCCGGCATGCAACTCGGCGCCCGCGGTCCATGGACCGG 

AACCTCGGGCCGACGGACGGGAACCCGGGCCGCGATCGCCGCCTCCCCGCCTCAGGCTCC 

TCCTCCTCGCTCTCCGCCGCCTCCGCCGGACTCCCGCAGGCCCTGCACCGCCGCCGCCAG 

GCTAGCGGAGCTGCCCCGGGAAGCTGGGTGACGGGTTCGCGGCTGCCGCCGGACTGCGGC 

CTACTCCGCCGCCTCTCAGTGCTATTGTCCCTGGGCCTGGCCTTGAGCGGGTCCACTGGG 

GAAGGCNCGTGTGCGCCGGCTCCGCGGAAGATGCCGGACCAAGCCCTACAGCAGATGCTG 

GACAAGAAGTTGCTGGGTTTGTTTTGCTACTGATGAAGATGATAGAACAGCTGAATGGGT 

GAGGACCATGGCAGGTGCAGGAGGATCTACAAAATGGGTTCACCAGGGCCTGTCTACAAC 

GCTGGGTGGATGAAAAGCAAAGAG 

Sequence 1679 

GCGTCCGGGCCCGCGACCGAGCGTGCGGACTGGCCTCCCAAGCGTGGGGCGACAAGCTGC 

CGGAGCTGCMTGGGCCGCGGCTGGGGATTCTTGTTTGGCCTCCTGGGCGCCGTGTGGCT 

GCTCAGCTCGGGCCACGGAGAGGAGCAGCCCCCGGAGACAGCGGCACAGAGGTGCTTCTG 

CCAGGTTAGTGGTTACTTGGATGATTGTACCTGTGATGTTGAAACCATTGATAGATTTAA

TMCTACAGGCTTTTCCCAAGACTACMAMCTTCTTGAAAGTGACTACTTTAGGTATTA 

CAAGGTAAACCTGAAGAGGCCCGTGTCCTTTCTGGAATGACATCAGCCAGTGTGGAAGAA 

GGGGACTGTGCTGTCAAACCATGTCAATCTGATGAAGTTCCTGATGGAATTAAATCTGCG 

AGCTACAAGTATTCTGAAGAAGCCAATAATCTCATTGAAGGAATGTGAACAAGCTGAACG 

ACTTGGAAGCAGTGGAT 

Sequence 1680 

GTCCGGCGTGGGGAAGGGTGGGGTGAGGGGGCGTGGCCGCAGCTAGGGCGGCGAAACTCT 

CCTCCCCTCGGCCCCACCGCGTGGGACGGCGTGAACGTGGTGTCGGAGGGATGTCAGCCT 

TCTCTGAGGCGGCGCTGGAGAAGAAGCTGTCGGAGTTGAGCAACTCGCAGCAGAGCGTGC 

AGACCTTGTCCCTGTGGCTCATTCACCACCGTAAACACTCGCGGCCCATCGTCACCGTGT 

GGGAGCGGGAGCTGCGGAAAGAGTGGAGGTGCAACAAATGAGAAATTCCAATTGGAGATT 

TTGTCAAACAGGMTTTACTCCAMCCMACAGGMGCTTACTTTTCTCTACCTAGCCAA 

TGATGTCATACAGAACAGCAAGAGGMGGGGCCAGAGTTTACAAAAGATTTTGCACCAGT 

TATA 

Sequence 1681 

CCGGCAAAGCAGGGACTCCTGATTTATATGTCCCTCCTCCTGGCAATCCTCTCACCCCAC 
CTCCCCTGAGAACCTCAGTTCTTCCTAAATTGCTAAAGCTGAGGGGAAAGGGATGCTTTG
CGGCAMGGCGCTGCTGCCTCAAGCTTGCTTTACATGCCTCTCTAGTTCCTCTCGCACTA

CAGAGTGGTAGCGAACAAAAGCGTTCGCCCTAGAAGCGACCTGAATGGAAAAATCTGCAC 

ACGAACTAATGGGTTTGTCACGGAAAGTAGGGAACCGGGTCTGCAGCATTCCCTGGAGAC 

AGACTTTCTGGTTGGTTTTCAAGGGTCCAAGGCAGCCATCAGCCCGGCTGTGCCCTCCCA 

CCCTGCCTCCCACCCAGTTGATTCTCTCTTTGTGTAAGTTTAGCCCCTCTGAGGGTGGTG 

GAGTGAGAGCATCCCATCAGATATATATACGATTCATCAGTCGGCACTTAAAAAG 

Sequence 1682 

TCACCNCGCGTCCGAAAAACGCAGATGATATACCTGCAACATCNGTCATGGCTGCGCCCT 

GTGCTCAGAAGCAACCCGGGTGGAATATTGCTGGTGCAACAGTGGCAGGGCACAGTGCCA 

CTCAGTGCCTGTCAAAAGTTGCAGCGAGCCAAGGTGTTTCAACGGGGGCACCTGCCAGCA 

GGCCCTGTACTTCTCAGATTTCGTGTGCCAGTGCCCCGAAGGATTTGCTGGGAAGTGCTG 

TGAAATAGATACCAGGGCCACGTGCTACGAGGACCAGGGCATCAGCTACAGGGGCACGTG 

GAGCACAGCGGAGAGTGGCGCNCGAGTGCACCAACTGGAACAGCAGCCGCGTTGGCCCAG 

AAGGCCTACAGCGGGCGGAGGCCAGACGCCATCAGGCTGGGCCTGGGGAACCACAACTAC 

TGCAGAAACCCAAGATCGAGACTCAAAGCCCTGGTGCTACGTCTTTAAGGCGGGG 

Sequence 1683 

CCGTCCGCTCCTTGGCAAGAACGAAAGGTGTGATGAAACCTCCCTGCTCGGAAGGGTCTC 

CGTGGAGGTGTCCTCATTTCACATGCTGGGTTTTGCAAGCGAGGAAGCCAGGCAGTGGAG 

GAACTAGAGAGAGGCAGGCGTGTGTGTGGACAAGCGCTGGAGCCGCAGCCCTCAGACTGG 

CACGGGAACGCCAGCGTTGGGTGTTCAGATTCCACGCGTATGTCTGGGCTCACTCACAGC

ATGGCCGAGTGTCTGCAGTGCTGGTCCTGACCCTTCCAGAGCAGCAGTGGACAGATGAGA 

TAAGACTGTTTCAGAAACAAAGATGGCCACAGCCTTCCTAACAAGCAGGTCATCTGGCCA 

TGTCTGTATTGTAACTGGTAAAAGGCTTCAAGTCAGATTGATGATCAAGAAAANGTCAAA 

ACCCCAGCCCAAGATTGGGAMGCAGGTTNGTGGNTCCAANGCTTTTTAAAAAAATTATT 

TGAAGCTCTTCATTCCTNTTTCTGTGAGTGTGTCTTTCTCTT 

Sequence 1684 

NCCCATACTGGGGGCCCCCTTCCTGCAGGCCCCATCAGGTGCAGAGCTGTGGGTCTGGTT 

CCAAGACACTGTCACTGATGTGGATAAATCTTGGAAGGAGCTCAGTAATGTCCTCTCAGG 

GATCTTCTGCGCCTCTCTCAACTTCATCGACTCCACCAACACAGTCACTCCCACTGCCTC 

CTTCAAACCCCTGGGTCTGGCCAATGACACTGACCACTACTTTCTGCGCTATGCTGTGCT 

GCCGCGGGAGGTGGTCTGCACCGAAAACCTCACCCCCTGGAAGAAGCTCTTGCCCTGTAG 

TTCCAAGGCAGGCCTCTCTGTGCTGCTGAAGGCAGATCGCTTGTTCCACACCAGCTACCA 

CTCCCAGGCAGTGCATATCCGCCCTGTTTGCAGAAATGCACCGCTGTACTAGCATCTCCT 

GGGAGCTGAGGCAGACCTGGCAAGTTGTATTTGATGCCTTCATCACGGGGCAGGGAAAGA 

A 

Sequence 1685 

CCGCTGGTTATTACCCAGCTGGATGGTTTCCTTTTAGGCAAGAAGGAGGTCATCAGCAGG 

CTCCCAACAATAATGCCGAAGTTAACAATGATGGGCAAAATGCAAACAACTTGGAACTTG 

AAGAAATGGAGCGTCTTATGGATGATGGGCTTGAAGATGAGAGTGGAGAAGATGGAGGTG 

MGATGCCAGTGCMTTCAMGGCCTGGATTAATGGCTTCAGCTTGGTCTTTCATCACCA 

CCTTCTTTACTTCACTAATACCAGAGGGGCCTCCCCAGGTTGCCAATTGACCTGAAAAAC 

TGTGCCAGCTACMGGAGGGTCTGACTTCAGGAAAGTGGTTTAAATAACAGTGCAATTTC 

MAMAATTTATAACTTTCTTTTGATCATCATGTACAGAGGTG I I I I I I I I CTTTAGGCT 

TCTCATG CATATGAATATTTTAAGCACGAATGGACTACTAAATATCTGAGI ITTTTTTn 

I I I I I I I I I AAAGAATC 

Sequence 1686 

CGCGCCGGTTTGGCTGCCCTGCATAAGCTGCTACAAATAGAATAAAGAATTTCATACGCC 
TGTATCTATCATTTAGATGCATGGAAAAMATGGGCTTTGCACACAATGGGTTTGGAGCT 
GACTGGGAACAATGGAAAAAATTACATTAGCTGTGGTTGTAAAG I I nTTTGTTTI IGGT 
TTTGI I I I M II M C I I I I I IC I I I I I I M M M I I ACCATCTTGTGAAAGGTTtCTGAA 
ACTCGATAATAAAAAGCGGTTGGTGTAAATTATTCTTTTGTGTCACATTTTTAGAAGGAA
AAACATAAAAGAATGTATCCTTAGTACTGGTTCTTAAACAGCCCATAAAAACCCATTGGC
CTGAAGCTTATATCTCAGGCCTATGCCCATCTTATAGTCTTGGAAGACAAAA 
Sequence 1687 

CGCGTCCGTGGGTCTCGCCCTCAACCTGTGCATGTATATGTGTGTCTTTGTGTGTGTATG 

TGTGATCTCTGCCTGCAGGACCAGCCCGGTGGCACCCTGGATCTGACCCTGATCCGTGCC 

CGCCTCCAGGAGAAGTTGTCACCTCCCTACAGCTCCCCACAGGAGTTTGCCCAGGATGTG 

GGCCGCATGTTCAAGCAATTCAACAAGTTAACTGAGGACAAGGCAGACGTGCAGTCCATC 

ATCGGCCTGCAGCGCTTCTTCGAGACGCGCATGAACGAGGCCTTCGGTGACACCAAGTTC 

TCTGCTGTGCTGGTGGAGCCCCCGCCGATGAAGCCTGCCTGGTGCTTGGCCTGAAGTTTC 

CCAAGGAAGCTGTCTGGTGGGCCCCTTGGGTGATGGGCCCCTTGGAGGGCTTGGAAGCCC 

CCCCATGGGCCAAGCCCCAGCCCTGGGCTTCTGGTTCNTCTTGTCCCTGGTCACCCCCAT 

CCCCACTCCCCCTTGGGTGGGCCTTGAACTTNCCACTTCCCTTGGGTGGGGCCC 

Sequence 1688 

AGGAGGNTTGAAGGAGTTGNNGGAGGAGGAGGATGGAGGCGAGGGCGAGCGAGCCCAGCG 

GGGTCCNGGNCGCCCCGCGGGCCAAAGTCGAGCCCTNCCGCCNNTGGGCGAGCGCGCCAG 

CCGCCCNTTTCANAACAGTTCGNCGCCACAAAANAAAAGAACGGGGGGGTGCCGAGGTTN 

CCCATTGANCTCTTAAAGTGGTGCAGGTCCCTGTTGAGTGCGCTGCACCGGGCCGTGACC 

CGCGCCCCTGTGCGTCCC 

Sequence 1689 

GGAGTCGACCACGCGTCCGCGCCGGCCGCCGGTGTCCGGACCGCTCGCCCCCGTTTGGAC 

CCGACTTCGGTTCTTCTGGGGTGTTGATGCTCCTAAAGCCCGAGAGCACGTGTCCAGACC 

CTAGCCTGTACGACGCTGACTCTGCCCGGTCCCAGAACCAAAGCCATGCCGGGGTGTGGC 

CTCTGACCCCACGCGGAGGGGACCTCGCCTTGCGGGACCCCACCTGGAACCCGACCTNCC 

AGNCTCGCAGCCGGCCTGAGCCGCCATGCGCGGGAAGTTGCTGCCGCTGGCCGGCCTATA 

CCTGGTGCAGGGCCTGCCCTACGGGCTCCAGTCCGGCCTCCTGCCAAATGCTGCTTGCGT 

GCCGGCGGCCTCTCGCTGACGCGCGTGGGGCTGGCCAAGGTTCTGTACGCTNCGTGGCTT 

GCTTCAAGCTGGCT 

Sequence 1690 

CNCCCCGCGTCCGCGGACGCGTGGGTGCTTGCTGCTGAACCTGAGCTGCAAGTTGGAATT 

GATATAATGMGACTAGTTTTCCAGGTCGTGGTTCAATTCCAGMTTCTTTCATATTATG 

AAMGAAAGTTTACCAACAAAGAATGGGAAACAATCAGAAGCTTTAAGGATGAGTGGACT 

CAGCTGGATATGTTTTATAGGAATTGGGCACTTAAGGAAAGCTTCATAAAAGCCATTGGT 

GTTGGACTAGGATTTGAATTGCAGCGGCTTGAATTTGATCTATCTCCATTAAACTTGGAT 

ATAGGCCAAGTTTATAAAGAAACACGTTTATTCCTGGATGGAGAGGAAGAAAAAGAATGG 

GCATTTGAGGAMGCAAAATAGATGAGCACCATTTTGTTGCAGTTGCTCTTAGGAAACCC 

GATGGGATCTAGACATCAGGGGATGTTCCATCTCAGGATGATTCCAAACCAACCCAGAGG 

GCAATTTACTATTCTCAACTTTAATGATTTAA 

Sequence 1691 

GACCACGCGTCCGCCCGTCCAGGAGCCCTAGGAGTGCTACGGGGGGCCGGAGCCTTGCCC 

GGGCCGCTGCCCCGTCCCTGGATTCGGGGCTGGACGCAGCAAGCGGGGCGCTGTGTCCCC 

AAGCTCCCCGTCCTCGGCCAGGCGGGCACCACGGCAGGGGCTGAGCTACCCTCATGGAAG 

GGAGAGGACCGTACCGGATCTACGACCCTGGGGGCAGCGTGCCCTCAGGAGAGGCATCCG

CAGCTTTTGAGCGCCTAGTGAAGGAGAATTCCCGGCTGAAGGAAAAAATGCAAGGGATAA 

AGATGTTAGGGGAGCTTTTGGAAGAGTCCCAGATGGAAGCGACCAGGCTCCGGCAGAAGG 

CAGAGGAGCTAGTGAAGGACAACGAGCTGCTCCCACCACCTTCTCCCTCCTTGGGCTCCT 

TCGACCCCCTGGCTGAGCTCACAGGAAAGGACTCAAATGTCACAGCATCTTCCACAGCCC 

C 

Sequence 1692 

ACAGAATTTAGGGGTGGGTGAAAGCACTTGNGCTTTAGCTNTTTCATATTAAATATATAT 

CTATATTTAAACATTCATGGCATAGATGATGATTTACAGACAATTTAAAAGTTCAAGTCT 

GTACTGTTACAGTTTGAGMTTTGTAGTATTACATCATTACATAAGTCATTTTAGTAACA
GCCTTTGTGAAATGAACTTGTTTACTATTGGAGATAACCACACTTAATNAAGAAGAGACA

GTGAMGTACCATCATMTTMCCTAMTTTTTTGTTATAGCAGAGTTTTCTTGTTTAM 

AAMAATAAMTCATCTNGAAMGCATTTTGTACAGTTAAMTGTATMTGMGCTTTTG 

CCAACCAGACTGGTGCTTAGCAACCAAA I 1 1 I I I I I I I AAATAAAGCTTTTATGGCAGGT 

GGGTAAATAAGGTGGCCTTCCAAATATATTGGTGTCTTGATGGAGAGTTNTTTAGTTGAA

ATGAATGTGGGTCTTTTCT 

Sequence 1693 

CGGTTAACATGGCCGTCACCGACAGCCTCAGCCGGGCTGCGACTGTCTTGGCAACTGTNT 

TGCTCTTGTCCTTCGGCAGCGTGGCCGCTAGTCATATCGAGGATCAAGCANAACAATTCT 

TTATGAAGTGGCCNATCAAACAANCTGGGCCTGTTCTTGGTGTGTACATCCCCGATTCTG 

GTATTAATTATCGACATGTTGCAAATACCCTTTCTGTTTATAGAAGTGTCAAGAGGCTAG 

GTATTCCTGACAGTCACATNTGCCCCTAATGCTTGCAGATGATATGGCCTGTAATCCCTA 

GAAATCCCAAACCAGCTACAGTGTTTTAGTTCACAANCAATNTG^AACTAAATGGTGTAT 

GGGAGAATGATGTGGGAAGGTGNNATTATAGAAGTTTNTTGAGGTAAACNGGTGGGAGAA

NNNTTTTTTTACCNGGGTAATTTAANCTGNGGGAGGGANTCCCCACCCTAAGTANCTTCC 

TTCGGG 

Sequence 1694 

GTCCG CMGATGGACGCAGCTCTCTGACCTACGACACTCTCCGGTTTGCTGAGTTTGAAG 

ATTTTCCTGAGACCTCAGAGCCCGTTTGGATACTGGGTAGMAATACAGCATTTTCACAG 

AAAAGGACGAGATCTTGTCTGATGTGGCATCTANACTTTTGNTTTACATACAGGAAAAAC 

TTTCCAGCCATTGGGGGGACAGGCCCCACCTCGGACACAGGCTGGGGCTGCATGCTGCGG 

TGTGGACAGATGATCTTTGCCCAAGCCCTGGTGTGCCGGCACCTAGGCCGAGATTGGAGG 

TGGACACAAAGGAAGAGGCAGCCAGACAGCTACTTCAGCGTCCTCAACGCATTCATCGAC 

AGGAAGGACAGTTACTACTCCATTCACCAGATAGCGCAAATGGGAGTTGGCGAAGGCAAG 

TCCATAG GG CCAG GTGGTACGG 

Sequence 1695 

CCCCGCGTCCGCTCGNAGCTGTCCGCGGTCTGTTTGGCCCGAACGGCGGCGGAGGCGCTG 

ATCATGGCGACATTCATCTCGGTGCAGCTGAAAAAGACCTCAGAGGTGGACCTGGCCAAG 

CCGCTGGTGAAGTTCATCCAGCAGACTTACCCAAGCGGCGGGGGAAGAGCAGGCCCAGTA 

CTGCCGCGCGGCGGAGGAGCTCAGCAAGCTGCGCCGCGCCGCAGTCGGTCGTCCGCTGGA 

CAAGCACGAGGGCGCGCTCGAGACGCTCCTGAGATATTATGATCAGATTTGTTCTATTGA 

ACCCAAATTCCCAT TTTCT GAMATCAGATCTGCTTGACATTTACCTGGAAGGATGCTTT 

CGATAMGGTTCACTTTTTGGAGGCTCTGTAAAACTGGCTCTTGCAAGCTTAGGATATGA 

AAAGAGCTGTGTTGTTGTTCAATTGTGCAGCCTTAGCTAGCCAAATTGCAGCAGAACAAG 

AACCTGGATAATGATGAAGGGATTGAAAATCGCT 

Sequence 1696 

TTCGGGAGTCGACCCCGCGTCCGGGCCAGCCGGCTCGCCCGGGGGCCATGGCAGCAGCGG 

CTACTGCAGCCGAGGGGGTCCCCAGTCGGGGGCCTCCCGGGGAAGTCATTCATCTGAATG 

TGGGAGGCAAGAGATTCAGTACCTCTCGCCAGACTCTCACCTGGATCCCAGACTCCTTCT 

TCTCCAGTCTTCTGAGCGGACGCATCTCGACGCTGAAAGATGAGACCGGAGCAATCTTCA 

TCGACAGGGACCCTACAGTCTTCGCCCCCATCCTCAACTTCCTGCGCACCAAAGAGTTGG 

ATCCCAGGGGTGTCCACGGTTCCAGCCTCCTCCATGAAGCCCAGTTCTATGGGCTCACTC 

CTCTGGTTCGTCGCCTGCAGCTTCGAGAGGAGTTGGATTCGATCTTCTTGTGGAAACGTC 

CTCTTCAATGGTTACCTGCCGCCACCAGTGTTCCCAGTGAAGCGGCGGAACCGGCACAGC 

CTAGTGGGGCCTCA 

Sequence 1697 

CGTCCGMGGAAGGAAGGGACGGGCTGAGTTCCCCGACGAGAGACACACCCAGATTTTCC 

TGCAGCTTGGGGAGAGGTCCTCCCAGGAGCCTTGGTCCCTCCTGGCCTGCCGGAGTCCTT 

AGCCAGGATGGAGGCTGTTGTGAACTTGTACCAAGAGGTGATGAAGCACGCAGATCCCCG 

GATCCAGGGCTACCCTCTGATGGGGTCCCCCTTGCTAATGACCTCCATTCTCCTGACCTA 

CGTGTACTTCGTTCTCTCACTTGGGCCTCGCATCATGGCTAATCGGAAGCCCTTCCAGCT
CCGTGGCTTCATGATTGTCTACAACTTCTCACTGGTGGCACTCTCCCTCTACATTGTCTA
TGAGTTCCTGATGTCGGGCTGGCTGAGCACCTATACCTGGCGCTGTGACCCTGTGGACTA 
TTCCAACAGCCCTGAGGCACTTAGGATGGTTCGGGTGGCCTGGCTCTTCCTCTTCTCCAA 
GTTC 

Sequence 1698 

CGCGTCCGGCCGCGCCCATGGCCCGCCGCTGCCCGGCCGCGCGGGCGGGCCCGCCACGCC 

GCTGTCGCCCACGCGCCTGTCGCGGCTGCAGGAGAAGGAGGAGCTGCGCGAGCTCAACGA 

CCGCCTGGCGCACTACATCGACCGCGTCCGCGCGCTGGAGCTGGAGAACGACCGGCTCCT 

GCTCAAGATCTCAGAGAAGGAGGAGGTGACCACGCGCGAGGTGAGTGGCATCAAGGCGCT 

GTACGAGTCGGAGCTGGCCGATGCCCGGAGAGTCCTGGATGAGACGGCTCGAGAGCGTGC 

CCGGCTGCAGATAGAGATTGGGAAGCTGAGGGCAGAGTTGGACGAGGTCAACAAGAGCGC 

CAAGAAGAGGGAGGGCGAGCTTACGGTGGCCCAGGGCCGTGTGAAGGACCTGGAGTCCCT 

GTTCCACCGGAGCCGAGGTGGAGCTGGCAGCTGCCCTCAGCGACAAGCGCGGCCTGGAGA 

Sequence 1699 

ACGCGTCCGGAAGAATCTACACTTCTTTGCACCAGAGTATGGAGAAGTCACTAATGTGAC

AACAGCAGTGGACATCTACTCCTTTGGCATGTGTGCACTGGAGATGGCAGTGCTGGAGAT 

TCAGGGCAATGGAGAGTCCTCATATGTGCCACAGGAAGCCATCAGCAGTGCCATCCAGCT 

TCTAGAAGACCCATTACAGAGGGAGTTCATTCAAAAGTGCCTGCAGTCTGAGCCTGCTCG 

CAGACCAACAGCCAGAGMCTTCTGTTCCACCCAGCATTGTTTGAAGTGCCCTCGCTCAA 

ACTCCTTGCGGCCCACTGCATTGTGGGACACCAACACATGATCCCAGAGAACGCTCTAGA 

GGAGATCACCAAAAACATGGATACTAGTGCCGTACTGGCTGAAATCCCCCAGGCCCTGAT 

CTGCGCTGTGGCTGTCCCTGGGACGTGCTGCAGCCCTCCTGTCCCTTCCCCCCAGTC 

Sequence 1700 

GGGAGTCGCCCCGCGTCCGGATTTCAGTTGGTGGCGTCATAGTCTCATTACAGTGTCTAT 

CTTGGCATTTACCMTTTACTTGTCATCTTTGTCCCAACTATTAGGGATATCTTTGGTTT 

TATTGGTGCATCTGCAGCTTCTATGTTGATTTTTATTCTTCCTTCTGCCTTCTATATCAA 

GTTGGTGAAGMAGMCCTATGAMTCTGTACAAMGATTGGGGCTTTGTTCTTCCTGTT 

AAGTGGTGTACTGGTGATGACCGGMGCATGGCCTTGATTGTTTTGGATTGGGTACACAA 

TGCACCTGGAQGTGGCCATTAATTGGCACCACTCAAACTCAAACTCAGTCCATCTGATGC 

CAGTGTTGAGTAMCTCAACTACTATGAAATTTCACCTMTGTTTTCAGTTTCACTTCCT 

TTTGAAGTGCAGATTCCTCGCTGGTTCTTCTGAGTGCAGMTMGTGAACTTTTTTGTTT 

TGGTTTGN I I 1 1 J I AAGAAAC 

Sequence 1701 

CCCACCGTCCGCGCGCGGCGCCTCGCCTCGGCCGGCGCCTAGCAGCCGACTTAGAACTGG 

TGCGGACCAGGGGAATCCGACTGATAAATTAAAACAAAGCATCGCGAAGGCCCGCGGCGG 

GTGNTGACGCGANTGCGATNTNCTGCCCANNGCNTCTTGAATGTCAAAGTTGAANAAANC 

CAATGAAGCGCGGGTAAACGGCGGGAAGTAACTAATGACTTCTCATTAAGGGTAGCCAAA 

NGCCCTTCGTCATCTNAATTAAGTTGGACCGCGCANTGAAATNGGATGAAACCNAGANTT 

CCCACNTGTCCCTACCTACNTAATCCAAGGCGGAAAACCACAAGCCAAAGGG 

Sequence 1702 

CGACCACGCGTCCGGACAGATTGATAGCTCTTTCTCGATTCCGTGGGTGGTGGTGCATGG 

CCGTTCTTAGTTGGTGGAGCGATTTGTCTGGTTAATTCCGATAACGAACGAGACTCTGGC 

ATGCTAACTAGTTACGCGACCCCGAGGTGCCTGACCAGTTCTACCGCCTGTGGCTATCCC 

TCTTCCTGCACGCCGGGATCTTGCACTGCCTGGTGTCCATCTGCTTCCAGATGACTGTCC 

TGCGGGACCTGGAGAAGCTGGCAGGCTGGCACCGCATAGCCATCATCTACCTGCTGAGTG 

GTGTCACCGGCAACCTGGCCAGTGCCATCTTCCTGCCATACCGAGCAGAGGTGGGTCCTG 

CTGGCTCCCAGTTCGGCATCCTGGCCTGCCTCTTCGTGGAGCTCTTCCAGAGCTGGCAGA 

TCCTGGCGCGGCCCTTGGCGTGCCTTCTTCAAGCTGCTGGCTTGTGGTGCTCTTCCTCTT 

CACCTTTGGGCTGCTGCCGTGGATTGACAACTTTGC 

Sequence 1703

GATCGACTTCGCCTGACGGAATCCAGGGGTCGTAATATATGTAAACTCGCGTCCGNGCTG

CGTGCCCAGAGTAGTGGCCGAATACCTTAACGGGGCTGTGCGCGAGGAGAGCATCCACTG 

CAAGTCGGTCGAGGAGATCTTCGACGCTGGTGCAGAANCTGGCCGACCAGTCGGGCTTGG 

ACGTGATCCGCATTCGCAAGCCCTTCCACACANGACAACCACTAGCATTCAGGGCCAGTA 

GGCACCCCTTCACCAACAAGCTTGACCACGTGTCCCGCGGAACTANCGCACCCCCGAGNA 

GGGTTCAANGNATTCCTT GCACCC AGCCCAGATGCCCTGGCTTNTGGGGGGNCAAGTGAC 

CCTTTGTGNMCCCACTCATTTTTTATGGCAAGGTGAGCATTNCCTAAAAACCCTTGAAA 

MTGANGGGAANAACTTTCMGGGGTTTTTTACAGGGCCCTTGGI 1 1 I I I 1 1 AAAATCCC 

CAANATTTGGATAATAAATGGAATCCTCAAAAACACAAGTGGAGGAAGGNTCTTGAAAGG 

GC 

Sequence 1704 

TCGACCACGCGTCCGGCCGGAGAAACTTGAGCCGGCTGCCCCGCCCACGGTGCCCGAAGC 

CCCAAAGGCTGGAATTAGGGGCTAGAAGTCTGGCACCCACCGCCTGGCCAGGTGTTCGGG 

ACGCGACCAGGTGGGCGGTCGCCCCGCCCCGGGAGCGCGGCTTAATAGCTGAGAGCCCGG 

GGGCCAGGCCGNGGCTGCGGCCCAGGCAACGCCCTGAGGGTGGCCACGCTGNCAGGTGTT 

CCACTCCCCCGGGACTATGGGCAAGGGCCCGGGGCGGGGAGGGCGGCAGGTGCTGACACT 

GGAGCTGCGCCGGAGGTCGGGGAACTCGGCCTCCTAAGACTGAGGACACTCGCCTGCTGG 

GCCGGNCGAGCTGTGCGGTGCCCTCCGGGACGCAGGGGGCGCTGCAGCCACGCTGGGTCA 

GGCTCCGAAGGGCCCTCCCAACCCGGGGA 

Sequence 1705 

CGCCACGCGTCCGGAAAGATGGAGGTGTGGGGACAGGAGCTGGGTGTGCTGGGGACTGGC 

CGCGGACCCCTAACCTGTGTCTCCGGTCTCCCTCCGGGAGCGGCTCAACCCAGCCCATCG 

CTCTGGCCCCGTTCTGGCCCTGCAGGGTGGTGGTTGGGACGTTGAAATGAGCGCGCGAGT 

GGTACGTCCTCTCTCCGCGCTCACGCCCCCCTCCTCACCGTGTTTCCCGCCAGGACCATC 

AGCACGTGCCCATCGACATCCAGACCAGCAAGCTGCTCGATTGGCTGGTGGACAGAAGGC 

ACTGCAGCCTGAAATGGCAGAGTCTGGTGCTGACGATCCGCGAGAAGATCAATGCTGCCA 

TCCAGGACATGCCAGAGAGCGAAGAGATCGCCCAGCTGCTGTCTGGGTCCTACATTCACT 

ACTTTCACTGCCTAAGAATCCTGGACCTTCTCAAAGGCACAGAGGCCTTCCACGAAGAAT 

ATTTTTGGC 

Sequence 1706

TCGCCNCGCGTCCGCTGAAGCAAGAGAATCACTTGAACCCAGGAGGTGGAGGTTGCGTGA 

GCTAAGATCGCGCCACTGCACTCCAGCCTGGGCGACAAGAGTGAAACTCCGTCTTAAAAA 

AGCCCATGGCAGGCTGGGCGCGGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCA 

AGGTGGGCGGATCACAAGGTCAGGAGATCGAGACCATCCTGGCGAACACGGTGAAACCCC

GTCTCTACTAAAAAAAATACAAAAAAATTAGCCAGGCGTGGTCGTGGGCACCTGTAGTCC 

CAGCTACTCAGGGGGCTGAGGCAGGAGAATGGCGTGAATCCGGGAGGCGGAGCTTGCAGG 

GAGCCGAGATAGTGTCACTGCACTCCAGCCTGGGTGACAGAGCGAGACTCCGTCTCAAAA 

AACAAAAAGCCCGTGGCAATTAATGGTAAAGGAAACCCGGCTTTTAGTGTAAAGAGGTAA 

CATAA 

Sequence 1707 

GCGTCCGGCCTCCAGCAAAGCCCATTCAGTCAGCTCTGCAGGCTCATCTTACAAGAATAA 

TCCCTTTGCCAGCTCAATCTCCAAACATGGGGTTTCTTCTGGCAGCTCTTCCTCGGGAGG 

AACACCAGTCCAGAGTTCTGTTTCTGGGAGCCTGGTCCCTGGCATACAGCCTCCCTCCGT 

GGGACAGGCCACCAGCCGACCCGTCCCAAGTTCAGCAGGGAAAAAAATGCCTGTTTCCCA 

GAAGTTGACTCTGGTAGCCCCTCCAGGCGGTCCAAACGGAGATTCCAGTGGTGGGACCCA 

GGGGAGTGGCAAAGTTGCTGACCTCGCCGTCCCTAAAGCCCTCTGCAGTTAGTAGTGTGA 

CATCGTCTACCTCCTTGTCAAAAGGAGGCGAGTGGGGACTGTGCTGCTGGCCGGCTCCTC 

TTTGATGGCTTTACCCTACAAATCCAGCAGCCCAAAGCTGTCTGGGGCCATGAGCTCGAA 

CTTCTTGGGAAATTATAC 

Sequence 1708 

CACGCGTCCGGGAAGCGGTGCGCTCCGTCACCACGGGAGTGCGGGAATCCGCCGTTTGCG
CTGAGGCAATGGCGGCAGCTGCGCCGGTGGCCGCGGACGACGATGAGCGGCGGCGGCGGC

CGGGGGCTGCACTGGAGGACTCCCGGTCCCAGGAAGGGGCAAATGGTGAGGCCGAGTCAG 

GTGAGCTCAGCCGGCTTCGGGCTGAGCTGGCAGGCGCCCTGGCAGAAATGGAAACCATGA 

AGGCTGTGGCAGAGGTGAGCGAGAGCACGAAGGCCGAGGCTGTGGCTGCGGTGCAGCGGC 

AGTGCCAAGAGGAGGTGGCCTCGCTGCAGGCCATCCTGAAAGACTCCATCAGCAGCTATG 

AAGCCCAGATCACCGCCCTGAAGCAGGAGCGACAGCAGCAGCAGCAGGGACTGTGAGGAG 

AAGGAGCGGGAGCTGGGCCGCCTGAAGCAGCTGCTGTCCCGGGCCTACCCCCTGGA 

Sequence 1709 

CACGCGTCCGCGGACGCGTGGGTCCGCGGCGGCGTCCGGGGTCTCCAGTAGGGCTGACGC 

TCCGGTGCTCGCACAATCCCCCGCCTCGGCTGGCAACGGGCGTCCCTCCACTCCCCGAGT 

CCCCGGCAGCCGCCGCCACCCCAGCGCGCCCCGATCTGGCCCCCTGCCCCGCGAAGATGG 

CTGCCGTACGCCGGGCCCGCAGTTATTGCCGCTGCCTGGTGCGCTTCTCCGACCGAGAAC 

TCTGCTAAGCTCCGCTGCAGAGACAGGCAGGAGTAGACACCCGGACACCCAGCACCCCTC 

CTTCGGGGGGCGGTGCAGAGGGGGCACGGAGAGCCCCTCGAGCGCAGCAGGCCGCCCCGC 

CAGCATGGCAGAAGCTGAGGAAGATTGTCATTCTGATACTGTCAGAGCAGATGATGATGA 

AGAAAATGAA 

Sequence 1710 

ACGCGTCCGGCGAGTGGCCTTCCCGGTTGGCGCGCGCCCGGGGCGGCGGCGCTGGAGGAG 

CTCGAGACGGAGCCTAGTTATGTCTGGGAGGCGAACGCGGTCCGGAGGAGCCGCTCAGCG 

CTCCGGGCCAAGGGCCCCATCTCCTACTAAGCCTCTGCGGAGGTCCCAGCGGAAATCAGG 

CTCTGAACTCCCGAGCATCCTCCCTGAAATCTGGCCGAAGACACCCAGTGCGGCTGCAGT 

CAGAAAGCCCATCGTCTTAAAGAGGATCGTGGCCCATGCTGTAGAGGTCCCAGCTGTCCA 

ATCACCTCGCAGGAGCCCTAGGATTTCCTTTTTCTTGGAGAAAGAAAACGAGCCCCCTGG 

CAGGGAGCTTACTAAGGAGGACCTTTTCAAGACACACAGCGTCCCTGCCACCCCCACCAG 

CACTCCTGTGCCGAACCCTGAGGCCGAGTCCAGCTCCAAGGAAGGAGAGCTGGACGCCAG 

AGACTTGGAAATGTCTAAGAAAGTCAGGCGTTCCTACAGCCGGCTGGAG 

Sequence 1711 

CNCGCGTCCGAAGGCACAGGCGTCTTGCTCTGTTGAAGCAAGTCAGTATCCGAGAAAACT 

GCTGTTCCCTTTGTTGTGATGAGGTAGCAGACACACAATTGAAGCCATGTGGACACAGTG 

ACCTGTGCATGGATTGTGCCTTGCAGCTGGAGACCTGCCCATTGTGTCGTAAAGAAATAG 

TATCTAGAATCAGACAGATTTCTCATATTTCATGACACATGTGAAGAGGCATCGTGGACT

MM ICTACTCAATTCCAGCCAATGTTGAAAAGAAAAAGAAAAAAAAAACTCTAATCAGT 

TGTACACACATTGAAACTTATAGCCATGGCCAGATTTTATGCTAAAAATGGTAGTTTGTC 

AAAGACAAAATTCTCTTAGAATCTMTCCAACTTGCCAGCCCTGAGAAAATCCCTTTTAA 

GGCCAAGGGAAAGCTGAATGCTAGCAGCCAGGCCTGTGGTACTTCCATGAGAAACCATAG 

CAGGACAATGCCCTC 

Sequence 1712 

CCACCGTCCGGGCGGCCAGAGGTGCGAGAAGGCCGAGGAGAAGGCCAAGGAGATTGCGAA 

GATGGCAGAGATGCTGGTGGAGCTGGTCCGGCGGATAGAGAAGAGCGAGTCGTCGTGAGC 

GCGGTCGGCGGTTTCCAGCCAATGGATTCTGGTCAACTGGTGGAGATTGGCTGACACCCT 

GGAGAAGCCGAAACCAGAGAGCCTTTTGTT7TCTC I INI I CCTGTCTATGCTCTGTCTC 

ACTTAACACT ACGTT TTCTGCTATGGTCTGTGGTTGATGACCTCMTATGAGTTTCGATT 

GTTAACCGTGTTTTTGTTTGGGAAGTAATTTTGTTTGAAAATGCTCTCACATACAGGAAT 

TAGGGCCTAGATTGTAAGCTCTTGCAGCAGTCACATTTGTTCCCGGGCTTTGGTGGTTAT 

TTTCTAAATTTTTGAGGTGCCTTTGCTATTTCTTGTGTGACCTGATAGCTTCCCCTG 

Sequence 1713 

GCGTCCGAGCCTCTGGGGGTGGATCCTGAAAGGTGGTCCAGCCGCCTGGCCCTGCGTGGG 

ACCCTCCACCTGGCAGCAGGGTCTCGCTCTGTCACACAGGCTGGAGTGCAGTGGTGTGAT 

CTTGGCTCATCGTAACCTCCACCTCCCGGGTTCAAGTGATTCTCATGCCTCAGCCTCCCG 

AGTAGCTGGGATTACAGGTGGTGACTTCCAAGAGTGACTCCGTCGGAGGAAAATGACTCC 

CCAGTCGCTGCTGCAGACGACACTGTTCCTGCTGAGTCTGCTCTTCCTGGTCCAAGGTGC
CCACGGCAGGGGCCACAGGGAAGACTTTCGCTTCTGCAGCCAGCGGAACCAGACACACAG

GAGCAGCCTCCACTACAAACCCACACCAGACCTGCGCATCTNCATCGAGAACTCCGAAGA 

GGCCCTCACAGTCCATGCCCCTTTTCCCTGCAGCCCACCCTGCTTCCCGATCCTTCCCTG 

Sequence 1714 

GTCGCCACGCGTCCGCAGAAGATTGACAAATCTGAGGGCCGCTTCCATGTCCAGAACCTT 

AGCCAGGTGGAGCAGGATGGGCGGACGGGGCATGGACTCCGCAGATCTTCCAAGTTCTGC 

TTGAAGGAGCACAAAGCCCTCAAGACGTTAGGCATCATCATGGGCACTTTCACCCTCTGC 

TGGCTGCCCTTCTTCATCGTTAACATTGTGCATGTGATCCAGGATAACCTCATCCGTAAG 

GMGTTTACATCCTCCTAMTTGGATAGGCTATGTCAATTCTGGTTTCAATCCCCTTATC 

TACTGCCGGAGCCCAGATTTCAGGATTGCCTTCCAGGAGCTTCTGTGCCTGCGCAGGTCT 

TCTTTGAAGGCCTATGGCAATGGCTACTCCAGCAACGGCAACACAGGGGAGCAGAGTGGA 

TATCACGTGGAACAGGAGAAAGAAAATAAAC 

Sequence 1715 

CCCCCGTCCGCTTTTGTTNATCTAMGGCTTNAGTCCCATTTTTTTA 

AAAAACGTTTGAAAGGAGTCTTACACCTGTATCATGAAMCTGMTCCTTTTGAAATACC 

ACTATATGMGAGAGAGATGAAATTTAGTGAACAGAATTGGAAAAGGTGCTCATAATTTC 

ACTAT GCAM CTTACCCCAGTCTCTAAAAMGTAATTTAGATTTAAAGTTCTTTGATGTA 

TTTGATTTTCTAAATCTTTATGGTTATGATTTGGAATAAAATGTGCCTAATCCTGTGTTA 

CATTCTGTTCTTAAATCTGAATGCCTTCTCATTTAATTCTGAGGAAATATCACACAAGTG 

TCTTCAT TGAC CTTGMGAMTGTATATACAGTTGCCTTATAAAACAACATAMTTTAGA 

CCATMCTTTTATAGAGAAAGGGTTTTGTCAAATGTTTTCTGAAAATCTGAGTAATTCAA 

AGCATGCCTCTGCCCCCTTTAATA 

Sequence 1716 

NGTCGCCACGCGTCCGGCGCTCTCGGCCGCCGCCGCCTCTGCGTGGGCCGGCCGGGAGGG 

CCTCGGGGGACTGACTGACAGAGTTTCACTCCTGTTACCCAGGCTGGAGGACAATGATGT 

GATCTCGGGTCACCACAACCTCCGCCTCCCGGATTCAAGCGATTCTCATGCCTCAGCCTC 

CCGAGTAGCTCAGATTACAGGCATGTGCCACCACGCCCGGCTMTTTTGTATTTTCAGTC 

GAGACGGGGTTTCCCCATGTTGGTCAGGCTAGTCTTGAATTCCCGACCTCAGGTGATCTG 

TT CGCCTC GGCPTCCCAAAGTGCTGGGATTACAAGGCGTGAACCACTGCACCCGGCGAGG 

CATTTTTTACTGTCTACAGAAACTTATTGTMTTCATTTTTCCTCACTCCAAGTAGTAAG 

AATTATACCAAATTGAAAAGATATGAATGAGTATCCTAAAAAAAGAAAAAGGGA 

Sequence 1717 

CCGAGGCNCTGATAAGCCNTGGTAACGGGAAACACAGCTCTAACCTCACCTCATTCTCCA 

GGTTACAAAGGCCATGTGCCCCTTTGAATCTGGCAGAGAAAGTTTCCTCGTTGTAAGTAT 

TTGCATCTACTTCAAGCCAGATTCTTCTGCCTCTTTCTCCTTTCCAGACCCCTACTCTGT 

GCAGTGCTGACCACAGCTAGAGCCACCGCCCCATTGCTCAACCAGTATTTATTTCCCTAA 

ACGACCCTTCCTCATATTCCCTTCCCTCCACCTCTCCTTACCAAGCACCCAAAAGAGGAT 

TTAGMCTAGCAGGGTGGACATCATTCTGGTTGTTTCTACTTTTCTCTGCCTAGCACAAA 

ATTGGGAGAAAACTGGAGCCTCCATCCGCAGTCACACGTGTACAGATCTGGGNGATTTGG 

ATGTAGGCTTTTTCTAACTTCTCTCTCAGAAGCTTCTACA 

Sequence 1718 

CGGACGCGTGGGTGCCGCCGCCGCCGTCGCTGTCGTAGTCGCCGCCGCCGCTGCCGGAGA 

AAGAGCACGAGCGGGGAAGCCCCAGAGTGAAATCTAGCATCCTGCCGGCTGGTCTGCCCG

CCCCTCCTTCCTTTTCCCCCCGGCCCCNGTCCCCTCCCNCCGCAGGTGCCATCCGTCGCC 

ATNCGCCTCTCTACCCTCNCATCCCCAGGTGAGGGGGGTGAGTTCAGGAAGCGGNNACCC 

CNAGGAACCCANCAGGGTCACCATTTGCAGCGCAACATGGCAGGAGCTGGAGGAGGGAAT

GATATTCAGTGGCGTTTTTTTCAGGTGAAAGGAGCAGTATGATGATGATGTAGCAGNAAG 

CANGATATNATTTCTACAGTANAATTTAATCNTTTCTGGGAGAAATTCTAGCAACANGAA 

GATAAAAGGTGGGTAGAGGTTGTCATTCTTTCAACAGGGAGCAGGGAGAAC 

Sequence 1719

TCCGGCCGGCCAGCGTCATCACCATCGTCAAGTCCACCCCGGGCTCGGGGTCTGGCCCCG

CCCACGGNCCGGACCCCGNCCACGGCCCGGCACNGCNGCTCCTAGTGCCCGCCCCCGCCA 

TGCCTCANCCACGCCCCACCGGCCATAGNTNCCCTCCCCCAAAGNGCNCTGAGGCGCATG 

NCTTCNTGNGACNCAANGNTCCCATTNTCTATTGCAGGCAACATGGCCATTCCCCGAAAC 

TAAAAAGCAGTTGGGGNGGGNGAAAGTACNANGTGGAAAACCCAGNNATCACCGGNANTG 

GNGGGAMACAANGGCCNGNAGGGACACATTTCCAACTTTAAGCTGGGCNAGTGNNTGGG 

GAACCAAAAAACCTGGGGNGGNCCCNTCANTGGCANGGCCCNTGAGNNCNCAGCNCATGN 

GCATTTNCAGGGGNNGAACCAGNGGGACAAGGGGGACCTCAANAAAAATGNTGGNTGGGG 

ANGGGCCCANTCNCTGGGCNNGGGGGAAGAACCCGNATTCTACNAGCAAAAGGGGANGGG 

GGGGACNCAGCAAAGGCAATCANGGGAAGAAGGAAGATAAGGGTCNACCANCGGGAATNG 

GCAAAGAAANGGGGGAA 

Sequence 1720 

CTGANGCTCGTTTTCGTGAAATTAAGCTTCAGAGGGAAGCCCGNGAAACACAGGAGAGCG 

AGCGCAAGCCCCCACCATACAAGCACATCAAGGTGAATAAGCCTTACGGGAAAGTCCAGA 

TCTACACAGCGGATATTTCAGAANTCCCTNAGTGCAACTGCAAGCCCACAGATGAGAATC 

CTTGTGGCTTTGATTCGGAGTGTCTGAACAGGATGCTGATGTTTGAGTGCCACCCGCANG 

TGTGTCCCGCGGGCGAGTTCTGCCAGAACCAGTGCTTCACCAAGCGCCAGTNCCCAGAGA 

CCAAGATCATCAAGACAGATGGCAAAGGGTGGGGC 

Sequence 1721 

CATGGCTTCTGCGAGAAAAGTGATTTAGGCAGACGGAGGTTTTTTCTCAATCAGAGGCTT 

TCAGTAACTCTGCTGATGCACAGAGAAGAGACTTCCTCAGCCTGCAGGCTACAAGAGCCA 

ACTGTTAGTGCAAAAAAGGACTTTAATACAAATTTCTTATTCCAGAAATTTTGTTCCAGG 

TCTGGACAAGCTGAGAMTTTATCATTGTTTITCGAGTTTTTMGATACCCACTTTT^ 

GAGAGGTATGGGTGTGTGTGCAAGGCACACACATACAGTCTTTCTGTACATGCATGCATA 

TTTATGCATGTACAGGGMGTATCCAGACACCAAATTTTTAATAAAATGAATTCCCCAAA 

GGGGAGTCTTGACCTGAATTAAGGCTGTTGTTTATAGGGAAGCCAGATATAATTGATGNT

GAAAAANAAACTAATTTTTATACTTAATCACCGGCAGNTANCGGGGGCANGGGGGAAAAA 

GTACAGANGGGGTGT A I I I I M GG I M I 1 1 I CT 

Sequence 1722 

TCGCCACGCGTCCGCTCTTAACACAGAGTCTGCAGCCCCTMCTGACACCCTGTCCTTCC 

TCCTAGGAAGTGCTGGACTCCCTGGTCAGCAATGTCAACATTGAGCTGCTCAATGCCCTC 

CGCTACCATATGGTGGGCAGGCGAGTCCTGACTGATGAGCTGAAACACGGCATGACCCTC 

ACCTCTATGTACCAGAATTCCAACATCCAGATCCACCACTATCCTAATGGGATTGTAACT 

GTGAACTGTGCCCGGCTGCTGAAAGCCGACCACCATGCAACCAA 

Sequence 1723 

ATCCGGTTCGCCCCNCGTCCGGGCGGCCGAGGCGGAGGCAGCGGCGGCGGGATGGCGGAC 

GCCAACAAGGCCGAGGTGCCCGGGGCCACTGGTGGCGACAGCCCGCACCTGCAGCCCGCA 

GAGCCGCCGGGCGAGCCGCGGCGAGAGCCGCACCCCGCGGAGGCGGAGAAGCAGCAGCCG 

CAGCACAGCAGCAGCTCCAATGGCGTTAAAATGGAGAATGATGAATCAGCAAAAGAAGAG 

AAATCTGACTTAAAGGAAAAATCTACAGGAAGTAAGAAGGCCAATAGATTTCATCCTTAT 

TCAAAAGACAAGAAT 

Sequence 1724 

GTCGCCNCGCGTCCGTGCTTTTTTTCGACAT 

TTCTTCTATTTCTTGTGGATATTATGGCTAATAACACAACAAGTTTAGGGAGTCCATGGC

CAGAMACTTTTGGGAGGACCTTATCATGTCCTTCACTGTATCCATGGCAATCGGGCTGG 

TACTTGGAGGATTTATTTGGGCTGTGTTCATTTGTCTGTCTCGAAGAAGAAGAGCCAGTG 

CTCCCATCTCACAGTGGAGTTCAAGCAGGAGATCTAGGTCTTCTTACACCCACGGCCTCA

ACAGAACTGGATTTTACCGCCACAGTGGCTGTGAACGTCGAAGCAACCTCAGCCTGGCCA 

GTCTCACCTTCCAGCGACAAGCTTCCCTGGAACAAGCAAATTCCTTTCCAAGAAAATCAA 

GTTTCAGAGCTTCTACTTTCCATCCC 

Sequence 1725

MTNTGCCAGCCTTTATTGAGCTTTACAACTAACAACCTGATAGTTGGCAGTTAATTCAC

AGTTACAGA TAATG CTnTATTTACATAMTATACCAAGTAGTACCCTCTTATTGTATTC 

ACTTCATCTATTTTCTTAGMTACTTGCMTTTCTMTGACCCCTTCCCTTTCCCTCCTG 

CTGCCCTGTCCACCCTCTTTCCCCTTCTAACATCCTTAGAGGGATGAAATCTCAGCATAT 

GTTGCAGGACACCAAAAGGAAGAAAACAATCAAGCAAATAAAATAAACAGTCAAACAAAC 

CAGGAGTTTAAAACAACAACCCCAACAACAGAAGCCTTGGCAAAGAGGAATGAGTGATCA 

GCMGTGAACACACTCTATGTCMCTCTCCTTTTTATCCAGCTGAGATTTTATGGTAACC 

TTTAATTTAA 

Sequence 1726 

CCNCGCGTCCGGAGCCGAGAGTGTGTGGAGCAGTTACAGCTGGAAGACCGGGTCCTCTGC 

CTCCACAGTAGATGGCTGAATCCTCTATGCGGGACTGGCAAATGGCACTGTGGTCACCTT 

CAACATAAAGAACAACAAACGACTTGAGATCTTTGAATGCCATGGCCCTCGGGCAGTCAG 

CTGTCTTGCTACAGCTCAGGAAGGTGCCCGAAAACTGCTGGTCGTGGGGTCTTATGACTG 

CACAATTAGTGTACGCGATGCCCGGAATGGACTGCTCCTCAGAACTCTGGAGGGCCATAG 

CAAAACCATTCTTTGCATGAAGGTGGTGAATGATCTCGTGTTCAGTGGCTCCAGTGATCA 

GTCAGTCCATGCTCACAACATTCACACTGGTGAGCTCGTGCGGATCTATAAAGGTCACAA 

TCATGCAGTGACTGGTGGTGAATATCCTAGGAAAAGTGATGGTGACTGCTTGCCTGG 

Sequence 1727 

CNCGCGTCCGGATNMTATTTTCATCCCTGAGGTTAACAATTACCATCAAMTGTTTTGT 

GGAGACTATGTGCAAGGAACCATCTTCCCAGCTCCCAATTTCAATCCCATAATGGATGCC 

CAAATGCTAGGAGGAGCACTCCAAGGATTTGACTGTGACAAAGACATGCTGATCAACATT 

CTGACTCAGCGCTGCAATGCACAAAGGATGATGATTGCAGAGGCATACCAGAGCATGTAT 

GGCCGGGACCTGATTGGGGATATGAGGGAGCAGCTTTCGGATCACTTCAAAGATGTGATG 

GCTGGCCTCATGTACCCACCACCACTGTATGATGCTCATGAGCTCTGGCATGCCATGAAG 

GGAGTAGGCACTGATGAGAATTGCCTCATTGAAATACTAGCTTCAAGAACAAATGGAGAA 

ATTTTCCAG 

Sequence 1728 

TCGACCACGCGTCCGATCCTGGATCTGGAGAGAGAGCTCTCCAAGCAAATCAACGTGTGC 

CTCTGAGCCAGATGACGGGGTGGGACCCCGGTTAGTAAGGACCGGGCGCCCAGTGGCTAA 

GGCGGTGCCCTGGTGACCAAGGAGAGCCAGACCTGTTGCTCAGGCCGAGCTCCTGGTTGC 

CAGCGAGTTACCACGGGACCAGTCGCGTGTATGGCTGAGACTCATTCCCAGTTTCCAGGG 

CCCGGTATTTGGACACTAGTTGCCMGTCTGGGGCCTGGGGATTTTAGGGACCAGCGGTT 

GTGACCATCTTTCCTGAGCACCAAGGGCTTCCCCTTTTGTTGCCAAAAAGGTAGTTCTCG 

CGCTTGCTAGGCTGGCCTCTCTTGCCTCCCCTTGGCCGGGGC 

Sequence 1729 

TCCGAMCACCTGTCATTTTACACAMTGCGTTTTGAATGTCTGAAAGACAGCTCCTGCC 

CTTMTTTAGATGTAAACCATTTAGTTTCAAACTAACCACCTGATAAAATCTATCAACAT 

TTTATCATGAACTAGAGCAGATGTCTGTTATTTGATGTCTATGTTATTTGAGTTTACTGT 

TTMTMGTGAATTCATATCMTTMTCCTGCTMCMATTTGACACTTAAGGTGATTCT 

GMMTCCTTTAMCTTAAAGTAGATGGMTCTTAAGTATGGGGCCTTTTAGTGTCCGTA 

AAGAAAAACTGCATGCAACAAAATATAGCAGGTCCTCACTTGTTGAGATTCATGGAAATT 

GTGACTTTAAATGAAATGACATGGCTGGGCATGGTGGCTCACACCTGTAATCAGCACTTT 

GGGAGGCCACGGCGGGTGGATCACGAGGTCAGGAGTTCAAGACCAGCCTGGCCAACATGG 

T 

Sequence 1730 

CTGNGAGTTAAATTGGTCCAGAAACAGTTATGACCCTCTTTATACTGCCAAGAAATACGC 

AGTCCCAGCCTTGGAAGCACACTGTGTAGAATTTCTCACCAAACATCTTAGGGCAGATAA 

TGCCTTTATGTTACTTACTCAGGCTCGATTATTTGATGAACCTCAGCTTGCTAGTCTTTG 

TCTAGATACAATAGACAAAAGCACAATGGATGCAATAAGTGCAGAAGGGTTTACTGATAT 

TGATATAGATACACTCTGTGCAGTTTTAGAGAGAGACACACTCAGTATTCGAGAAAGTCG

AC I I I I IGGAGCTGTTGTACGCTGGGCAGAAGCAGAATGTCAGAGACAACAATTACCTGT
GACTTTTGGGMTAMCAAAMGTTCTAGGAAMGCACTTTCCTTAATCCCGGTTCCCAC

TGATGACAATTGAGG 

Sequence 1731 

ACCCCGCGTCCGAGCAGCCTCCAGTTGCCCCTACTTAGTGGCTTGCCCTCTGCCTGCCTC 

AGCTGCTGCCTGACCGGCTGGGGGAGGCACTGGCGGGAGGCCTCGGGCTCCCCTGGAAGG 

GCGCTGGGCTGGCGGGTCAGCTGGTGGTTCTTAGGTTTCCTTCTGTTTGTTAAAAGGGAC 

MTGTGGCCACTTCTCTGTGGAAAGGGAGTTGGTTGGGGGGTTGAGATGGCCCGTGTTCA 

TMCTCAGTTTCCTGTTTTGCACGATGTAAMACCCTGTCTTTTTGCACGATACAGCCAA 

MGTATTGGCTGATTTCTTGCTGAGTGCCCTCTTAGTTGGTGTGTGAGGTCTTGGTGGGC 

TCAGGCCAGCTGTTTGCGAGTGTGGGMCTCATAGGTTCTGTCTTTGTCTCTTCCTTTCA 

CCTCATTCTGGTAGCAGCATAAAGGTTAGGCAATCACTGGGACCC 

Sequence 1732 

GCAAATCATACATTGCATTCCCCAAAGCATCTGAACGTACTTCTAGAAAACAAACCAACC 

AAAAGGGAAMTATGCATGCTTTTTGTAATTAACTGGTGCTTTGAAAATC I MM IA AGG 

GAGMAAATCTCMCCAMGTTATGCTCATCCAGACMGCTGACCTTTGAGTTAATTTCA 

GCACAACTCATTCTTCAGTGCCTCATGACTGAAAACAAAAAACAAAAAAAGAAAGCATCT 

TCNCMTGMGCTTCCANATAGCACCGTTTTGCTAAMGATACATTCTCATTGTTTTCCA 

ACAGNGATGGCTTCCACATAANGGTTAAACAAACTGGGNGCTTGGAAATAATTTNTNACN

GGTTACTTNTTCGCATTTTTTNGAACNMGGAAANGGGATTCCCTTTTTTTA 

GAATTGNGGNCAAGTTTAAAAAAAAAAAAAAAAAAAAAAAAA 

Sequence 1733 

CGCGTCCGAATTTAGGAAGACCCCGGCGACCTGTTCCTCACCCCCGCTTCGCCCTCACAC 

TTTCGGGATGTCTGCGATTCCTGCTGAGGAGAGCGACCAGCTGCTGATCCGACCCCTTGG 

AGCTGGGCAAGAAGTAGGAAGATCATGTATTATTCTCGAGTTCAAAGGAAGAAAAATAAT 

GCTCGACTGTGGGATCCACCCTGGCCTAGAAGGAATGGATGCTCTTCCTTATATTGATTT 

MTTGGACCCACCTTGGMTTGNACCTCCNANTMNTNANNNCATmCATTTTGGAANC 

^GGGGGGGCNTNNCCCCNNGGTTTTTCNNAAAMNNCCCCTTNAAANGNNAAAAAAT^ 

TTTTNTTTTTTNTCCCAN AAAAAA I M 1 M I I NAAAANNCCCNCTNNTNTNAGGGGNTNT 

TNAAAAAATTNCCCAAAMMNNNNCCCCCCTTTNTNTNTTTTNAAAAAAAAAAAAAAAA 

AAAANATNGNCCCCNNAA I M M I M M I ATATTTTTNANNAMANAMNNTNTTTTTTN 

AAAAAANTNTATN \ WWW I CNNTNNANNNNGTGNNNNCCCNNN I I M I M M I 1 1 

Sequence 1734 

CCACGCGTCCGCTCCCGCCAGGCGCTTTCTCGGACGCCTTGCCCAGCGGGCCGCCCGACC 

CCCTGCACCATGGACCCCGCTCGCCCCCTGGGGCTGTCGATTCTGCTGCTTTTCCTGACG 

GAGGCTGCACTGGGCGATGCTGCTCAGGAGCCAACAGGAAATAACGCGGAGATCTGTCTC 

CTGCCCCTAGACTACGGACCCTGCCGGGCCCTACTTCTCCGTTACTACTACGACAGGTAC 

ACGCAGAGCTGCCGCCAGTTCCTGTACGGGGGCTGCGAGGGCAACGCCMCAATTTTTAC 

ACCTTGGGMGGTTTGCNACNATTCTTTGTTTGGANGANTAAMMAGGTTCCCAAAATT 

TTCCCNCTTTNAAAAGANNNTNGNGACCNNNCACTNGGGGGGGGGCCCCAAAAAAAAATT 

ITTm I NTTAANNCCCCCCCGGGGNNGNAAAAAN I MUM I GGGGGGGGNCCCCCCCC 

NNGGGNNAAAANNNTTTTANAAAAAA I I M M M 1 1 I NTTTTTCCCCAAAAAAAAATTTT 

T 

Sequence 1735 

GCGTCCGAAAATACAATACACGGACTCTTTCGAGGCCCTGTAATTGGAATGAGTCCACTT 
TAAATCCTTTAACGAGGATCCATTGGAGGGCACTTCCAGAATACCTCCTCCCCCAGCGCC 
CGCTGCCAGCCCCACACCAGGTGTGAGAACCAAGGTCTGGTGGAGGCAGCTCCAGGCACT 
GCCCAGTCCGACACAACCTGCAAAAATCCATTAGAGCCACTGCCCCCAGAGATG 
Sequence 1736 

CGAACCTCCTGGTTCCAAGTGGGAGACATGGTGCTGTCGGAGCTAGCGGCGCGCCTCAAC 
TGCGCCGAGTACAAGAACTGGGTGAAGGCGGGCCACTGCCTGCTACTGCTTGCGCAGCTG 
NCTGCAGGGTTTCGTCGNCCGCNAGGTGCTCTCCTTTCCACCCGCGGCCTACTCGCCGCA
GCCCCCGGC
Sequence 1737 

ATCCTTTTGCCTAAAGATGTAAACAAAACTCAAGACAGAAGGAATCAGGGAATATGTGCT 

NTTGTGNGCATCTTGTTTACATTTNGGGATCAANTGATGGCAAAAGAAGTAATGAGACCA

CTNN AAAT TGTTTTNCANTTGNNTTTAAMNACCAGGGTTCCTCATTTTCTTTG 

NMGTTTTMCAATTGACCTTCTTMGNGACATTCTCTTTCAAAAAAGANANGTAAANCA 

GGNGAAATGAAGGGTGGNTGGGGAAA 

Sequence 1738 

CCGGCTCATTCCCTGAGGCCGGCCCGCGTCCCGTCAGGGCGCCGCGCGGGGTTAGCGCGG 

GGTCAGCGGAGGTCAGCGGGGGTCAGCAGCAGCGGCTCCGAGGGCGCGGCGGACGCAGGA 

TGTACACGCTGCTGTCGGGCTTGTACAAGTACATGTTTCAGAAGGACGAGTACTGCATCC 

TGATCCTGGGCCTGGACAATGCTGGGAAGACGACCTTCCTGGAGCAGTCGAAAACCCGAT 
TT 

Sequence 1739 

GTCCGNCCACGCGTCCGATTTCTTGTGTGCTTTGAAAAAAGTTTCAGCTTGCTGTCTCTT
TTAGTGTTTTAAAGAAGTGTTATACAAAGCATTGTTTGCAAAATATAGGGAGGATAATGG 
GAGTCC CACTT TMN TTNGG GAANTCNTTGGGNGANCTTNTTNATCCAAGGTTTANTCAA 
GCCTTCNTTTTCCMCTTTTAAAAAATTTTTTGTTM 

AAAAAl l l l I AAAMTATTTTANTGTTCTTGCNACCAAATTTGTTCTTNAAAAAATAATA 
AA AACTTG NTG G CAAATTCNTTTGGTCN NTTTAAAAAAAAAAAAAA 
Sequence 1740 

ACGCGTCCGCAGCCATCTTGGGATCTGGGCAAGTGAGCGAGCTCCTTCCTCACCGGGCTG 

ACTAGCCTCTCCTTTCCCTGTCCCCCTCCATCGCTGCTCTGCAGGAAGCCAGCCCCCAGG 

GCCAGTCCCGGAGGGGCTGATCCGCATCTACAGCATGAGGTTCTGCCCCTATTCTCACAG 

GACCCGCCTCGTCCTGAAGGCCAAAGACATCAGACATGAAGTGGTCAACATTAACCTGAG 

AAACAAGCCTGAATGGTACTATACAAAGCACCCTTTTGGCCACATTCCTGTCCTGGAGAC 

CAGCCAATGTCAACTGATCTATGAATCTGTTATTGCTTGTGAGTACCTGGATGATGCTTA 

TCCAGGAAGGAAGCTGTTTCCATATGACCCTTATGAACGAGCTCGCCAAAAGATGTTATT 

GGAGCT 

Sequence 1741

CANCGCCCCGGCCCGNTCAAGCAGCNTAATAAAGCTCATANAGGCGGACNGGCATCNGGG 

TCTNGGGATCTGCACAGCGGGACTGGCATAGGGCCGTCTGGCACTGAAAACCCTAANCAA 

GAAGGTGTGAAAAGAAACTTNAGCNGANNCGACCNGAGGCATCNNCGCCAGCCAGCTCCG 

AAAGCAGAAGANGGAGGCGGTTCTGGCATGAGAAAGANACAGCTGGGCTGGCNAGGATAG 

GCCCCTCCTTCATCAAGGTACTGGTGGTGCCCCTGCACAGNANAATTNTNCCTGCCAGAG 

GCCATGCANCCTGCTTCAAGATAGGGACACTGGAACAGATACACTTGAATGAATTGGGGA 

AACACCCCAGAACTTTT 

Sequence 1742 

CACGCGTCCGTTTTTTNCMGGGTCTATTGTTTCGATTAGTTTCCTTGCAGGAGGTAGAA 
GGTTTCCTCCATCCCGAGTAATTCATAGAGTGTTTGGCTTGNCTACCTTCCTCCTGACTG 
AAGTCACCTGATACTTTTTGTTTTTTCAAGAAGGMGAGAGAACCCTGTTGCCTCAGTTA 
CTAGCAATGATACMTTCTCAAMTCTGGTCTTTTTTTGmCTTT 

ATGTTGTGTGACACAGCAGCCCCTGTCTTATCATAGTTGTCTTCCCTCCACCACCTGTAC 

CAGAGATGTTGGATATGTTGGAGGTGMGGTGTGCMGGTTTTTAACTAACTGTTCTAAT 

T AMGGATT CTGCAGGAMGAACATGGGmACAAMGAGMGCTTTTGTATTATTAGTA 

A I u I I I I I CTTTGATGMTTTATGTGCTTAGTTTGGAGAATCGAGAGTTGGCTGGGAAA 

AGATTTCTGAGGAGTTAAGGGACTCTGGTGCTGTTTGGGAA 

Sequence 1743 

GTGATTAGAAGTAAGCGNTGATGAGGCTGAAGAAAAGGAAGACAAAACTGAGTACTTGGA 

GGAACGAAGAGTMTGGGATATCCAATAAACTGAGMTGTTTCTTCACAATCTCCTTTAT 

TCTTCGTTCCTCCAAGTACTCAGTTTGGTCTTCTTTCAGGTGTAGGAACGTGATAAAGAA 
GTAAGCGATGATGAGGCTGAAGAAAAGGAAGACAAAGAAGAAGAAAAAAAAAAAAAAAGG
Sequence 1744 

CGATTTCTTTGTTGGACAACCCAGCTGGGGCTAGGMTGGTTCAGAAGGTTTAAGGCCGG 

MNGGGNAATGAAGGGGCCCGGCGCTAACCCTCTAGGGACCTGTTTTGCTTCTGTTTAAA 

CCAAATGGGCAGTCTGTCATTACACACACCCTGNGTCTTCATATGTGGCTCGCCAGTATA 

ATGGAATGTGCTTACAAGGGCCAGCAGGAGTGCCTGGTCGAGACGGGAGCCCTGGGGCCA 

ATGGCATTCCGGGTACACCTGGGATCCCAGGTCGGGATGGATTCAAAGGAGAAAAGGGGG 

AATGTCTGAGGGAAAGCTTTNAGGAGTCCTGGACACCCAACTACAAGCAGTGTTCATGGA 

GTTC ATTGAATTATGGCATANATCTTGGGAAAANTGCGGAGTGTACATTTACAAANATGC 

TTTTCAAATAGTTGCTNTAANANTTTTGTTCAG 

Sequence 1745 

GGACGCGTGGGTGGAAATGTAMCAAGAATAGACTGTTCATTCCTGATGGCTTTTAGTCT 

ATACTAACATATTGTTTGTCATGGCATCCGAGACTGAAAAGACCCATGCTTTACTGCAGA 

CTTGTAGCACTGMTCTCTTATTTCCAGCCTTGGGTCTGGGGGCATTTTGCCTCGTAGCT 

GACAGACTTCTTCAGTTTTCCACAATTCAGCAAAATGACTGGCTTCGTGCTCTCTCAGAT 

AATGCAGTACATTGTGTAATTG GCAT GTGGTCATGGGCGGTAGTCACTGGAATCAAGAAG 

MG ACTGAC TTTGGAGAAATCATTTTAGCTGGATTTTTAGCCTCTGTTATTGATGTAGAC 

CACTTTTTTCTAGCTGGATCCATGTCTTTAMGGCTGCTTTGACTCTCCCGCGAAGACCT 

T TCCT TCACTGTTCTACTGTGATTCCCGTTGTGGTTCTGACCCTGAAATTTACTATGCAC 

CTTTTCAAGCTCAAAGACTCATGGTGCTTTCTTCTGGGATGGTATTTATATCC 

Sequence 1746 

GTGAACAGGI^ATTGACTATGGTAAACTTATTTTATTGMGTTTTCAACCGGAAAATAGT 

MG TGGAA TATGATACAACTGTTATTTTCAGMCATATTTCTTTAGGGCTATTTAAAATA 

ACCI I I I I AAAGGGCAAAAACTTTCAATTTGAGAGAACAAATTCCTCTCCTCTGTGGGAA 

ATATTGGCTGAGATTTTGTATAGAATAAGAGACATGTATGTAACATATATTTATATTCAG 

CATAAGTCTACTGCMTCATGTACACATCTTAGCAAGACGAGAGGATTTTGTTTAGTCTT 

TGTTTATGACTTCTACAGTTTCCTGTATCTAGTGTTAAGTTGTAAGGAAAAACTAAACAT 

GCMTTTA MGG TAMCTTGATMCTATTTATGGAACATMGCATACACCAATGGTTATT 

TCTCACAGTTTTCATGCGCATTTGTTTATTGTTTACTTGGATTAGGCTCATTAAAAACCA 

TAATGCTGGTCACAATTAGAATGCTAATATTTGGGGAAGCTATGCAGAAAATATTT 

Sequence 1747 

CNTGTGTGCCATGTATACCTAACGGGAGTCCCAGAAGACAGGAGAGAAAAAAAGAAAGAA 

ATAAAAAGAATATTTGMTTTAAAATTGCTTGAAAATGTCTCAAATTTGATGAAAAATAT 

TACTCTGCACATTCAACCCATGAACTATAAGTTGTATAAAATCAAAAAGTTTCACACCAA 

GGCGTGTCATAGCCAAACTGTCAAAAGCCAAAGACACAGAATCTTGAAAGCAGTGAGAGC 

AAAGCAGACAAGGGATCCCCAATAGGATTAACAGCAGATTTCTCATCAGAAGCCATGCAA 

GCCAGAAGGCTATGGGAGACATACTCAAAATGCTGAAATAAAAGACTGTCAACAAACATT 

TCCACATCCAGCAAAAATCAAAAACGAAGGAGAAATCTGTTGCATGTGAGCTGAATAGAA 

TTTGTATTCTGCTGTTGTTGGATTGAAGTATTCTTTMATGTCMTTANATCMTTTG 

Sequence 1748 

CGATGGCMTACATGTACTCAGATAGTTACATCCCTATATAAAAAGTATGTTTACATTTA 

A AAAA T TAGT AGATMCTTCCTTTCTTTCMGTGCACMTTTCATTTTGACTTGAGTCAA 

CTTTTGTTTTGGAACAMTTAAGTAAGGGAGCTGCCCAATCCTGTCTGATATTTCTTGAG 

GCTGCCCTCTATCATTTTATCTTTCCCATGGGCAGAGATGTTGTAAGTGGGATTCTTAAT 

ATCACCATTCTTGGGACTGGTATACATAAGGCAGCCGTGAAACTGGAAAGTCATTTTGAT 

GACTG ATGTG ATACATCCAGAGGTAAMTGCATTTAAACATATTAAAGTATTTGCCAAAG 

ATACAATTTTCTTGCTGACATAAAAATCACACAAACAAGTCCCCCCCAAACCACAACTGT 

CTCTCAAATAGCTTAAAAAAATTGAMAACATTTTAGGATTTTC^ 

AAAAAGATGTTCAAGCTATTAGAGGGAATGGTAAA 

Sequence 1749 
GCCCNTCCCGACCCTCCAGCTCATGGTGTCTGGGGCCTGCGGCTAGACTCTTGGAACATT

CTGGAACTCTCTCCTTTCCTGGCTGGGGCTCTGACCACAAACTCCCCTCCAGGCTGCCCC 

TGGGACATGGTGGTGATGTGGGTGCAGGAGCCAGTGTCTGTTGTCGGGACTCGCAAGTGC 

CCTCATCACAGCCACCCCCACCACGAGTGTCTCCCCAGTGCAGACTCAAGTTATGCTTGA 

AATGAAAAAGTCTATCTGGTAGTGGGTAAACGTAGACCTGGCACTGTTCCACGCGGGCGC 

CCCAAGCCTGCCACTCCTGTGTCCCTGCCTCCCTGGGCTCCCGAGATAGGCACCACTGTA 

TCCTCCAGCTCCTTCCTTCCTTCCCCCAGGAACACGGAGGCCACCGAGGGGGCTGGGCCT 

ATCAGGAGGACACAGGCTGCAGCCTGGCACCCACCCCTCCATCTTCACCCCACATGGAAG 

ACTTGTCTTCAAC 

Sequence 1750 

GCGTCCGGATTCTAAAMAAAMMAAAAAAAAAAAMMTTAATTGGGGTGCCTTTTTG 

TTATAGTTTCTATTTTCTGTTTTGTAGGACMGCTGCATTTTCTGTAAATATAGGTCTGG 

ACTAAAGGATACATAAAGAATGCACAAAATGTCAACATCAGCAGAGATGCCCAGATCTAT 

TTATCTCTAAGTATATTTGAAGTGATTGCTGNTTATATGTTGTCATTTTAAMTTGTGTG 

TCAGTAAAGCTACCTGTAAAATTTCAGTCCAAAAAAAATAAAGCTCTCAGGGAGACATGA 

ATAAAATCAATGAACATTAGAAAATAAAATATAGATGCTTACCATTAACCTACCAACTCT 

TMTATCC TTAMTTATGTGATATATAAAGAGGACTGTTACTTTrTTACTTTC I I I I H I 

I I I I I 1 1 I GGCTTTGCTTTATTTATTTGGAGT 

Sequence 1751 

GGGCATGCTCATAGGCACAGCTGTTGGTCAGTATGCCAATAACATCACACTTTGGATCTT 

TGCAGTCACTGCAGGCATGTTCCTCTATGTAGCCTTGGTGGATATGCTTCCAGAAATGTT 

GCATGGTGATGGTGACAATGAAGAACATGGCTTTTGTCCTGTGGGGCAATTCATCCTTCA 

GMTTTAGGATTGCTCTTTGGATTTGCCATTATGCTGGTGATTGCCCTCTATGAAGATAA 

MTTGTGTTTGACATCCAGTTTTGACCTTTCCCAGTAATCACTGTTGATTACGAGAATGT 

TACCATGCAGCTTTGCATCTGTTCCTTGTACTGTATGCACATTGCTCAAAGGAAAGTCAG 

TGGCTT GCAC TACTTACMGTTTCATAGATTTGAGCCTAACCACAAGAGGCTGGTGCTTA 

GTACTGTTTTCCCTGCACGTAGGGGTCTTTTAAAAATATAAAGCTTGTGATAAAGAGAGG 

A 

Sequence 1752 

CTGGTTCAGCAGCCGCCCACCCACCTCTGAGTCTGACCTGGMCCTGCCACAGATGGGCC 

AGCCTCCGAGACCACTACCCTCAGCCCAGAGGCCACCACCTTTAATGACACCAGAATCCC 

TGATGCAGCTGGTGGCACGGCCGGCGTGGGTACCATGCTTCTGTCCTTTGGGATCATCAC 

GGTGATAGGCCTGGCTGTGGCCTTGGTTTTGTACATCAGGAAGAAGAAGAGGCTGGAGAA 

GCTACGCCACCAGCTCATGCCCATGTACAACTTCGACCCCACGGAGGAACAAGATGAGTT 

GGAGCAGGAGCTGCTGGAGCATGGGCGGGACGCCGCCTCTGTACAGGCTGCTACTTCTGT 

GCAGGCCATGCAGGGCAAGACTACTCTGCCCTCCCAGGGCCCACTCCAGAGACCCAGCCG 

GCTTGGTGTTTACCCGATGTGGCCAATGCCATCCATGTGTGAGTGGCCTGGGACAAGC 

Sequence 1753 

GTCGCCCCGCGTCCGGTGCTCTCATGTCATCTCAGAGTTCCAGCTTATCAGAGGCATGTA 

GCAGGGAGGCTTATTCCAGCCATAACTGGGCTCTACCTCCAGCCTCCAGAAGTAATCCCC 

AACCTGCATATCCTTGGGCAACCCGAAGAATGAAAGAAGAAGCTATAAAACCCCCTTTGA 

MGCTTTCATGAAGCAGAGGAGGATGGGTCTGMCGACTTTATTCAGAAGATTGCCAATA 

ACTCCTATGCATGCAAACACCCTGAAGTTCAGTCCATCTTGAAGATCTCCCAACCTCAGG 

AGCCTGAGCTTATGAATGCCAACCCTTCTCCTCCACCAAGTCCTTCTCAGCAAATCAACC 

TTGGCCCGTC GTCC AATCCTCATGCTAAACCATCTGACTTTCACTTCTTGAAAGTGATCG 

GAAAGGGCAGTTTTGGAAAGGTTCTTCTAGCAAGACACAAGGCA 

Sequence 1754 

TCGCCCCGCGTCCGGACTGATCATAAACCATGCTGGTATTGCACCTTCTGGAACTATGGG 
CTTGAGAAAACCCCCAGGATCACTTCTCCTTGGCTTCCTTATTTTCTTGAGGCAGGTCGC 
ACGTTCTACCTGCCCAAGACGTGTGATATCAGCTTCTCAGATCCAGACGACCTCCTCAAC 
TTCAAGCTGGTCATCTTGTCCTGATGAGGGCTTCTACAAGAGTGGGAAGTTTGTGTTCAG 
TTTTAAGGTGGGCCAGGGTTACCCGCATGATCCCCCCAAGGTGAAGTGTGAGACAATGGT

CTATCACCCCAACATTGACCTNGNGGGCAACCGTCTNGCCTCAACATCCTTCAGGAGAGG 

ACTGGA MGCCAGTCCTTACNATANACTCCATAAATTTTATTGGGCCTTGCNGGTATTCT 

I I I I I I NTTTGNGAGCNCCAAACCCCNNGANGGACCCCCACTTGGAACCNAAGGGAGGGC 

NCGCAAGAAGGGTTCCTTGGCAAGAAACAAACCCGG 

Sequence 1755 

TCCGGCCCGCCCACCCGCCGCCAGCTACCATGGATGATGATATCGCCGCGCTCGTNTGTC 

GACAACGGCTCCGGCATGTGCANGGCCGGCTACGCGGGCGANGATGCCCCNCGGNCCGTT 

TTCCCTCATCGTGGGGCGCCCCANGCNCCAGNTGCGTGATGGTGGGCATGGGTCAGAAGG 

ATTTCTATGTGGGCGACGAGGNCCAGAGCAATGAGAGGCATNCTNACCCTGNATTACCCC 

ATCAGAGCACGGCATNTNTCACCAACTGGGACCGACATTGGAGAAAATCTGGCACCACAC 

CTTTCTACAATGAGCTGCGTGTGGCTCCCGAGGAGCACCCCGTGCTGCTTGACCGAGGCC 

CCC CTGA ACCCCCAANGNCAACCCGNCGAGAANGATNACCCCAGATNCATGTTTTGAGGA 

CCCTTTTCAACACCCCCAAGCCCATGGTTACCGTTTGCCT 

Sequence 1756 

GCGNNGGGGCCGGGGCTGGGGTCGGGGCCAGGGGTCGGCCGGNGGCGGNGGACGGGCGTN 

CGGTGCCTGGGCTCTCTCCGCTCCCTGAGGCGCCGCTGGGGAGTCGAACCCTGATGATGG 

GACCTGTGAATTCAGCGGGGTGCCMGTCGTTTTCTGTGTGGGTTGAGAGACAGGCTGNG 

CAGCCCACTGTTGCATAGGACTAACTACTACAAATCATGCTGAGACCGAGCTATTTTTGC 

TGCTTAGAGGCTTTGCAGCCTTGAGAGGAATTTTGTTGTCAGGAGAATAAAAGGAGGTTG 

TCCATNATTGACTTTAAGCAGCAATCAAGTAAAACATTGAGCTCTTNAGCTCCGCCTTTC 

TTGCTCTGAAMTTGGAAAACCAANAAGGTTTTGNTGTTATTGTGTGACCCCACCTGAAT 

TATAMCCAAGATGAACATAACCAAAGGTGGTCTNGGTGTTGTTTTTCAGCAAAAC

Sequence 1757 

GGATCTACCANGTGTCTNGAGCAGCAGGAGAGAGCCTTCACGCTGGGGCTAGCAGGACTT 

CTCGGCGAGGGAGNNTTTAACTTTGGAGAACTNCTCATGCACCCTGTGCTGGAGTTCCTG 

AGGAATACTGACCGGNAGTGGCTGNTTGACACCCTCTATGCCTTCAACAGTGGCAACGTT 

GAGGCTGGTTCCAGACTCTGAAGACTGCCTGGGGCCAGCAGCCTGATTTTAGCANCTAAT 

GAAA GCCCAGCCTTCTTGAGGAAACATTTCAGTTTGTTTGTTGCCTCATTGGNAGATGAC 

TTTTCACACCGACCCTGGCCNATTCACAAGACANACTTNACTTTTTTGAAGGAAAAATTG 

CCAMAMGTTGGCCTAAAMAATCNACAGGTTGAMTGGMGGGTGNGMGNCTTTTCT 

GGGGTGGATTGGAAANGGGCACCCTTTTTCGGNTNGGGGGGCNTGGGT 

Sequence 1758 

GAGCCGGCTGGCTCGAGTGGCCTTCGTCGTCCCTTGGCGCCCTGGGAGAGTCGCTGACGG 

GTGGACTGACGGACCGCCTGAGGACGGCCGGCCAGGGCGGTGAAAGCGCCAGCCCTATGG 

CGCGGGTCGCAGTGAGGCGGAAGGCCGAGGACGGCCGGCGGCGGCGCCCGCCCCGGCGAT 

GCGGGCCCCGCCCGTCGCCTNAGGTGCCATTTGGATTGTACTTTAGTGGCACCGATGTAC 

TCTGAGTGGAGGTCACTGCATTTGGTGATTCAGAATGATCAAGGCCATACCAGTGTGCTG 

CACAGCTATCCAAAGAGCGTTNGACGAGAGGTGGCAAATGCTGTAAGTCCGGCCTCTTGG 

GCAGGTGTTAAGTACCCCTTCAGTGGCTGGTAGTGAGAATTTGTTAAAAACTGACAAAAG 

AAGTA 

Sequence 1759 

GTCGACCACGCGTCCGGTCTCGGCGCCCGCTGCCCTCTCACCGCCCCACGCAGGATCCCG 

GCCTGGTCACCGGGCAGTGTGATGCTTCCCGACTGCCGCGGGGACAGCGAGGCACACACA 

GGGCTTGGGCCGCGCCGGAGGCCACACGGCCTGGCTGAGTTGCTCCTGGTCTCCCGCCTC 

TCCCAGGCGACCCGGAGGTAGCATTTCCCAGGAGGCACGGGCCCCCCCAGGGGGATGGGC 

ACAGCCACGCCAGATGGACGAGAAGACCAAGAAAGCAGAGGAAATGGCCCTGAGCCTCAC 

CCGAGCAGTGGCGGGCGGGGATGAACAGGTGGCAATGAAGTGTGCCATCTGGCTGGCAGA 

GCAACGGGTGCCCCTGAGTGTGCAACTGAAGCCTTGAGGTCTCCCCAACGCAGGACATCA 

GATTCCTCATGGTGCAAAATGGCCATTCCAAGCTTCCATCCAGCCATCACATCACAGGAG 

GAAGGGAAGA 
Sequence 1760



CCGATGTCTACAAGCTCGTCAAAGACTAGGGTGCCCTCTGCGCCTCCTTAAGGATGCAGG 

GTGAGCATCTCCTCTCCACACCTGCCTGGCACCCCTGGGGGGGTCCAGGATTGAGGATTC 

ATCTACCTGCCAGGCCTCAGGCCCAGGACCCAGGAGGCCTCCCCACCTACCCCAGCACAC 

ACACTCCCTGCCACTGTTCTGCGCTTTAATTGTGGGAGAAGAGAGGAGAGGAGGGCTCAG 

CGGTGGGGCAGCCTGTCCGGGGCGCTGACCCACCATCACCCTGCTCTGCCCAGCCTCGCG 

TGACCTCAGAGAGGTGGGGATAGGGGACACCTTCAGCCTCCAGCATGTGTGGCCACTTGT 

ACCCCCACCCACCCTTGGGGGAGCATGATGGGCAGGTTGAGGGGCAGGATGGAGACCAAG 

GGGAANTCAGTGAGCAGNAGGCCCTGGGGAAGTGTCCGGTCGGGGTTGGACTTGAGGGAC 

AGAGGGGCCCAACACTTTCTTGCCCCTTTTG 

Sequence 1761 

CCGCCAGGCCCACCATCACCACCGCCTCTGGCCGNCACCCCCATCTTNCACCTGTGCCCT 

CACCACCACACTACACAGCACACCAGCCGCTGNAGGGCTCCCATGGGCTGAGTGGGGAAG 

CAGCTTTNCCCTGGCCTCAGTTCCCAAGCTAACTGCCACGTCCACCCACGCATACACACA 

TGCCCTCCTGGACAAGGCTAACATNCCACTTAGCCAGCACCCTGCACCTGCTGACGTCCC 

CACNTCCCTTGGTTGGTGGGGACATTGCTTCTCTGGGGCTTTTTGGAATTGGGGGGCGCC 

CTTCTCTGCTCCTTTCACTGGTTAGNCTCTGGCTACCNATAAGCANGAGGCCNTGGGAAG 

GGGTTTCTCCCCTGGGCCCTTAAAAAAAGGNGGGCCCCAAAGC 

Sequence 1762 

AGTCGACCCCGCGTCCGGCAAGCGGGACCGGTAGGGGCCGGAGCATGCGGCGGCGGCGCT 

CGGACTGTCCCATCCGCCCCGTATTGAGGCGCTGGGAGCGGCGGGGCGACAGGAAAGCGA 

TGGTGAAAGCGGGGCCGTGAGGGGGGCAGGNGCCGGGAGCCGGACCCGCAGTAGCGGCAG 

CAGCGGCGCCGCCTCCCAGAGTTCAGACCCAGGAAGCGGCCGGGAGGGCAGGAGCGAATC 

GGGCCGCCGCCGCCATGGAGCTGAGAGTCGGGAACAGGTACCGGCTGGGCCGGAAGATCG 

GCAGCGGCTCCTTCGGAGACATCTATCTCGGTACCGGACATTGCTGCAGGAGAAGAGGTT 

GCCA 

Sequence 1763 

CGCGTCCGGGGTGATTTGAGCTCATCTTGTTGAGCAAGGGGAGTGAAACCACAGAAACTG 

TTAM TTGAACGTAMTAACTTTGGAAMCAGTTTTGACACTAGAACAAGGTCGCTTCTC 

TTTTCCTCTTCTCTCCCAACCTGTTAACCTAGTGCTAGATAGTCAGCTGCTGTTGCAGGT 

AGAMAGCTCACCAATGCCACATTTGTTCATTCTGTTCMCCTGTTTTGACCTCCCAGGC 

ATCTGMCAGGATGACATCATCTTATTATATCTTACTTTACACATACGTGTATGTACACA 

CACACACACMTACMGACATTTTTCCTGTTTTAGMAATATAGCCCTTGTTTGGATTAC 

TGCTGTCTGAGCACTAGAAATTTCTAATGGAAAGAGGCCTCTGAATGGCTAAGGGAACAT 

CTGGAAAGGAAGGAAAATGAGCCTNAAGGTTTTTGGTGCTTGGI I I 1 1 GGTTCTTTTTCT 

TTCNTGGTTCC I I I I I IGGTTT 

Sequence 1764 

AGTCGACCCCGCGTCCGCGGACGCGTGGGCGGACGCGTGGGGGCCCTCCGGGAAGATGGC 

GGCCGTGCAGGCGGCCGAGGTGAAAGTGGATGGCAGCGAGCCGAAACTGAGCAAGAAGTG 

GTGGTAATCATTAGTTCCAGGGTGCTCTGCCATGTTGACGCAAGCTGCTGTAAGGCTTGT 

TAGGGGGTCCCTGCGCAAAACCTCCTGGGCAGAGTGGGGTCACAGGGAACTGCGACTGGG 

TCMCTTGCTCCTTTCACAGCGCCTCACMGGACMGTCATTTTCTGATCAAAGAAGTGA 

GCTGAAGAGACGCCTGAAAGCTGAGAAGAAAGTAGCAGAGAAGGAGGCCAAACAGAAAGA 

GCTCAGTGAGAAACAGCTAAGCCAAGCCACTGCTGCTGCCACCAACCACACCACTGATM 

TGGTGTGGGTCCTGAGGAAGAGAGCCGTGGACCCAAATCAATACTACAAAATCCCGCAGT 

CAAGCAATTCATCAGCTGAAGGTCAATGGGGAA 

Sequence 1765 

TCACCCCGCGTCCGACTTGAATCCCGTCAGCTTAAAACTTGTGTAAGGGAATCCTGACTT 
TTAAMTGTGAGGGTAmGGATCTGTGTTGAMGTCGTATATTTTTATCTGTGCGGTGC 
TGAGTGCAGGCCACCAGCTCCTAAATAGAGGTTCCCTATATGCGCGTATGACATGGTGAA 
TAAACACAACTCTCTCCACTCAGGACATCCGGAGCGTTATGGACCGTGGTAGGTGGTCGT 
TCTGTGTGCTTGTGAAAGTGTCCAGGCGTGTGCACAGCCAGTGCGCCCACTTCCGGGCTC
CTTGCTCCCTGCTGTACTGAAGTTTTGGATTTTGCATCCAATCCTGTGTGCCTGCCCTTC 



Sequence 1766 

GTCGACCACGCGTCCGGCTCCCGAGCCTGGAGAGCTCGGACTGCGAGTCCCTGGACAGCA 

GCAACAGTGGCTTCGGGCCGGAGGAAGACACGGCTTACCTGGATGGGGTGTCGTTGCCCG 

ACTTCGAGCTGCTCAGTGACCCTGAGGATGAACACTTGTGTGCCAACCTGATGCAGCTGC 

TGCAGGAGAGCCTGGCCCAGGCGCGGCTGGGCTCTCGACGCCCTGCGCGCCTGCTGATGC 

CTAGCCAGTTGGTAAGCCAGGTGGGCAAAGAACTACTGCGCCTGGCCTACAGCGAGCCGT 

GCGGCCTGCGGGGGGCGCTGCTGGACGTCTGCGTGGAGCAGGGCAAGAGCTGCCACAGCG 

TGGGCCAGCTGGCACTCGACCCCAGCCTGGTTGCCCACCTTCCAGCTGACCCTCGTGCTG 

CGCCTGGACTCACGACTCTGGCCCAAGAT 

Sequence 1767 

CCGGMGAGTGCTTTATTGTGAMTTATTTAAAACTGTCCTTTAAAAGAAAAAGAGGAAA 

CGATGAACAAMACTAATCTMTTGCCMGTTAGAATTCATNATTTAATTTACCTCCTAT 

GCMTGATTAATGCTGCAAMTGTATGGTTATGTTACCGTATATTCACAAAAGAAATATT 

ATTACMGGTTTCAGAGGTAGCCAATTGCATTCTTTTGGAMTTTACTGTACTGTTTCAA 

TGT GTTAAG TGCCTTGTTGTAMGTAAAATTTTAAGTCTAGATTCATTATTTTCCTGACA 

TATA TTTTTCATTATGATATCTACTGTATGCTATTGTGATAGTTTTATGAAATGCTTACA 

TTTTMTCAAATATGTAAATTTCAGAAGCTCTTTTTTCCTACCCACCAGTACCTTAATCA 

TTTGGTTTATCACATTGGATTCAMTTCAGGTTCCTTTTTTGATAM 

T 

Sequence 1768 

CCGGTGGGATAGTAAAAGAGAAGACGCGGAGAAGAGGAGAGGACCTACAAGAACGGAGGA 

CAGGGGCGCACGATGGTCCCGGGGGGAGCGGAAACAAAGGNACGCAAAACGGAAAAGCGT 

GTGTAGGGGAGCGGAAAAGGAAGGTCACCACCTGTGGCCTGCGACCGAAATGGCGAAAAG 

TCTTT TGAAG ACAGCCTCTCTGTCTGGAAGGACAAAAATNTGCTACATCAAACAGGATTG 

GTCACTTTTA TAGTA CATCCCATGGTATGTTTATGNAGGGAAGGAAGTGNAAAANNAACA 

CCTTCAGCAGTTTTCCTGGNTGGNATTCCATTGNACCTNTNAGNAAGGGAAGNACAANTG 

GGGCATT TGGGNCATTTC1TTACNTCTTGMCCMTTCCCAAGTTAGGAATGGAAATGCC 

CI 1 1 I I I CAGGTTGTTAATTGNATGCTACAAACTTCTTGGNAMAAAGTTAATTTGGAAT 

TTGGGAAAAMTTTGGGAACAANAGGGGGGAAAAAGGCCTCCATTTGTNCCCGGTGGGGG 

GCCAAAAAAATACCTTTTCTTC 

Sequence 1769 

GCGTCCGCGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGATTTG 
TGTACAAATTGAAATGTCTGTACTGATCCTCAACCAATAAAATCTCAGTTATGAAAAAAA 
AAAAAAAAGG 
Sequence 1770 

GTCGCCCCTCAGCCGCTTCCCCTCGCCATGGAGGCGAGGCCGCCGCCGCCGCCGCGGGGC 

TCGGAGCCGCGGGCCGGGCGGCGGCCCTGAGGGCTAGTGGCGGCCCGAAACGCCGCCGCG 

GAGCCGAGGCGGAGCCGCTGTCCTCGTCCCCAGCGGTCCCGCCCAACGCCCGACTCTGTG 

ACACAACTAAAAAAAAACAAAGGTATTTATGGAATTCCACTGAGTGGTAATGGATGATGC 

AGTTCAAATAACTAAGATAAGCATGGATTCCTGGTTCATTCTTGTTCTGCTCGGCAGTGG 

TCTGATATGTGTCAGTGCCAACAATGCTACCACAGTTGCACCTTCTGTAGGAATTACAAG 

ATTAATTAACTCATCAACGGCAGAACCAGTTAAAGAAGAGGCCAAAACTTCAAATCCAAC 

TTCTTCACTAACTTCTCTTTCTGTGGCACCAACATTCAGC 

Sequence 1771 

GCGTCCGGGCCACATGGCCGCCGCCGCCGCTTGGAGCTGAAGTGCCGCCGCCGCCGGGCA 
GCCACGGGGAATCCGCCCGCATCGCCGCCCTCGCCGGCCGGGCGGCCGTGGGGCCCAGAG 
CGCCGGAGGCCAGGGCTGGGGCGGCACCGCGCAGCGGCCACGGGGTCCCGTTAGAGCAGC 
GCCCGGCGGCTATGCCGAGAGCCCGGAGCGGCCGGAGGAGCAGAGGGGCCGNCGGGAGGG 



T 
AGGAAACCTTTCTGCGAGTACGAGCCAACCGGCAGACCCGACTGAATGCTCGGATTGGGA
AAATGAAACGGAGGAAGCAAGATGAAGGGCANAGGGAAGGCTCCTGCATGGCTGAGGATG 
ATGCTGTGGACATCGAGCATGAGAACAACAACCGCTTTGAGGAGTATGAGTGGTGTGGAC 
AGAAGCGGATACGGGCCACCACTCTCCTGGAAGGTGGCTTCCGAGGCTCTGGCT 
Sequence 1772 

TNGATGTGTATAT GGACT GTTNTGAAGGGTTTTTTTCTTTATAGCCCANTTMGTTNTGT 

TTGGCTCGGTGCATTTTTCATTTATTTAATTAGTAATTTMGTAACNGTGTT^ 

ATCATTGTG AAGTT TCAAGATTCATTATGGGANGAGTTGATGGTNCANTNANGCATGATG 

GTTTMCAA ATTTT AACACCAAAAATGTTAAATCCTGCATAAATTCAACTGTANATAATA 

AATANGGTGTTTTCNT GTATAT GATAGNAATGCAATTAGAAGTACCTNTAGTAAANTCTT 

TTGGMTCACCAATNCTTTTTTGGCTTGAAMTTGGGAMGAATTTCTGTTTATAATNCC 

TTTNNAATTAAACCTTGNGNGGGGGGAGGGGGAAAATAAAAAATTGCAGGAAAAACCTGC 

ATGAGNCACCTTMNMCCTTTAAAAGTMGGGGGCTTNCCAATCTTTTTANTCCCNGGA 

AAACCCTGGTTGCCTN I I I I IGGCA 

Sequence 1773 

CGTCCGTTTAAAGGCTCTGACTCTTGATCTTGAAAGCCGGACGCGGCACTGGCACTCGGC 

TTCAGTTTCCACTGTGACAGATGGAGGTCTCCTTTCGCCCCAGCCCAGGTGGCCAAGCCC 

ATCCTGGCCTCAGAACATGCTGAGCACATTTTGTAGGGTGGCACCTTTTTATCCAAGTTA 

CTAGCTACACATCAGTGTTTAAAGAGAAAAAAGTGACCTTTC AI > I I 1 1 I I I I ICTTGAA 

ACTTGAGGAAACAAGATACATACTACTGA I rTTTTTTTTI I CTTAAAACTAAATGCATG A 

CTGCAGAGCGGTAGAGGTGTATATTTTTCATACTGTGGGGCAAAGTATTTGTGCTGCTTT 

TTGGAGATGGACTGGAACGTCTGGTTTCTGTCCCCGGGCCCGGCAGCTACCGTCTATTTT 

CTGTAGAAGGTGCCACAGTGAGACCTGGAG 

Sequence 1774 

CCCCGCGTCCGCTTCCGGTTGCTAACGGTTCCCAAACAGCCCCCGAAAACGCTACGTGAG 
CTGGGCCCTGGGCCAGAGGCAGAAAACGGACGGAAGAAAAGGTCTGGCCGGAGATGGGTC 
TCACTCTGTCACCCAGACTGGAGTGCAGTGAGTGGTGCGATCATAGCTTACTGCAGCCTG 
AAACTCCTGGGCTCAAGTGATCTTCTCGCCTCAGCCTCCTGAGTAGCTGGAGCTACAGGT 

GTGAGCTACCCAGCATGGCTCATTTGAGATTTCTGAGTAGAGAAGTAACATGATTAAAC 
Sequence 1775 

GGAACCTCCCCTAGATTTCAGAAGATGTATGGAAACGCCTAGATGCCCTGGCAGAAGTTT 

GCTGCAGGGGCGGGGCTCTCATGAAGAACCTCTGCTAAGGCAGTGTGGAAGGGAAATGTG 

GGGTTGGAGCCCCCAAACAGAGTCCTTAGTGGGGTGCTGCCTAGTGGAGCTGTGAGAAGA 

GGGCCATCATCCTCCAGACCCCAGAATGGTAGATCCACTGACAGCTTGAACTGTGCACCT 

GGAAGAGCCGCAGACACTCAATGCCAGCCCGTGAAAGCAGCCAGAAGGGAGGCTGTACCC 

TGCAAAACCACAGGGGCAGAGCTGCCCAAGACTGTGGGAACCCACCTCATGCTTCAGTGT 

AACCTGGATGTGAGACCTGGAGTCAAAGGAGATCATTCTGGAGCTTTAAAGTTTGACTGC 

CATGCAGGATTTCGGACTTGCATGGGCCCTGTMCCCCTTTGTTTTGGCCMTTTCTCCC 

GTTTG GAACGGCTG TAATTACCCAATACGTGTATCCCCATCGTATCTAGGAAGTAACTAG 

CTTGCTTTTGTTTTTACAGACTCATAGGTGGAANGGACTTGCCTTGTCTCAAATGAGACT 

TTTGGACTGTGGACTTTT 

Sequence 1776 

GCGTCCGGMCTTTATAAGAATTTATGCCGTTNTACATGMCCGTTAAGTTTTGTACTTG 

ACGTTTCTGTTTATTANGCTAMTTGTTCCTCAGGTGTGTGTNTATATATATACATATAT 

ATATATATATATATATGTATATATATACACATATATACGTATATATACATATATATGTAT 

ATGGAGTCTCGCTCTGTTGCCCAGGCTGGAGTGCAGTGGCACGATCCCAGCTCACTGCAA 

CCTCCGCCTCCCGGGTTCGGGCGATTCTTCTGCCTCAGCCTCCCTGGTAGCTGGGGCTGC 

AGCCATGTGCCACCAAGCCCAGCTAATTTGTATTTTTGGTGGGAGACAGGGTTTCACCAT 

GTTGGTGAGGCTGGTCTGGAACTGACCTCAAGTGATCTGCCCACCTCGGCCTCCCAAGGT 

GTTGGGATTMCAGGTGTGAAGCCACCACGCCTGTCCCAGTATATTGTTTAACAAGTTTA 

TTTTTGGGTGAAAAATTCTCTTTAATGGGAAGA 
Sequence 1777

CCACGCGTCCGGCGCCCCCTCCCGGCCGCCATGTTGGCTGGTGTGTGGGTGTCAAACTGA 

GCCAGACGCGGCGGTGGCGGCGGCTCCGCGGGCTACGGTCGCTCCCGCCTCTCGAGCGCT 

GCCGGTGGCCGCAGCGGCGCACCCACGCCGGCCCGGAGGAGCAGAGTGCTAAGTGCTGGG 

TGCTCACTGGTGATGAGGCAGATGAAGGTTACCAAACTTGTGGACAGGAGCCTCATATCA 

GAGACGTGGACCTCACTGTAGCCTGGTCATGGCTTCCAGCTTTTCGAATCTGAGGCTCCA 

AAGGAGGAAATGACCATTCAGGGATCTTACTCCAGCTTGATTACGGAGACTGAACCTTCA 

TAGGGTGCGCACTTACCMGGACAGGAAGGTTTCTCTGTTTGMGGGCTTTAAACTTATA 

ACAAAGAAAATAA 

Sequence 1778 

CGCGTCCGAGACAGTTCATACTGGAGACAAACCCTACAAATGTAATGAATGTGGCAAAAC 

CTTTAAACGGAACTCAAGCCTCACTGCACATCATATAATCCATGCAGGAAAGAAACCATA 

TACATGTGATGTATGTGGCAAGGTCTTTTATCAGAATTCACAACTTGTAAGGCACCAGAT 

AATTCATACTGGAGAGACACCTTACAAATGTAATGMTGTGGCMGGTCTTCTTTCAACG 

TTCACGTCTTGCAGGGCACCGGAGAATTCATACTGGAGAGAAACCCTACAAATGTAATGA 

ATGTGGCAAGGTCTTCAGTCAACATTCACATCTTGCAGTGCATCAGAGAGTTCATACTGG 

AGAGAAACCTTACAAATGTAATGAATGTGGCAAAGCCTTTAATTG 

Sequence 1779 

NCCCCGCGTCCGCTGTTGGAGCAGTAGACTTCTCACATCTTTTTGTCACTTCAtCGTTTG 

ACTGGACAGTAAAGCTTTGGACMCTAAGMTMCMGCCTTTGTATTCATTTGAAGATA 

ATGCAGACTATGTTTATGATGTTATGTGGTCACCTACCCACCCAGCCCTGTTTGCCTGTG 

TGGATGGCATGGGGAGATTGGATTTGTGGAATCTCAATAATGACACAGAGGTACCAACTG 

CCAGCATTTCTGTGGAGGGTAATCCTGCTCTTAATCGTGTGAGATGGACCCATTCTGGCA 

GAGAGATTGCTGTGGGTGATTCTGAAGGACAGATTGTTATATACCGATGTGGGAGAGATT 

GCTGTTCCCCGCAATGATGAATGGGCACGGTTTGGCCGAACACTTGCAGAAATTAATGCA 

AACCGAGCTGATGCAGAGGAGGAAGCAGCTACCCGAATCCTGCTTAGTTCCTGAAAAGGG 

GAGTGTAACTAGTGGATTGGGAAAGGGTC 

Sequence 1780 

AGTAGGMCCMGAAAACTTCTTTTGCCMCTTTACAGGATATCTGGTAAACTATTACAT 

NGTCAGGCCMACATGCTCCTTGCATTTTTGTTGGCTGAATNTGGGTACAGAGTGGTTCT 

ATACGATGGTAATAACCAACTTGNAATCAAAGGAAGNATTCCAACAGAAACAGATAGGAN 

AANGTCTTGAGAAGATATATNAAGGAATNTGTCACTTGTCACACNATGCCCGATCACCGG 

GACACAMTTCCTGCAAGMGGGACCACACCGGACTCTATTTTCCTACAAGNGCAGGAAA 

CCTTGTTCATTCTAAGCATGGTTTCTGNTTGCCCAGGTATTCAAAAAA 

Sequence 1781 

ACCCACGCGTCCGGCTGCGTTGGGCTTGCGTGCGGCTCGCTAAGACTATGGCGTCCGGGC 
CTCATTCGACAGCTACTGCTGCCGCAGCCGNCTCATCGGCCGCNNCAAGCGCGGGCGGCT 
CCAGCTCCGGGACGACGACCACGACNACNACCACGACGGGAGGGATC 
Sequence 1782 

CCGCGTNCGTTTGTGTTGMTGGNTTGTATACTTCTTTACACAACCTATCCATTACTTAA 

GGAATCTGCTCTTATTCTTCTACAAACTGNTCNGGAACAAANTGATATCAGAAATTNGAT 

AAMGMCTTCGMATGNTTGAAGGAGTNGAGGMGNTCATTGAATTNCATGTTTGGCAA 

CTTNGCTGGAAGCAGAATCATTTGCCACTG 

Sequence 1783 

GTCGACCCCGCGTCCGGGCCCGTCTACAAGGNTTGTAGATAAAATAGAAACATACCTTCC 

TTGAAMTGCAGMTAAATTTTTTAAAGGCAGGAAGGAAGTGTTTGAACCATGTGTCAAC 

AAGCTTTACTGTCAMGCAGGCTTTTGGTATGGGAAGAAAAATACTTATAAATACTNGTN 

TTAATATTTGCTTTATTAAAATACATTTMMTACAGCATTTTTAAATCTCTAAGCTCAA 

CTTGMGATATAAGAACAGTAAATTTGATAAAAATGAGAMTTACATTCCCATTTCTTTA 

AC AATT TGTAAATTCCAATTATCCTGAACATTTAACACCATTTACATATTTTATTAATCA 

CATTTTCTTAAACATTTGATAAGAGATTTAATATTTTGATCCAACTACCA 
Sequence 1784

GCGTCCGCGGACGCGTGGGTGTGGAGCCTGTTACCTCCGCATTTGACACGAGTCTGCTGT 

GAACCCCGCAACCTCCTCCCCACCTCCCATCTCTCCTTCCAGGCCCATCCCTGGCCCAGA 

GCAGGAGGGAGGGAGGGACAGATGGCGGTGGGTTTTTGTATCTGAATTTGCTGTCTTGAA 

CATAAAGAATCTATCTGCTGTTGAAAAAAAAAAAAAAAAAAAGGnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnTCACAAATAMGCAnnnnnnnnnnnnnnnnnnnnnnnnnGTTTGTCCAAACTCATC 

AATGGTATCTTATCAATGTCTNGGATCCCCCGGGTACCCGANGCTCGAATTTAATTTCCT 

TTTTTC 

Sequence 1785 

ACGCGTCCGGCTTCGTTCTCCCGTCCCTCCCCGCGCCTTGGCGCGGGGGGTCGACTAGCC 

AAGTGAGGCGGGAGGCGACTCGGACCTTTCCCTGCATTTCGTTTCGGCCAGTGCCGGGGG 

CTACCCGCCCTGGGGCCTGGGATCCTTGGGGCCCGTGAGGCCCACTCTTAGCGGCCGGGG 

CCTACCGCGGCCCGCCGCTGGCCCTCATGAGGCATAGCCTGACCAAGCTGCTGGCAGCCT 

CGGGCAGCAACTCCCCAACCCGCAGTGAGAGCCCGGAGCCGGCTGCAACTTGTTCGCTGC 

CCTCTGACCTGACCCGGGCTGCAGCGGGGGAGGAGGAGACGGCGGCGGCNCGGATCTCCC 

GGCCGCAAGCAGCAGTTTGGCGACGAAGGAGAGTTGGAAGCCGGGAGGGGGGGAAGCCCG 

CGGCGGCGTGGCCGTGCGCGCGCCCTCCCCCGAGGAGATGGAGGAGGAGGCGATCGCCAG 

CCTTCCGGGGGAAGAGACGGAAGGATATGGAC I Mill I CTGGGCTGGA 

Sequence 1786 

ACGCGTCCGGCCAGGCAGTGATGGAATCCCGGGGTCNGCAGGAGAGAAGGGTGAACCAGG 

TCTACCAGGAAGAGGATTCCCAGGGTTTCCAGGGGCCAAAGGAGACAAAGGTTCAAAGGG 

TGAGGTGGGTTTCCCAGGATTAGCCGGGAGCCCAGGAATTCCTGGATCCAAAGGAGAGCA 

AGGATTCATGGGTCCTCCGGGGCCCCAGGGACAGCCGGGGTTACCGGGATCCCCAGGCCA 

TGCCACGGAGGGGCCCAAAGGAGACCGCGGACCTCAGGGCCAGCCTGGCCTGCCAGGACT 

TCCGGGACCCATGGGGCCTCCAGGGCTTCCTGGGATTGATGGAGTTAAAGGTGACAAAGG 

AAATCCAGGCTGGCCAGGAGCACCCGGTGTCCCAGGGCCCAAGGGAGACCCTGGATTCCA 

GGGCATGCCTGGTATTGGTGGCTCTCCAGGAATCACAGGCTCTAAGGGTGATATGGGGCC 

TCCAAGGAGTTCCAGGATTTCAAGGTCCAAAAGGTCTTCCTGGCCTCCAGGGAATTAAAG 

GTGATCAAGGCGATCAAAGG 

Sequence 1787 

AAGCTATTGTCACACCTTTGAAACCAGTTGACAACACTTACTACAAAGAGGCAGAAAAAG 

AAAATCTTGTGGAACAAGTCCATTCCGTCAAATGNTNGTTCTTCCCTGGAAGTTGAGGCA 

GNCATAT CAAG AAAAACTCCANCCCAGCCTCAGAGANAGATCTCTTAANGCTTTCTGCTC 

AGAAGGATTTTGGAACAGAANAGAAAAGCATCATGGTAANAAATGAAAGCCAAGAGATGT 

G CCCA CTCCTGTTAATTCATTCGATTGAAAATTCATACCCCTTCTAAGAAAAAATGAAAA 

GTTTTNTAAACAACAAAAAAGAAAGCCCNNNAGGCAAGGAAAGGGCANGTGCTTCATNAA 

GNATACCTGNTGNAAAANAGGAAATGGCATCTTCCNCCAAGAAGAAAAAGCCCAANGGGT 

TTGTACCATACCTGGTGCNCTTTGTATGCNCACCCTGCCAMGGCATAMGGTTTTCTAA 

AAAAAGTTACCTTGAGNGAAGCCAANGAAGCCTGGGAGAAAANAGTANTGNAAAAAATGC 

AAGCNAAAGAAGGNTGGGTGGGAAG 

Sequence 1788 

CCCCGCGTCCGAGCMACATAAGAAACCTGAGTCATTTTGTCATTTAGAGTATTCTGATA 

A AATC TCTTGAAMTACTGAAATCAAMGGTTAATGATTTTTTGTTCATTCTGATTTGTC 

ATTTTATTATCTGTCTAGCAGAAAAATCAAATGGGTAAATTAGCACTTTAGACAGCCAAC 

ATAGTGMACCCTCATCTCTACTAAAAGTTGGCAATTAAATCTGAATTTACAGATACAGA 

TAACAGTTTATCAGAAATCATATTTTTCCTGAAGAMATTTAAAATTAGGAGTTGTGGGC 

CTGGTGCGGTGGCTTACGCCTGTAATCCCAGCACTTTGGGAGGCTGAGGTGAGCAGATCA 

CTTGAGGTCAGGAGTTCGAGACCAGCATAGCCAACATAGTGAAACCCTCATCTCTACTAA 

AAGTACAAAAATTAGCCAGGTGTGGTGGCCGTGTGCCTGTAGTCCCAGCTACTCGGGAGG 



CTGAGGCGGGAGAATCGTTTGGAACCCG 
Sequence 1789 

GACCACGCGTCCGGTCGGCGTCTGGGCCTCGTCCCCTTCTCTCTGTCTCCCTTGCCTCCC 

CCATCACGTCCCCTGACACCGACACCCCATTGCTCCCACAGTCTCCCCAGNCTCCACTTT 

GGTCCCCAGCGCTGTCTGCCCGAGGATTTGCCTGAAGGCTGCCCCCAACTCTGCACCCGC 

CCCCCGAGGGCCACCGAGGACCATGACTAAGACAGATCCTGCCCCGATGGCCCCGCCACC 

CCGAGGAGAGGAGGAAGAAGAGGAGGAGGAGGATGAACCCGTCCCCGAGGCCCCCAGCCC 

CACCCAGGAGCGCCGGCAGAAGCCTGTTGTGCACCCCTCGGCACCTGCCCCCCTCCCTAA 

GGACTACGCTTTCACCTTCTTCGATCCCAATGACCCGGCGTGCCAGGAGATCCTGTTTGA 

CCCTCAGACCACCATCCCCGAGCTGTTTGCCATTGTGCGCCAGTGGGTGCCCCAA 

Sequence 1790 

CGGGGTCTTCTTGCTGTGAGGTCGCGTTCCCCAGTGTTACGGAGGGTCCTTGAGGCAAGG 

AGTGAAAATTGGGTCTGGGGGTTAGTCCTGGGGTGGAGGTCTGGGCACGCCGGGTCGGAC 

CCCCTNCATCTTCGGNTTTGCACACCCCGCTTTCCAGCGCGGAGTCCGCGGCGGGTAGGG 

CNGGCGTCNCGTGCGTGACGTCATCCAGCGGCGCCTNGCGAGGCTNCAGTGGCCTTGACC 

TCCCGCGGNGTGGGAGGCTGCGCGGCGATGCTGCANTTTCGTCCGGGCCGGGGCNGCGGG 

CCTGGCTT 

Sequence 1791 

GGGTTATGAGAAGAACGCTCAGAGCAGAGCACQGAAAGTGGCCACTACCAGCATGAAGAG 

CCCAACAATTCAAACTGGNAGMGTGAGAAAAACAGAATGCAGCTTTCAAGGTTCGTTTC 

AAGCAGTTGGCTTGTGGGACTCTGAGAGATGCTGCTGNCCATGACATGCGGGAATTATCA 

TGATCMCTA CCCAG CTTGGATTTCACCCAGTGGCCAACTAGCTTTGTGTGGGAGACGGC 

AAGGGTTGGATTTTTCAAAAGAGTAAACCAGACCCGTGACCAAGGTGTNAACTAAGAAGT 

GGAGTCATGCTTCACACGGNCNTATCNTGCTGGCAGCCATTCCTGGGTCTGGCTGTGGTG 

TTAATCTTCATGGGATCC 

Sequence 1792 

GTGCTGGTTTNTCTTGCAGATGCTGCTGCTAGGGGTGGTGGGAAGCAGCCGTGGGACGCG 

TGGCCGGNAGCGGNGGTGACAGCCTGGGANNNCGGGGGCTTNTCTTCCTTGTCCTCCTCC 

TNTCCTGTCTATTCCCAGNGNGGGGCGTGGCTGACACTAAAGACTNTGTANNCATCAACC 

CGAGTGCANNTTCNATGGAAAATGAAGGTTGCACGTTTTCAAAAAATACCTAATGGTGAA 

AATGAGACAATGATTTGCTGTATTGNCANCCAAAAAAAGCAAAGGGNATTNCCANTCATT 

GAAGNTGCAAGCATTCTNCAAGCTGATCTTCNAAATGGGTCTAAACAAAATGTNAAGNTA 

AGTTCNTNAGGCNAGCCCTTTCATGGTTTNGAMTGAGNTTTCGNTTTTTATGTGAAAGN 

TGGAGGCCNCCNTGTNGGGAAAGAAANGNGTTTNTTTTCNCCAGGTTTTNAAAA 

Sequence 1793 

GTCCGTTTTACAACCTAGTAATAATGTGGATAAATGTATCTACATGACACATGTCAAGAC 

CAAAATAACTGTGAATGACACACCTTGCTGTAAATGAACTGTGCTAACCCTGACTGTGGG 

CTTGAGAACAAAGATGAACTCTAGAACTCTAGCAGCCTAACTGCTGCTTCTCAAATAACT 

GTGTGMCAGTGAG ATATTAC TGTTTGTTTCTAAAMTCCTACTGTGCCCAGTTTCCTTC 

ACTACATGCCCTGCATTTTTTATTTAAATATTTAGCTGTAGCGCCATCAGATATGGATGC 

CTTCTAACAATTGCTGTTTGTAAAATAAATCAGGATGGTAGAAAGTGATTATATGGAAAA 

TTGGMCCTGGATGAGACCTTTTCGTTGAATTCTGAAGAGTAATGATGTGAAAATTGATA 

CAGGGCAAGAGATGATTCTTTTGGGTTTTCTTCTACTTCATGTCCAGAAGAGTAAGAGGG 

GAAAA 

Sequence 1794 

TNGJTGCCTGCMTACTACACTTTACMACAATGTTAACACTGTGATTCCTTCATTGTTT 

TAAGAAGTTAACCTAGGGCCGGGCATGGTGGCTCATACCTGTAATCCTAGCACTCTGGGA 

GGCCGAGGC AGGAGGAT CCCTTTAGCCCAGGAGTTAAAGACCAGCCTGGGCAACATAGGG 

AGACCCTGTC I I I I I I I IGGGCAGCGTGGTGGGGGATAAATAAATAAAAGGAAAAAAAAC 

TTAGCCTAGAATTAGAATTAATTTAATTGMTTCATCTAAAGATGTCTCTGGTGATTTTT 

ATATGTTCCGCTATATAATTGATGCTTTATAGTTTTATCATAATCCAACAACTTCAG7TA 
TATTTMTTATTGGTMNGGAGTTTAAGACTNGAAAGACTAGAGTGCTTTCTAGTCCAAA
TAGAGGGTCANGTGAMCCAGCTTTTTGACATCMGATTTTCATTTTGAGAAGGGANAAG 
CCTGTGGGACTGGGCTTAAA 
Sequence 1795 

GTCCGGAGAAAACTGCAACTTGCCCAGNTCCTCTTCTCCACCAATAGATGCAGCATCCGC 

AGAGCCCTATGGCTTCCGTGCCTCAGTGTTATTCGATACAATGCAACACCATCTAGCCTT 

GAATAGAGATTTGTCCACACCTGGCCTGGAGAAGGACAGTGGAGGGAAGACACCTGGTGA 

CTTTAGCTATGCCTATCAAAAGCCTGAGGAAACAACCAGGTCCCCAGATGAAGAAGATTA 

TGACTTGAGTTATGAGAAGACCACCCGGACCTCAGATGTGGGTGGCTATTACTATAGAAA 

ATAGAGAGAACCACAAAATCTCCAAGTGACAGTGGCTACTCCTATGAGACCATTGGGGAA 

AACTACCAAGACCCCTGAAGGATGGTGACTATTCCTATGAA 

Sequence 1796 

CCGGCNTAGGCGGGGGGAACACGCCGCCTGCGCTCTCTTGGGACCCTAGATTTGGGGGAG 

GAGGTAACGAGAGGCGGAGAGGGTGGCTCCTCAAATATACACCCCTCCTGTCCTCCGCCA 

CCCCACCTTTGATTTCTCTTCCCTCAACCCAGCACTCCAGCCCCACCCCAGGGTCAATTT 

TTGCCCCCTTCCATCTGAGCAGTGTTACCAGGCCCCAGGGGGACCGGAGGATCGGGGGCC 

GGGTGGGGGGTCCCATGGAGTACTCCAGCACACGCAGGGGCTCCCTGCAGACAGGGGGGG 

CCTTCGCCCTGGAAGCCTGGACGCCGAGATAGACTTTCTGAGCAGCACGCTGGCCGAGCT 

GAATGGGGGGTCGGGGTCATGCCGTCACGGCGACCAGACCGACAGGCATATGAGCC 

Sequence 1797 

TCCGATTNNGCCNAGGGTTGCAGTTTGTAGACCCCTGATCTAGACCCTTAAGTAGCCTTG 
TTTGTGCCTGAAGTTTACAGATGATCCCCAACTTATTTTTATTTT 

AGTCTCTCTCTGGAGCCCAGGTTGGAGTACAGTGGCAAGATCTTGGCTCACTGCAACCTC 

CACATTCCGGGTTCAAGTAATTCTCCCGCCTCAGCTTCCTGAGTAGCTGGGATTACAGGC 

GTGTGCCACCATGCCAGCTTTTTTGTATTTTGTATGTTAGCCATGTTGGCAAGGCTGGCC 

TCAACCCCTGACCTCAAGTGATCTGCCCACCTCAGCCTCCCAAAGTGCTGGGATTACAGG 

AGTGAGCCACCATGCCCGMCCCCAACTTATGTTTGACTTACAATGTTTGGACTTTATGG 

ATGGTGCAAATGTTATATGCATTTCAGNAGAMCTGGGCTTCAAAATTTTGMTTTTTGA 

ATCI I 1 1 IATTTT 

Sequence 1798 

TCCGCTGCCGAAGTCAGTTCCTTGTGGAGCCGGAGCTGGGCGCGGATTCGCCGAGGCACC 

GAGGCACTCAGAGGAGGTGAGAGAGCGGCGGCAGACNACAGGGGACCCCGGGCCGGCGGC 

CCAGAGCCGAGCCAAGCGTGCCCGCGTGTGTCCCTGCGTGTCCGCGAGGATGCGTGTTCG 

CGGGTGTGTGCTGCGTTCACAGGTGTTTCTGCGGCAGGCGCCATGTCAGAACCGGCTGGG 

GATGTCCGTCAGAACCCATGCGGCAGCAAGGCCTGCCGCCGCCTCTTCGGCCCAGTGGAC 

AGCGAGCAGCTGAGCCGCGACTGTGATGCGCTAATGGCGGGCTGCATCCANGAGGCCCCG 

TGAGCGA TGGA ACTTCGACTTTGTCACCGAGACACCACTGGGAGGGTGMCTTTTNCCTT 

GGAANCGTTTTGCGGGGGCCTTGGCCTGCCCAAGCTNTACCTTTTCCAACGGGGCCCCGG 

CGAG 

Sequence 1799 

GGCGNAGCTCGNCTTCCTCCNCGCCCAAGTTCCGGCGCCGCTCTTGCGGGAGCGTGCCGC 
ATCACCCCGGGGGCCCCTACGCGAGGATCTCCGGGGCCGTTGGCAGCAGCCTG 
Sequence 1800 

TCACCCCGCGTCCGGGCGGGCGTGGGGCGGTGGGAGGTAGTGAAGAAGGGTCGGCGGCCT 

GGGGTCGGCGCCGGCGCCGGCGGCCGAGGAGGCGGTAGGAACCGCAGGGCGCTCGGGGAA 

GCAAACNGAGTGTGGAAATACNACCTGACCCCTGCAATCCAGACCACAAGCACCCTTTAT 

NAGCGGNGCTTTNAGAATATCATNAAGNGGNTTTAATAAGGAGCAGGTCCCACCCCCTGC 
TGTGGAA 

Sequence 1801 

GTCGACCCACGCGTCCGGGAGCAGAGTCGACTGGGAGCGACCGAGCGGGCCGCCGCCGCC 
GCCATGMCCCCGMTATGACTACCTGTTTAAGCTGCTTTTGATTGGCGACTCAGGCGTG 
GGCAAGTCATGCCTGCTCCTGCGGTTTGCTGATGACACGTACACAGAGAGCTACATCAGC

ACCATCGGGGTGGACTTCAAGATCCGAACCATCGAGCTGGATGGCAAAACTATCAAACTT 

CAGATCTGGGACACAGCGGGCCAGGAACGGTTCCGGACCATCACTTCCAGCTACTACCGG 

GGGGCTCATGGCATCATCGTGGTGTATGACGTCACTGACCAGGAATCCTACGCCAACGTG 

AAAGCAGTGGCTGCAGGAGATTGACCGCTATGCCAGCCGAGAACGTCAATAAAGCTCCTG 

GTGGGGCAACAAAGAGCGGACCCTCACCACCAAAGAAGGTNNGTGGGACCAACACCACAA 

GCCAAGGGAGGTTTGGCMGACTTCTTTTGGGGCATTC 

Sequence 1802 

NCCCCGCGTCCGCGGACGCGTGGGCGGAGCTGCTGTGCAGTGGAACGCGCTGGGCCGCGG 

GCAGCGTCGCCTCACGCGGAGCAGAGCTGAGCTGAAGCGGGACCCGGAGCCCGAGCAGCC 

GCCGCCATGGCAATCAAATTTCTGGAAGTCATCAAGCCCTTCTGTGTCATCCTGCCGGAA 

ATTCAGAAGCCAGAGAGGAAGATTCAG7TTAAGGAGAAAGTGCTGTGGACCGCTATCACC 

CTCTTTATCTTCTTAGTGTGCTGCCAGATTCCCCTGTTTGGGATCATGTCTTCAGATTCA 

GCTGACCCTTTCTATTGGATGAGAGTGATTCTAGCCTCTAACAGAGGCACATTGATGGAG 

CTAGGGATCTCTCCTATTGTCACGTCTGGCCTTATAATGCAACTCTTGGCTGGCGCCAAG 

ATAATTGAAGTTGGTGACACCCCAAAAGACCGAGCTCTTCTTCAACGGAGCCCAAAAGTT 

ATTTGGCATGATCA 

Sequence 1803 

CGCGTCCGCGCTTCTGTTACGGCCAGTGCAACTCTTTCTACATCCCCAGGCACATCCGGA 

AGGAGGAAGGTTCCTTTCAGTCCTGCTCCTTCTGCAAGCCCAAGAAATTCACTACCATGA 

TGGTCACACTCAACTGCCCTGAACTACAGCCACCTACCAAGAAGAAGAGAGTCACACGTG 

TGAAGCAGTGTCGTTGCATATCCATCGATTTGGATTAAGCCAAATCCAGGTGCACCCAGC 

ATGTCCTAGGAATGCAGCCCCAGGAAGTCCCAGACCTAAAACAACCAGATTCTTACTTGG 

CTTAAACCTAGAGGCCAGAAGAACCCCCAGCTGCCTCCTGGCAGGAGCCTGCTTGTGCCG 

TAGTTCGTGTGCATGAAGTGTGGATGGGTGCCTGTGGGGGTGGTTTTTAGGACACCAGAA 

GAAMCACAGTCTCTTGCTAGAGAGCACTCCCTATTTTGTAAACATATCTGCTTTAAGGG 

GGATGTACCAGAAACCCACCTTACCCC 

Sequence 1804 

CCCGCGTCCGTAGATTAAATTATGCAAGTTGCAAGAATGTAGTAACTCTGATCAGCTACA 

AGGAAAGGAGGAAAGAGTAAATGAAGAAAGTCATCTAACTGAAAAGGAATATATAGAACA 

TTGTAACACCCCTACAACTGATTCTGATTCATCTATAGCAGTTAAAGCACTACAAATAGA 

TAGCTTTGGTTTAGTTACATGCTTTCAACMGAGTCTCTTGATGTTTCTCAAATGATACT 

TGGAAAATCTCAGCAACCTGAGTCAAAAATGCAATCTGAATTTATAAAAGAAAAAAGTGC 

TACTTGTTCAAATGAGGAAAAAGGTAACTTAAACGAGGTCAGTAATAACTGAAGAGAAAG 

AAACAGATGGGAGATCACCCTATCTTCATTACTGAACCAAAACTACCNGTTCACAATATA 

CCTGGATTCGACAGCATAAAAGAAACC 

Sequence 1805 

GCGTCCGGCAGCTGAAAGGGGATTTGGGCCCGGAAGATCCGAGtCCATCCGCGGCGGGGA 

GAGGGCAAGCGGGACCGGTAGGGGCCGGAGCAGCGGCGGCGGCGCTCGGACTGTCCCATC 

CGCCCCGTATTGAGGCGCTGGGAGCGGCGGGGCGACAGGAAAGCGATGGTGAAAGCGGGG 

CCGTGAGGGGGGCGGAGCCGGGAGCCGGACCCGCAGTAGCGGCAGCAGCGGCGCCGCCTC 

CCAGAGTTCAGACCCAGGAAGCGGCCGGGAGGGCAGGAGCGAATCGGGCCGCCGCCGCCA 

TGGAGCTGAGAGTCGGGAACAGGTACCCGGCTGGGCCGGAAGATCGGCAGCGGCTC 

Sequence 1806 

GTCGCCCCGCGTCCGCACAGTTGATTCTGMTTTTTAAGGCTTTCCTAATAGGCTGATCA 

CAGAGMTAA TCCAT TTTGAAGGTATAAAACTGCACTGTATGTCTGTCACTTGTAGCTGA 

A CTGAT TCACATTTTGACAAAAGAGAGAAMTACAAMATGAGTTTTGCAAATGTAATAA 

CI I I I ICTGCATATAGAACTAAATAATTGAAAAATATGGGCTATAGTTCTCAAAGGTAGA 

TAGTAMATCACTGGCTT7TTCCAGCTGTATGTTTTTCCACTGTGCGTGTACACACACAC 

TGGAAMTMTTAGGCTGATTTTGCAGGTCTTCATTGTTAGAGATTCTGAAGTATTTACT 

GTCAATTCATAGGTTTCAGTTTATTTAGGAAATTAGTGTTTGAC^ 
TTTCACTGAAGCTGAGATTATTAGTGATCAMGTTAAMTTTCMTATTTMTTTCTCTA

TATATTATTAATATTAAAATGGTTT 

Sequence 1807 

GTCGACCCCGCGTCCGGTACTACATCCCCTGAAGAAACATGTGCCCAGGTGATTCGTGAA 

GCTAAGAGAACAGCACCAAGTATAGTGTATGTTCCTCATATCCACGTGTGGTGGGAAATA 

GTTGGACCGACACTTAMGCCACATTTACCACATTATTACAGMTATTCCTTCATTTGCT

CCAGTTTTACTACTTGCAACTTCTGACAAACCCCATTCCGCTTTGCCAGAAGAGGTGCAA 

GMTTGTTTATCCGTGATTATGGAGAGATTTTTAATGTCCAGTTACCGGATAAAGAAGAA

CGGACAAAATTTTTTGAAGATTTAATTCTAAAACAAGCTGCTAAGCCTCCTATATCAAAA 

AAGAAAGCAGTTTTGCAGGCTTTGGGAGGTACTCCCAGTAGCACCACCACCTG 

Sequence 1808 

CCCCGCGTCCGGCCTTTAAAAGAAGACTTGAATTCCTATGGAACAATAAAGACACAGCAG 

AAAACAGGGATTCTCCTGTTTCAGAGGAAATAAAAATGACCTGTCMCAATTTATCCATT 

ATCACCGTGACCTCTGTATCCGAAACATTGTCAAAGAAAGAAGGTGTGGTGCAAAGACTT 

CTGCTGGAACTTTCTGTGGCTGTGACCTGGTGAGCTGGCTAATTGAAGTCGGCCTTGCCT 

CCGACCGTGGTGAAGCTGTGATATACGGAGACAGGCTGGTACAAGGGGGAGTCATCCAAC 

ATATTACCAACGAGTATGMTTCCGGGATGAGTACTTGTTTTACAGATTTCTTCAAAAGA 

GTCCTGAACAGAGTCCTCCTGCTATTAATGCAAACACTCTCCAACAGGAAAGATATAAAG 

AAATTGAGCATTCATCCCCACCCTCACATTCCCCTAAGACCTAAATTATGCAGGGGAGAA 

CCCTACATGGAATCAT 

Sequence 1809 

CGCGTCCGCTGGAGTGCTGCTGAGGAGCGANGGGCCCATCTGGGGTCTCTGGAAGTCGGT 

GCCCAGGCCTGAAGGATAGCCCCCCTTGCGCTTCCCTGGGCTGCGGCCGGCCTTCTCAGA 

ACGAAGGGCAGTCCTTCCACCCCGCGGCGCAGGTGACCGCTGCCATGGCTTTTCCCCATC 

GGCCGGACGCCCCTGAGCTGCCTGACTTCTCCATGCTGAAGAGGCTGGCTCGAGACCAGC 

TCATCTATCTGCTGGAGCAGCTTCCTGGAAAAAAGGATTTATTCATTGAGGCAGATCTCA 

TGAGCCCTTTGGATCGAATTGCCAATGTCTCCATCCTGAAGCAACACGAAGTAGACAAGC 

TATACAAGGTGGAGAACAAGCCAGCCCTCAGCTCCAATGAACAATTGTGCTTCTTGGTCA 

GACCCCGCATCAAGAATATGCGATACATTGCCAGTCTTGTCAATGCTGACAAATTGGCTG 

GCCGAA 

Sequence 1810 

CGCGTCCGGTGCATCTGAGGACTGGTGGGAAGGCANGGCACAACGGGATTGACGGGCTGG 

TGCCTCACCAGTATATAGTGGTGCAGGATATGGATGATACGTTTTCAGACACTCTGAGCC 

AAAAAGCCGACAGTGAGGCCAGCAGTGGGCCAGTCACGGAAGACAAGTCCTCATCCAAGG 

ACATGAACTCCCCGACAGACCGTCATCCTGACGGCTATTTAGCCAGGCAACGAAAAAGAG 

GAGAGCCACCCCCTCCAGTAAGGCGTCCTGGCAGGACCAGTGAT 

Sequence 1811 

TCAGGAGTCGACCCCGCGTCCGGAAGGCCGATGCTGTGGGGGTGGGCGTGGAGAGAATTC 

TTCTGTGGGTCCTCTGGTGTTGAGTGGTCGGCTTGGTGTGGTGTGCGGAGGAGCTCCAGG 

CCCGTCGGCGCGGAGTGGTCTCACGTGTGAAACATGGCTACAGATTGGCTGGGAAGTATT 

GTGTCCATCAATTGTGGAGATAGCTTGGGTGTCTATCAGGGAAGAGTGTCAGCTGTGGAT 

CAGGTCAGCCAGACCATTTCTCTCACCCGGCCTTTCCATAATGGAGTGAAGTGTCTTGTT 

CCAGAAGTCACCTTCAGGGCAGGTGACATTACGGAGTTAAAAATTCTGGAGATACCAGGA 

CCTGGAGACMCCMCATTTTGGAGACCTTCATCAAACAGAATTAGGCCCCTCTGGTGCT 

GGCTGCCAAGTGGGCATCAATCAGAATGGCACAGGCAAGTTTGTCAAG 

Sequence 1812 

CCGCGTCCGCCCAGTCCNAGTGCTGGCTTTCCCTGTATCTGCCTCTGCCAGGCAACACTT 

ATCATGGCTCCCAATCAGCAGGAGCCTCCATGCTCCACTTTGAACAGCCTCTATGCTCCA 

GCAATGGGGCATTTGTGMGAGTGACTTGATTAACTTTTCTGACCATGGGTATAATACAG 

TTGCTTCAGAGGGCAGTGGTTCTGGGTGTGATTTTTACACTGTAACATTGTATACAGTGT

CATGGATAATTACTATTTTTTTCTGGTCATTAACACTCACCTACTCTAGTACTAGGATTT 
CAGACCAAGGTCCTCATGACGCCTGGATATTTTAGTATCTATATCCMTMTCTTTTCTC

TCCTACTGMTATCCAGGCAMGATGMATCGTTTTCTTTAAAACTGTCAAATTCTGTAA 

AACTCAGGAGCCAGTTCAAGGGAACAAGCATCTTCACAATAGATGGAATCAAGAGTTAAA 

TGTTATAGTGGCAAGCTTGTCTACTGGGCAACAGAC 

Sequence 1813 

CGCGTCCGCCGCCGCGTCCGCTCCCGGTCCCTGGCCGCTCAGCGGCATGGCGTGCGGGGC 

GACGCTGAAGCGGCCCATGGAGTTCGAGGCGGCGCTGCTGAGCCCCGGCTCCCCGAAGCG 

GCGGCGCTGCGCCCCTCTGCCCGGCCCCACTCCGGGCCTCAGGCCCCCGGACGCCGAGCC 

GCCGCCGCCGTTTCAGACGCAGACCCCACCGCAGAGTCTGCAGCAGCCCGCCCCGCCCGG 

CAGCGAGCGGCGCCTTCCMCTCCGGAGCAAATTTTTCAGAACATAAAACAAGAATATAG 

TCGTTATCMGAGGTGGAGACATTTAGMGTTGTTCTTAATCAGAGTGAAGGCTTGTGCT 

TCGGGAAAGTCAACCTCACTCCTCAGCACTCACAGCACCTAGCTCTTCCAGGTTCCTCAT 

GGATGAAGAAGGACCAGCCCACATTTACC 

Sequence 1814 

CGCGTCCGTTGMGAATAATATTGTATGTGCATTTTATCCATTMTGTTTCATACTTTCT 

GAGAGTATAATACCCTTTTAAAAGATATTTGGTATACCAATACTTTTCCTGGATTGAAAA

CTTTTTTTAAACTTTTTAAMTTTGGGCCACTCTGTATGCATATGTTTGGTCTT

GAGGAAGAAAGGATGTGTGTTATACTGTACCTGTGMTGTTGATACAGTTACAATTTATT 

TGACAAGGTTGTAATTCTAGAATATGCTTAATAAAATGAAAACTGGCCATGACTACAGCC 

AGAACTGTTATGAGATTAACATTTCTATTGAGAAGCTTTTGAGTAAAGTACTGTATTTGT 

TCATGAAGATGACTGAGATGGTAACACTTTCGTGTAGCTTAAGGAAATGGGGCAGAATTT 

CGTAMTGCCTGTTGTGCAGATGTGTTTTCCCTGAATGCTTTCGTATTAGTGGCGACCAG 

T 

Sequence 1815 

GTCGCCCCGCGTCCGATTTAAACTGGGTCTTTATAAAAGTAAAATGGCCAACATTTAATT

ATTTTGCMAGCAACCTMGAGCTAAAGATGTMTTTTTCTTGCNATTGTAMTCTTTTG 

TGTCTCCTGAAGACTTCCCTTAAAATTAGCTCTGAGTGAAAAATCAAAAGAGACAAANGA 

CNTNTTCGANTCCANNTTTMGGCTGGGGGAMNTGGGTTTTTTAGCNCAACCNTTNCAA

AAGTTTTNTTTNGGGGATTCATAACANCNCCACCNAATTG^ 

CATTTCMTACTAGTTATATTNANNAGGAGTNGGTAGAGAGGAAACATTTGACTTATCTG 

GNAAAAGCAAACTTGTACTTAAGAATAAGAATAACATGGNCCATT 

Sequence 1816 

TCGACCCCGCGTCCGCTCTGCTCCTTGTCTCCNTCTNCCI 1 1 1 I CTGTCTTTGCCGGGTC 
TCTGGGTCTCTGACCCCCATCCGGCCCTCATGGCTTTGGGTCNGGAGCTNTTGAAGCAAT 
GTTCATCAT 
Sequence 1817 

CCACGCGTCCGGGGGAGCCGGACGCCAGAGTCCCCTCTCCACGCCGTGCAGCTGCGCTGG 

GGCCCCCGGCGCCGACCCCCGCTGCTGCCGCTGCTGTTGCTGCTGCTGCCGCCGCCACCC 

AGGGTCGGGGGCTTCAACTTAGACGCGGAGGCCCCAGCAGTACTCTCGGGGCCCCCGGGC 

TCCTTCTTCGGATTCTCAGTGGAGTTTTACCGGCCGGGAACAGACTGGGGTCAGTGTGCT 

GGTGGGAGCACCCAAGGCTAATACCAGCCAGCCAGGAGTGCTGCAGGGTGGTGCTGTCTA 

CCTCTGTCCTTTGGGGGTGCCAGCCCCACACAGTGCACCCCGATTGAATTTGACAGCAAA 

GGCTCTCGGCTCCTGGAGTCCTTACTGTCCAGCTCAGAGGGAGAGGAGCCTGTGGAGTAC 

AAGTCTTTGCAAGTGGTTTCGGGGCCAACAGTTCGAGC 

Sequence 1818 

TCGACCCACGCGTCCGGTGAAACACAAAACCAAGGAGTACATTAAGAAGTACATGCAGAA 

GTTTGGGGCTGTTTACAAACCCAAAGAGGACACTGAATTAGAGNGACTGTTGGGCCAGGG 

TGGGAGGATGGGTGGTCAGGTAANGACAAGACTCTAGGGNAGAAGGAAANCCTGTGGGCC 

TTTNTGTCCCACCCCTGTCAGCACTGGTGCTACTGATTGATACATNACCCTGGGGGGNAA 

TTNAACCCTGCCAGNATGTCAACNTGGAANGGCCACAAAGAAGTGGAACCTCCCATCTAC 

MMNNGAGTTTACNCTTANATTTTGTAGAAGCCTNGTTTGGCCATTGTTGCNNNTAGAN 
AGTCCATNNANTAGGNGGCAAGGGGGCCTNTANTAAAAATGAACCCCTGGNACAGAACCT
TGNACTTNCACATTCTTTAAAANCCCTGGGAGATGNTTTGCTTTCTNTGGGNCGNGTTGN 
TTTGNTTCAAGCTGCTACNAAGTAACTCTCAATGGCCGGCAATTATCCCCAACTCNCACN
AAACTCCNTTTTTMCCCCTGGCANGGMTCCTTGCAMTTTMTAATTTTTTAMTGG 

Sequence 1819 

GTCCGGCTTTAGTGMTTCTTAATAGATNGTATATATAAMGTACATTTTAATAGAAAGC

CAGGGTTTTAAGGMTTTCACATGTATMGGTGGCTCCATAGCTTTATTTGTAAGTAGGC 

TGGATAMTGGTGCTTAAATGGTAATGTACTCCACTTCTTCCTATTGGAAGATTAACATT 

ATTTACCAAGAAGGACTTAAGGGAGTAGGGGGCGCAGATTAGCATTGCTCAAGAGTATGT 

AAAAAAAAAAAAAAAAAAAANGAACCAAACCACTGGAAATAATCAAATGCAAAAAGGTAA 

CAMTTCATAACTGGAMGCAAAGAGAAGMCAAGTATGATTTGGATGATAAAGCATTGT 

TTTMTGGTGAAAACTTCACCAGATCACTTMTGTTTCTAGGAGGTTAACTTCAAGTGGG

CAANTGGGGGTTTTTAGGTAGGTCAGTGGCCCTAAGTTCCTAAAGCCCACAGATTAGGGA 

TCTGTTAMCTGAATGGTCTGTTGGAMGGTTTTGTTTTAACTGCTTGGGAGGCTTTCCT 

TTTAAG 

Sequence 1820 

GCGTCCGGGAAMGTTTGCCTTCCANGCCGAAGTTAACAGAATGATGAAACTTATNATCA

ATTCATTGTATAMAATAAAGAGATTTTCCTGAGAGNACTGATTTCAAATGCTTCTGATG 

CTTTAGATMGATAAGGCTAATATCACTGGACTGATGAAAMTTGCTCTTTNTTGGAAAT 

GGAGGAACTAACAAGTCAAAAATTAAAGTGTGATAAGNGAGAAGAACCCTGCTGCATGTC 

ACAGACACCTGGTTGTCTGGAAATGACCAGTAGAAGAGTTGGGTNTAAAAAACCTNTGGT 

ACNCATTAGTCCAMTCTGGGACMGNCGGAGTTTTTTAAAACAAAMTTGACTTGAAAG 

CCACCAGGGAAAGATGGCTCAGTTCAMCTTTTTTGGAATNTGGATTGGGCCCAGTTATG

GTGGTCCGGTTTTNCTATTTCCCNCCTTTCCCTTGTAGCAGATTNMGGTTTATTTNGTC 

ACTTTCAAAAAACAACAAACAAACNGATTACCCCAAGCAACATCTTGGGGGAGNTCTTGA 

CTTCCAAATG 

Sequence 1821 

CGTCCGCGGGTAAAATGTTATGGTAAGCATGCACANGTTTGCAGTCTACAGTTTTTTTAT 

GTAGCACAAAATAGGTGTACCTTTATMGTACATTCMTTTTATGATTTACATTTATCAT

GTAATTTTTAAAAAMTCCATCTATCTAGGATATGTTGGATACAAAGTCTGCTTTTGCTA

TTCTTTTTGCTTAAATACTCCTATCATTTTCTGAATTACTTGGTATTTAGAACTCCTAGC 

ACCACGGGGAAGAATAGAGGTATCATCAAACGTGGCAAATTTTCTTTCAGGAATAATAAA 

GAGCATGATTCCACAGCTTTTCTGGGGATGTTTGAGATTCTTTTTTAGTACTAAGCAAAA 

TTCTCATCACAGGAATGTAGCCCAGGCCAATTTATMCTAMTCTCTATTTTGTTCGGAT 

GATGCTTCTAAAACAGCATTGATAGGTTAMGAAGCTTGGGTATTTTTAATTTACTTCAA 

TGATTAGCTCAATTGCTTCTGGGAGTTTTAATCCTGTGGATATGTCAT 

Sequence 1822 

GGGCGCCCCGCGTCCGCTGATCTCGGGCTCCTATTTCATTTACATTGTGTGCACACCAAC 
GTAACCAGTGGGAAAACTTTAGAGGGTACTTAAACCCCAGAAAATTCTGAAACCGGGCTC 
TTGAGCCGCTATCCTCGGGCCTGCTCCCACCCTGTGGAGTGCACTTTCGTTTTCAATAAA
TCTCTGCTTTTGTTGCTTCATTCm 

TTCAACACGCCAAGAACCTGGACACTCTTCACTGGTAACATATTTTGGCAAGCCAACCAG 

GAGMAAGMTTTCTGCTTGGACACTGCATAGCTGCTGGGAAAATGAACATCAGTGTTGA 

TTTGGAAACGMTTATGCCGAGNTTGGTTCTAGATGTGGGAAGAGTCACTCTTGGAGAGA 

ACAGTANGAAAAAAATGAAGGATTGTAAACTGAGAAAAAAGCAGATGAAAGTGTCTCACG 

AGCTATGTGTGCTCTGCTCAATTCTGGG 

Sequence 1823 

CCCCGCGTCCGGTCCTTTGGCACTGTCATNTGTGTCCCTCGAGTGAGCNTCACCAGAGCT 
GCAGTAANNGNGCACCTACTACNGGCTCTGNGCTGAGTCCTTCCAGTGNGCCTCTCACTG 
AATNCTCACCCCACTGNCATGAGGTTTNCCCCATTTGACTGATGAGGGTGNAGAGCCAGG 
GAGCCTTGNTCACTGGTTCATTGANTACATTTACAAATATTATTNACAGAGTGGGAGAAG
AGCCGTATAGGGNCTTMTGCCATGGTNGGGACTTTGGMTTTAATTCAAGTGATGTGGG 
AGTACCTGCCAGATGGATGGAGGGTGAAATMTGCTTAAGCCCTCAGCC 
Sequence 1824 

CGCCCACGCGTCCGCACCGGCCCGATGAGCTGCAGCGGCTCCGGCGCGGACCCCGAGGCG 

GCTGCCGGCCTCCGCCGCCTCGGCCCCGGGCCCCGCGCCCCCGGACTNGGCTCCCGCCGC 

GCTGCCCTCCAGCACCGCCGCGGAGAACAAAGGNCAGCCCCCGNGGGGACAGGNNGGGGA 

GGACCTGGGNGCTNGGAGCAAGCATGCTGGGGGGCACAGGGACCCTTTGGCGGGAAGCCG 

GGCCGGTAGAGCNCAGCNTGAAGCAGGNTNTGAGGAGCTGGNTCCCTGNGNATNGGNNGA 

GGATGGGCCGCNGGANGCCCTCTCCGNGAAGGGGGGNGCTCAATAATCTTAACCGGGGGG 

TTTNAA 

Sequence 1825 

CGCGTCCGGCTGAAGGCTCCCCTGGGNTTNCTGGCCTCCTGGGGCAGAANGGGAGAGAAA 

GGCGATGCTGGCAACTCCATTGGAGGAGGCAGAGGGGNACCTGGCCCTCCAGGGCTCCCT 

GGGCCCCCAGGGCCAAAGGGAGAAGCAGGTGTCCGATGGCCAGGTTGGCCCCCCAGGGCA 

GCCAGGAGACAAGGGGGAGCGTGGAGCAGCTGGAGAACAGGGACCAGATGGCCCCAAGGG 

CTCCAAGGGAGAACCAGGGAAAGGAGAGATGGTGGATTACAATGGAAACATCAATGAGGC 

TCTCCAGGAGATCCGGACGCTGGCCTTGATGGGGCCTCCTGGTCTTCCTGGGCAAATTGG 

CCCACCTGGAGCTCCAGGGATTNCAGGCCAGAAGGGGGAGATTTGACTGCCANGGCCTTT 

CAGGACACGATGGGGAAA 

Sequence 1826 

gtcgaccacgcgtccgg i i i i 1 1 i i 1 1 1 1 i i gagacagagttttgctcttgttgcccagg 

ctggagtgtgatggctcgatcttggctcaccacaacctctgcctcctgggttcaagcaat 

tctcctgcctcagcctcttgagtagcttggtttataggcgcatgccaccatgcctggcta

attttgtgtttttagtagagacagggtttctccatgttggtcaggctggtctcaaactcc 

caacctcaggtgatctgccctccttggcctcccagagtgctgggattacaggtgtgagcc 

actgtgccgggcccgtcccctccttttttaggcctgaatacaaagtagaagatcactttc 

cttcactgtgctgagaatttctagatactac^^ 

tcaagtgtgaccaggatggcgggagggggatctgtgtcactgtaggtactgtgcccagga 

AGGC 

Sequence 1827 

CGACCNCGCGTCCGGCACTCTGTTCTTCCGCCGCTCCGCCGTCGCGTTTCTCTGCCGGTG 

AGCGCCCCGCCCCGGGGCCTGAGCTGGACGTCGCAGGCCTGCGCCCCCCGACCCCGGCTG 

GCCCCGCTTCCAGCTGCCGAGGCCTCGTCGCGCCTTCCCCGGGAACAAAAGGCGGGGTCG 

CAATGGAAGAAGAGATCGCCGCGCTGGTCATTGACAATGGCTCCGGCATGTGCAAAGCTG

GTTTTGCTGGGGACGACGCTCCCCGAGCCGTGTTTCCTTCCATCGTCGGGCGCCCCAGAC 

ACCAGGGCGTCATGGTGGGCATGGNCCAGAAGGACTCCTACGTGGGCGAC 

Sequence 1828 

CNCCACGCGTCCGGACCGGGGAAGACGCCTCTTCGCCGCCTCCGAAAACCGAGGCAGCGA 

GCGACCCCCAGCATCCCGCGGCCTCCGAAGGGGCCGACGCCGCCGCCGCCTCGCCGCCAC 

TGCTGCGCTGCCTAGTGCTCACCGGCTTTGGAGGCTACGACAAGGTGAAGCTGCAGAGCC 

GGCCGGCAGCGCCCCCGGCCCCTGGGCCCGGCCAGCTGACGCTGCGTCTGCGGGCCTGCG 

GGCTCAACTTCGCAGACCTCATGGCTAGGCAGGGGCTGTACGACCGTCTCCCGCCGCTGC 

CTGTCACTCCGGGCATTGGAGGGCGCCGGGGTGTTNTGATCCGCAGTGGGCGAGGGGAGT 

CAGCGACCGCAAGGCAGGAGACCGGNTGATGGTGTTGAAC 

Sequence 1829 

GGTGTCGCCCCGCGTCCGCTTGTC I MM I GGGGGTGTGAA I I I I I I GCATTGTTCTGAT 

CATATTCTTTATCATGTMTTTATGTTCTTTTTTACTMGTATTATGTGTGGTTATTATA 

GATTTTCACAMGATATATTGCTGGTMTATATTTTATTGTGTAGTCTTATMTTTACTT 

AACCTTCTTTCAATTGTTAGAMTTTAGGCTATTTCCAGATTTTCAGTATTGTAAATAAT 

GCTGTGATGACCMTTTTGTGMTAMATGTTTrTATGTATTTCAGATTATTCCCTTAGG 
ATAGTCTCTCAGTGCCMGTTGTCAAAMCATCTCTATTTTGCTTATCTTCCTGCTCTCT

TGCTGCCTTAGGGGGTAGTAAACTGAAACATAAAGTAAACATGCATACAAATAAAAAACA 

TAAMCAAAAATAAGCAACCTGATGGTMTAGGTGAMGTGGTAACCTGTTTTAACTTTG 

AATTCTTGCCCGGCGCGGTGGGCTCACGCCCTGTAATCCCAGCA 

Sequence 1830 

CGCGTCCGGTAAACCAGCCGGAGCGGCGCGGNAGCGGCAGGACCGCCGTGGCGCCTAGAG 

TAGCAGACCCGGGGGGAGCGCGGGGCGACGCTGGCTGCAGGGACCCGGTGACAGCGTGAG 

AGGTTTTGACAAGCTTGCATCATGCGTGAGTATAAGCTAGTCGTTCTTGGCTCAGGAGGC 

GTCTGGAAAGNCTGCTTTGACTGTACCMTTTTGTTCAAGGMTTTTTGTAGAAAAATAC 

NGATCCTACCGATAGGAAGATTCTTATAGAAAGCAAGTTTGAAAGTAAGATGCACAACCA 

GTGTATGCTTTGAAAMTCTTTGGATACCTGCCAGGNAACCGGGAGCCAATTTACANCCA 

ATGGAGGGGGATTTATMCATGNAAAAAAATGGGACAMGGGATTTTGCATTTAGTTTAA 

TTNCAATCACCAGACACAAGTTCCCACCAATTTTAAMCGAATTTTACCAAAGAACCCTG 

GAGGAGNAACCAAGAANTTCTTTNNGAAGGTTTAAAAAGGAACCACCTTGAATTGGAATG 

GTTTCCCAAATG 

Sequence 1831 

CCAATTATAGACTATATAGGGGGAAGAGCACTGGATTTGGAGTCAAGAAACCTGGACACT 

TGGCTCCACACTTCCTTAGCTGGGTAACTTTGGGCAAACCGCTTGGTCTCTCAAGCCTAA 

GGTTCTTCAGCTATAAAATGGGAATAATACTTCACTAACTACCTCACAGAGTTGTGGTAA 

GAATATAATCAGATAACTGGATAAAAACACTATATAAACTGGAAAGCGCCGTACAAATGT 

GAGAGATCAGTTTTATTATCAMTCACTGTTTTCCACTGCCTCTTGAATCGGCTTTATTC 

TAACCAACCATTACATCTTTCTCATCTTTTGGAGTATGGGTAATTGAGGCTTGGGTGTGT 

CATCAGGGACTGGGAAGTTATTTCAGCTCCCATGTAAAAGGTGGGAGAGGTGGTTTGTGG 

GNGCAG 

Sequence 1832 

GNGTCGACCNCGCGTCCGCTATTTACTACCTCCTTATGAGGAAGTGGTGAACCGACCTCC 

AACTCCTCCCCCACCATACAGTGCCTTCCAGCTACAGCAGCAGCAGCTGCTGCCTCCACA 

GTGTGGCCCTGCAGGTGGCAGTCCCCCGGGCATCGATCCCACCAGGGGATCCCAGGGGGC 

ACAGAGCAGCCCCTTGTCTGAGCCCAGCAGAAGCAGCACAAGACCCCCAAGCATCGCTGA 

CCCTGATCCCTCTGACCTACCAGTTGACCGAGCAGCCACCAAAGCCCCAGGGATGGAGCC 

CAGTGGCTCTGTGGCTGGCCTGGGGGAGCTGGACCCGGGGGCCTTCCTGGACAAAGATGC 

AGAAATGTAGGGAGGAGCTGCT 

Sequence 1833 

GCCNCGCGTCCGTGAAACGCAAAAGAAGGAGCTCGGAATATAAGAACGTCAGAACGAGTG 

ACACTAATAGTGGATAACACTAGATTTGTTGTAGACCCATCCAI I I I I ACTGCACAGCCA 

MTACAATGTTGGGCAGGATGTTTGGATCTGGCCGAGAACATAACTTTACACGACCCAAT 

GAGAMGGAGAGTATGAGGTGGCAGAGGGMTTGGTTCCACTGTGTTTCGAGCGATTCTG 

GATTACTATAAAACAGGAATAATCCGTTGTCCTGATGGCATATCTATTCCTGAACTGAGA 

GAAGCATGTGACTATCTTTGTATCTCTTTTGAATATAGCACTATTAAATGTAGAGATCTC 

AGTGCCCTMTGCATGAGTTATCAMTGATGGTGCTCGTAGACMTTTGMTTTTATCTG 

GAAGAAATGATCCTCCCTCTCATGGTAGCTAGTGCCCAGAGTGGGGAACGG 

Sequence 1834 

CCNCGCGTCCGCTTTAACCACCATAAGAAATCAGAAAACGCAAGATAAAGTTCAGCACAC 

AGTATGTATGGATTGCAGTAGCTACAGTACATACTGTTATCGCTGTGATGATTTTGTGGT 

TAATGACACCAAGCTGGGACTGGTACAGAAAGTCAGAGAACACTTACAGAACTTGGAAAA 

CTCAGCTTTCACAGCTGACAGGCATMGAAMGAMACTTTTGGAAAACTCAACACTAAA 

CAGCAAGTTATTAAAAGTAAATGGAAGCACCACTGCCATTTGTGCCACAGGCCTTCGGAA 

TTTGGGGAACACATGTTTCATGAATGCCATCCTTCAGTCACTCAGTAACATTGAGCAGTT 

TTGGCTGTTATTTCAAAGAACTGCCCCGCCGTGGAGTTAAGGAATGGGAAAACAGCAGGA 

AGGCGGACATACCACACCAGGAGCCAAGGGGATAACAATGTGTCTTTGGTAGAAAG 

Sequence 1835 
TTACTGGGCACCCANCTCCATGTGCANGACTTTTCCCAACACAGCCTTGGCCAGTCAGAT
GGTGTGNCAGGGCCNNAGGTTTNCGTANCCTCTTGGGTGATAGAMGGGGCCCAGGCCCT 
GGGCTGGGGCTCATAANGGACTCAAANGAGGCACCTTGCCC 
Sequence 1836 

TCGCCCGGGGGCCATGGCAGCAGCGGCTACTGCAGCCGAGGGGGTCCCCAGTCGGGGGCC 

TCCCGGGGAAGTCATTCATCTGAATGTGGGAGGCAAGAGATTCAGTACCTCTCGCCAGAC 

TCTCACCTGGATCCCAGACTCCTTCTTCTCCAGTCTTCTGAGCGGACGCATCTCGACGCT 

GAAAGATGAGACCGGAGCAATCTTCATCGACAGGGACCCTACAGTCTTCGCCCCCATCCT 

CAACTTCCTGCGCACCAAAGAGTTGGATCCCAGGGGTGTCCACGGTTCCAGCCTCCTCCA 

TGAAGCCCAGTTCTATGGGCTCACTCCTCTGGTTCGTCGCCTGCAGCTTCGAGAGGAGTT 

GGATCGATCTTCTTGTGGAAACGTCCTCTTCAATGGTTACCTGCCGCCACCAGTGTTCCC 

AGTGAAGCGGCGGAACCGGCACAGCCTAGTGGGGCCTCAGCAGCTAGGAGGACGGNCAGC 

CCCTGTCCGACGGAGCAACACGATGCCCCC 

Sequence 1837 

CGCGTCCGTTCTA GATCGC GAGCGGCCGCCC I M I I I I I I I I U I ATCTTTCTGTTTTTC 

CACTAAAAGCTCCGTTTTTCCATCTTCCCATTACTCTCCCTTTCCTGTGNACACTCCTGA 

AGACAGGCATCCTCATAAGGTGTTCTGMTTAACTTTAGGGNGTTCTCCAGGTACTTTGC

ATCTTTTTATATTTCTTGTAAATTGNTAATTTCTAAGCTCCATGATTAAAGAGAATTCAC 

CACTAAAAAAAAAAAAAAAGG 

Sequence 1838 

CGCGTCCGGGCTGGCGAGCGCCGNCGCCGGCGGAGACCGACNCTNGGCCAGAGCCNGCCC 

GCGGCGCCCGGGCCTGGCCGGCTGCTTCCCGCCTCAGCNNGGCGCCCCNGCCTCCGTGCG 

CCGCAGACTTTGCCTAGGCGGGCNGAAGCTGAACAAGAGGTCCCTCGGCCCTGCACGGTC 

CGGCCGCGGTCCGGAGCCGANGCGCATGAGATTCCCCAGGAAGCCATCACACTCCTTCCC 

ACTGTGGTTTNGGGAGCATGAAGGCGTTG 

Sequence 1839 

GCCCCGCGTCCGTTTTATTTTGCACTTTTATC 

TAAMTACACTCMGTGACTTGGACTTAGATGCTTATCCTTACGTCCTTGGTACCTTTTT 

TGTATTAACAAACACTGCAATTTATAGATTACATTTGTAGGAAGTTATGCTTTTTTCTGG

in n GTTT TACTTTCAACCTAGGTTATMGACTGTTATTCTATAGCTCCAACTTAAGGT 

GCCTTTTTAATTCCCTACAGTTTTATGGGTGTTATCAGTGCTGGAGAATCATGTAGTTAA

TCCCAAAGTGCTGGGATTACAGGCGTGAGCCACCGCGCCCAGCCTACATTCACTTCTAAA 

GTCTATGTMTGGTGGTCATTTTTTCCCTTTTAGMTACATTAMTGGTTGATTTGGGGA

GGAAAACTTATTCTGMTATTMCGGGTGGTGAAMGGGGACAGTTTTTACCCTTAAAGT 

GCAAAAGTGGAACATACAAAATMGACTAMTTTTTNAGAGGTAACTCAAGTAATTTC 

Sequence 1840 

GTCCGGCCAGCTGATGCCGGGAGCTAACTACCGCGCCGGGGCCGGGGCCGGGGCCGGGGC 

CCGACGTCCCCGCGGGGCCCGGGACCGNGAGGAGGACGGCGGGGGCCTGGAGCCCGCGGC 

CGTGGCCCGCGACCTATTGAGGGGCACATCTAACATGTCATTTGAGGAGCTGTTGGAATT 

GCAGAGCCAAGTGGGGACTAANACNTACAAACAATTGGTAGCTGGAAATAGTCCTAAGAA 

ACAAGCTTCTAGACCACCTATCCAAAATGCATGTGTTGCANATAAGCACAGGCCTCTGGA 

MTGTCANNCMGATCCGAGTNCCATTTTTACGTCAGGTTGTTCCCATTAGTAAAAAGGT 

AGC 

Sequence 1841 

GCACCCGCTGNGAGAGGCGGTAGCGGCGGCGGCGGCGGTGGTATCGGCGGCAGCTGTGAG 

GGGGTTCCGGGAAGATGGTGCTGATCAAGGAATNCCGGGTGGTTTTGCCATGTTCTGTTC 

AGGAGTATCAGGTAGGGCNAGCTTTACTCTGTTTGCAAGAAGCTTTGTAATGAATTGAGA 

CTGGGTGGGTGGAGAAGGAATTGAAGNTCTTAAAGAATGAACCTTATGAGAAGGATGGAG 

AAAAGGGACAGTTTACGCACAAAATCTATCACCTAAAGAGCAAAGTTGCCTGCATTCTGT 

GAGGATGATTGCTCCCGAGGGCTCCCTTGGTGTTTCATGAGAAAAGCCCTGNAAATGCCG 

TACCCCTACTGTAGNAACAATTNTNAACCGAAATGGAATATATTGAAANGAATGATTTTC 
CTTCAATTNAAAANTCCGAAAACAATNGGCANCCAAAACCCATNACTTTGGGGAACCATN
TAGGAAAAMNNGTACCNATNGGGTTTTTAGGATTCCCAAAACCACNATTGGGAAAAAAC 
CTTG 

Sequence 1842 

CCCGCGTCCGGCGGAGAAGACTCAAAAGAGTGTGAAGATTGCTCCTGGAGCAGTTGTATG 

TGTAGAAAGTGAAATCAGAGGAGATGTAACTATCGGACCTCGGACAGTGATCCACCCTAA 

AGCAAGAATTATTGCGGAAGCCGGGCCAATAGTGATTGGCGAAGGGAACCTAATAGAAGA 

ACAGGCCCTTATCATAAATTGCTTACCCAGATAATATCACTCCTGACACTTGAAGATCCA 

GTAACCAANMCCCTATGATTCATTGGCACCAATAMTGTTGTTTGGAAGTTGGGCTGGT 

TTATTCCCAAAGGCCCATTGAAAGATGGGGGAGGATAAAATAAATGGTCATTTGAAATCA 

MAAGTCMTTATTGTTAGGCCAGAAAATGGTAAATATTTGGACCAAGTTGGCTTGCATT 

CATTTGGGNGGGCTTTGGTTGCMCCTTAMNTACATTTTGTAAGTTCATCCCCTGGAGA 

ATACCNGGTGGAATCCTTAAT 

Sequence 1843 

CGACCNCGCGTCCGGGGGATCTGTGCCTGGCATGGGGACGAGTTCTGGCCTCCTTAGGGT 

ACGGGGAGAGCTTGGACTTTGGTCCTGACGTGGTGGACGACACACCTTCGAAGAGTGGAC 

GTTACCTCAGTTGTCTGTTGTTAGAGTTTAATCGATCACTCCTCTGTTTTGTTGTGTTCT 

TTCCCAGAAATMCTTTACCAMGGAAAGCTATTTTGCGAACTATCTTCTCCAGCGGAGA 

TGGCCMTGTGCTTTGTMCAGAGCCAGACTGGTTTCCTATCTCCCAGGATTTTGCTCTT 

TAGTTAAMGGGTTGTCMTCCCAMGCCTTTTCGACTGCAGGATCATCAGGTTCGGATG 

AGTCTCATGTGGCTGCTGCACCTCCAGATATATGCTCTCGAACAGTGTGGCCTGATGAAA 

CTATGGGACCCTTTGGAC 

Sequence 1844 

GGGACAGAGCCCCCGATCCGCCAGCACCACCTGAGGATTNNGAAACCGCCCCAGCGATGG 

AAGAGGGNCAGGAGCTGGAGAGGAAAGCAATAGANGAACTGCTTAAGGAGGCAAAACGTG 

GGAAAACTAGAGCTGAAACAATGGGACCCATGGGTTGCTTATTACAGGGACAAGATACAA 

ACTAAAATCAGCCAAAATAAGACACAAAGATTAAAGCCA 

Sequence 1845 

CGTNCGGGACCCGNCAACATGGGCCGCGTTCGCACCAAAACCCGTGAAGAAGGCGGCCCG 

GGTCATCATAGAAAAGTACTACACGCGCCTGGGCAACGACTTNCACACGAACAAGCGCGT 

GTGCGAGGAGATCGCCATTATCCCCAGCAAAAAGCTCCNCAACAAGATAGCAGGTTTATG 

TCACGCATCTTGATGAAGCGAATTTCAGAGAAGGCCCANTTAAGGAGGTATCTTCCATCA 

AAGCTGCAGGNAGGAGTGANGAGAGNAAAGGAGTAGACAATTATGTTTCCTGAAGGTCTC 

AGCCTTGGGATCAAGNGAGAATTATTNGAAAGNTAGATCCCTGNACAACTAAAGGAAATG

CTTGAAGNCTTTTTGGACCTTCGNCAGGTTCTTGTCCAACCCTTTCAAGGTTCACTTCAG 

GCCT^CAAGTTGGGGATTGAAATTTTTCAAAAACGCCNTCGGGGGMCCCT 

A I I 1 1 1 1 I I CTTGNTAGNTGCCTTGTATTAATTTTT 

Sequence 1846 

GTCGACCCCGCGTCCGCAGCCTGGCCTGTGAGACCCTCGTGGACAACAACCTGCGGGTCA 

CCAACTGGMCCGCAAGCTGGGCTGCAAGTGCCAGTACAAGCACATTGTGGACTGGTGTG 

GCTGGTCCCCCAACGACTTCAAGCCACAGGACTTCCTCCGGCTGCAGCAAGTCTCCAGAC 

CCACCTTCTTCGCCCGGAAGTTCGAGTCGACTGTGAACCAGGAGGTGCTGGAAATCCTGG 

ACTTCCACCTGTATGGCAGCTACCCCCCCGGCACGCCAGCCCTCAAGGCCTACTGGGAGA 

ACACCTACGACGCGGCTGATGGCCCCAGTGGGCTCAGTGATGTCATGCTCACTGCTTACA 

CAGCCTTCGCCCGCCTCAAGCCTGCACCATGCCGCCACTGCTGCACCCCCAATGGGCACC 

CCACTCTGCAGGTTTGAGCCCAGGGGCTTTGCCGTCCAAGCGTGCACCTGTATTTCTATG 

ACGACCATTTC 

Sequence 1847 

TCCGCAAGAGTTATGCTTAAGACCAGCCAGCCTTGATAGTGGCAGAACATCCACTAGCAA 

TAGCAATAATAATGCTTCACTACATGAAGTCAAAGCAGGTGCAGTTAATAACCAAAGCAG 

GCCACAAAGCCACAGCAGTGGAGAATTTAGCCTGCTTCATGACCATGAGGCTTGGTCCAG 
CAGTGGTAGCAGTCCAATCCAGTACTTGAAAAGACAGACCAGATCAAGCCCAGTGCTCCA

GCACAAAATATCTGAAACACTGGAGAGTCGACATCACAAGATCAAAACTGGTTCCCCTGG 

MGTGAMGTTGTTACTCTACMCAGTTTTTGGAAGAAAGCAATAAGCTTACCTCAAGTA 

CAGATAMGTCCTCAAGTCAAGAGAATCTTTTAGATGAAGTAATGAAAAGTTTGGCTGNC 

TCTTCTGACTTTTTGGGAAAAG 

Sequence 1848 

GCGTCCGCGCCGGGCCGCTCTAGCCGGTGAGGCCGGCGGGCTCTCTGTGGCTGCGGCTGG 

GAAACCGCGCGGAGGAGGTGCCCGGCCGGGGACCAGCCCTGGTCCAGCGCCTCCCTCTCT 

CAGCATGGACGAGGAGAGCCTGGAGTCGGCCTTGCAGACCTACCGTGCGCAGCTGCAGCA 

GGTGGAGCTGGCCTTGGGCGCCGGCCTGGATTCGTCTGAGCAGGCTGACCTGCGCCAGCT 

GCAGGGGGACCTGAAGGAGCTCATCGAGCTCACCGAGGCCAGCCTGGTGTCTGTCAGGAA 

GAGCAGGTTGTTGGCCGCGCTGGACGAAGAGCGCCCGGGCCCGCCAGGAAGATGCTGA 

Sequence 1849 

AGTCACCACGCGTCCGGGTTGCACTCTTCCTATAGCCCAGAGGGCGAGAGGGCCTGTGGC 

CTGGGGGAAGGAGGACGAGGTTCTGCCTGGATCCCAGCAGTAGGACGCTGTGCCATTTGG 

GAACAAAGGAATAGTCTGCCTGGAATCCCTGCAGATCTTGGGGCCGGAGGCCAGTCCAAC 

CCTTGGAGCAGGAAGAAACGCAAAGTTGTCAAGAACCAAGTCGAGCTGCCTCAGAGCCGG 

CCCGCAGTAGCTGCAGACTCCGCCCGCGACGTGTGCGCGCTTCTCTGGGCCAGAGCGAGC 

CTGTTTTGTGCTCGGGTTAAGAGATTTGTCCCAGCTATACCATGGGCCCGCACTCGGNAA 

AGCTGGCTTGCGTGGCCGCTGGTGTGGTTATCGGGGCTGGTGCCTGCTACTGTGTATACA 

GACTGCTTGGGGAAG 

Sequence 1850 

TCGACCNCGCGTCCGCTCAGGAACCTTNGAGAAGATNAGNNCCCCACTTAGATTNTTAAG 

GAGTAAMAGGGCTGAGTTATGCCTTTMGNGCTGTCMGAATTCACTTGGGTTTGGGAC 

ATTTGCTGGTGTAATGCTAGATGCCCACAGCANCATAATATTGNNCTTTGTCAAAGGTNG 

GTAAATNCTNTGNTTNTCANCANCCCTTTCCCCA 

Sequence 1851 

AGTCACCACGCGTCCGCGGCTGGTGGTGGGCTCGGGCCGCTCGCCTTGCCCGTCTTCGCT 

TCCGGAGGTCGCTACTGCCGCCTCAGCGGCCCCGGAGCGGGGGCGCCCGGGGGTCCTTCG 

CCCCCGGCCAGGGTCCCCGCGCCGGGGCTTCGCCGCCCCCAGTGTCCGAGCTGGATCGTG 

CGGACGCCTGGCTCCTCCGAAAAGCGCACGAGACAGCCTTCCTCTCCTGGTTCCGCAATG 

GCCTCCTGGCATCGGGCATCGGGGTCATCTCCTTCATGCAGAGTGACATGGGTCGGGAAG 

CAGCATATGACCATCCCCCGACCTTGGCCCTGTTGCTACCCTGCCTCTNCCGCAGGCTTC 

TTNCTGCTGGGCGGCCTGTGCGTGGTGTGGGGCAAGCGCCTTGTACCCGTGGGCCTGGCG 

GCGCTTNGAGGACCCATGCAGCTGACCTGGGGGGCCGGCCTGGG 

Sequence 1852 

GCGTCCGCTCGCTGCAGCCCCGCCTGGGCCACGGCACCCTCGAGCGCCAGCCCCGCGCCC 

CACCCGGGAGCAGCGAGCCACCGGCGCGCTCCCCAGGAGCCCCTGCAGGCGCCGGCCCTG 

GTCCAGCGCCTCCCTCTCTCAGCATGGACGAGGAGAGCCTGGAGTCGGCCTTGCAGACCT 

ACCGTGCGCAGCTGCAGCAGGTGGAGCTGCCTTGGGCGCCGGCCTGGATTCGTCTGAGCA 

GGCTGACCTGCGCCAGCTGCAGGGGGACCTGAAGGAGCTCATCGAGCTCACCGAGGCCAG 

CCTGGTGTCTGTCAGGAAGAGCAGGTTGTTGGCCGCGCTGGACGAAGAGCGCCCGGGCCG 

CCAGGAAGATGCTGAGTACCAGGCTTTCCGGGAGGCCATCACTGAGGCGGTGGAGGCACC 

AGCAGCGGCCCGTG 

Sequence 1853 

GCGCCCCGCGTCCGGAAATTGACCCCTAGAGAAAATCCCATTAACTTGTTAAATTAGTGG 

MTTMCMCAMTAAAGCATGTTTGAGACCTGGCAAAAATTCCTCTGGTAGTATTTATA 

AATAGAGCTGCATGCCTCTAGTATGAAAACCGTATCAGTTGCAAGTGCCACTTCTACAAG 

TTACTCAGTTTACTCTTTGTATCAGTAACTTTAAAGGTTGGATGATCCTTGCTGGTTAAA 

GCTAAATCTCAACCTAGCAACTAAATGAAAATATTTAGAATCATCAGAATCTGAACAGAC 

TAAAATTATCAGCGATAAGCAGAATCAAGCAGGGTATAAGTTTTATCTCAATTATTTGAA 
ATTGACTGGAGTTTTCTTAMGTGTAMGCTGAAATTTGCTMCCATGTTTTTGATGAAC
CCACAGTGCAGCATTGGGTTGGGGTTTTAGATTTGAATGACTCTCTGCTATAATTATCAT 
GACTTTGAAA 
Sequence 1854 

TGTCGACCCCGCGTCCGAACCTTTGGGAGACTCCAAGACAGCATGCTCCGAGGTCGGCGG 

GGGTCTGGGTGGCCATGGAGGAGCCCCCTGTGCGAGAAGAGGAAGAGGAGGAGGGAGAGG 

AGGACNAGGAGAGGGACGAGGTTGGGCCCGAGGGGGCGCTGGGCAAGAGCCCCTTCCAGC 

TGACCGCCGAGGACGTGTATGACATCTCCTACCTGTTGGGCCGNGAGCTTATGGCCCTGG 

GCAGCGACCCCCGGGTGACGCAGCTGCAGTTCAAAGTCGTCCGCGTCCTGGAGATGCTGG 

AGGCGCTGGTGAATGAGGGCAGCCTGGCGCTGGAGGAGCTGAAGATGGATAGGGACCACC 

TNANGAAGGAGGTGGAGGGGCTGCGGAGACAGAGCCCTCCGGCCAGCGGGGAGGTTGAAC 

CTGGGCCCAAACAAAATGGTGGTT 

Sequence 1855 

TGTCTGCCGAACAACTGAAAACTATGCAGGTTATATCATTCCACCAGCACCACCAAGACC

TGATTTTTTGATGCTTCAAGGGGAGAAAACTACATGAAGCTTGGNGAAGNGAGAAGGGTC 

AAGTGACCGAANNGCAAGNAATTTCACAAANAGAATGAAACNAGGCAACTGGAAAGCTCG

AAATANTTTTGGCAATTATTTCAGAGGANAGCACCAGATTTGCCGTATTGCATNGGAANT 

TGNTACCCNGTAGTTCCNTGTTGGANATGTCAACANTCCCCTANT^nTTTGNACGAAACT 

MGGMTTNTAGAAATTTTTGCMTNGTTCCTTTACTTGGGGGAAAATT^ 

AAAGGAATCTTTNAGAGNTTGNNTNGCCGTAANGGCAAGAGAGAGNCNATNNTNGAAACT 

NAGGAAGCGGAAACCTTTTGNAMNMCTTTTCTTTTTTA 

Sequence 1856 

GAGTCGACCNCGCGTCCGGCGGCGAGCGGGACTGGCCATTGGAGTGCTCCGCTGCGGAGG 

GAGGGGACCCCGACTCGAGTAAGTTTGCGAGAGCACTACGCAGTCAGTCGGGGGCAGCAG 

CAAGATGCGAAGCGAGCCGTACAGATCCCGGGCTCTCCGAACGCAACTTCGCCCTGCTTG 

AGCGAGGCTGCGGTTTCCGAGGCCCTCTCCAGCCAAGGAAAAGCTACACAAAAAGCCTGG 

ATCACTCATNGAACCACCCCTGAAGCCAGTGAAGGCTCTCTCGCCTCGCCCTCTAGCGTT 

CGTNTTGGAGTAGCGCCACCCCGGCTTCCTGGGGACACAGGTTTGGCACCATGGGGCCCA 

CCAGCGNCCCGCTGGGCAAGGCCCACCGCAGGCTCGGTCTNTGACTACGNCAACTATGAT

ATTCATTCGTCCGGCATTACAACTACACGGGGAAAGCTGAATATCAG 

Sequence 1857 

CGTACGCCGAGCGCCGCTCCGGCTGCACCGCGCTCGCTCCGAGTTTCAGGCTCGTGCTAA 

GCTAGCGCCGTCGTCGTCTCCCTTCAGTCGCCATCATGATTATCTACCGGGACCTCATCA 

GCCACGATGAGATGTTCTCCGACATCTACAAGATCCGGGAGATCGCGGACGGGTTGTGCC 

TGGAGGTGGAGGGGAAGATGGTCAGTAGGACAGAAAGGTAACATTGATTGACTCGCTTCA 

ATTGGGNGGGAAATGCCTCCGCTGAAAGGCCCCCGAGGGCGAAAGGGTACCCGAAAGNCA 

CCANGTMTCACTTGGGTGTTCCGAATATTGGTTNATTGAAANCCATTCAACCCTTGCNA

GGGMAAACAAAGGTTTTTTCNAACAAAAAAAGAAAAGCCCTTACAANGGAAAAGGTTAC 

CATTCANANAAGAATTTAACATTTGGMAATTTCAAATTCTAAATAGGGGGGAAAAACCT 

TTTGGAAAGGAAACCAGGANGTACCCCAGGANAAAGGAAGGTAAAAAA 

Sequence 1858 

GNGTCGACCCCGCGTCCGGTGGAGGGTCAGGAGCTGCCCCGGATCCTCTCCATGTAGTTG 

CGAAGCTCCTCAGGGTCCTTCAGCCCCATGTCCTCACACACCCAGCGGATGTCCTCCTCG 

CCTGCCACAAGGATGGACTGCACAGCAGGGGCCCCTACAGGCTCCTCAGGTGACTGGGCT 

GGAGGGGCTGGCGCAAATGTCACAAACTCTACTCGCTTCCGCCGCCCCCCAGCCTCCTTT 

CGGGCCAGGGTGCTTGAGGAGCTGGTGGTGCCCCCAGGAGGGCCAGGGGCCAGGGTAGGG 

GCCTCCCCTCCTCCCCCACTCTCACAGGGGCAGCTCCCCTCCCCCTTGGGCGGGCCAGGG 

GACTGCCGGTCCAGCTGGCGGCTCAGTTCCTCCTGGTCAGTGCCCAGCCAGACCCAGTTG 

TGGGGCTGGGGGAGGTTGGGG 

Sequence 1859 

GCGCCCCGCGTCCGGCGGCGGCGGCGGCGGCAGCGGCAGCCAGAGGACTCCCAGCGGCTG 
GAGCAGAAGTGTTAGCCGGCCAGAGCTCCCAGACCCCTACCCACAGCCAGGCGGGACGCG

CACAGTCCCTCCACGCGGAAAGAAGTACCTTCGCCGGTCACCGGCTCCTGCAGGGTTGCA 

NATATATACAGAGCTTCATAATCAGCCCAAGACCACATAGAGCAAACATGAATGATATTT 

CCCAAAAGGCTGAGATTAAAGAAATGCTTGCTTCTGATGATGAGGANGATGTATCTTCTA 

AAGTNGAAAAGGCTTATGTTCCAAAA 

Sequence 1860 

CGACCCCGCGTCCGACCCACTGAAGACGTCTGCGTGAGAATAGAGACCACCGAGGCCGAC 

TCGCGGGCCGTTGCACCCACCGCCAAGGACAAAAGGAGCCCAGCGCTACTAGCTGCACCC 

GATTCCTCCCANTGCTTANCATGAAGAAGGCCGAAATGGGACGATTCAGTATTTCCCCGG 

ATGAAGACAGCAGCAGCNTACAGTTTCCAACAGCGACTTCAACTACTTCCTACCCTCACC 

AAGNCAAGCTTGCTCTGAAAAGCCCANTTATGCCANAATGTAGGATCCTGAAAACCAAGA 

AACI I I I I I ACTTG AAATT 

Sequence 1861 

GCGTACGGCCTGTTGGGCTGTCTGGGGGGTGGCCATTTAGGGATCGTGGGGACGGGGTCC 

ACCCCANNCAAGAAAGAACAGGCCCGTCCACAGGCCCGGCTCTGGGCCACAGTGCCCCGG 

AAGCAGGTGTGTCCAGAGTCANGCTTGAATGGCTCTCCCCACAACCACCCAGCNAGGCGC 

TGGTGCNTCCTTCTGCCTCATGGGACCAGTCCAGCTTNCAGCCGCTCTGGGCTCGAGGGT 

NGGTACTGACCACTTTCCTTCTTGAGNTGGGAGCATTCTCTTGGGGGAGNCTCTTCCAGT 

GGGCACCTGCCTGGGACNCTTGCCCACCGGTTTTCTTGTAAAAATCAGGAATACCGGTGG

CTTTTAGTAAAAGGCAAGACCANAGNCGCCTTNCGTTGGGCAGGGGAAAAGCCAAGCGTG 

CCGGNGGGNAAGGTCACTGGAAAAAGGTGGCTTGCCCTAAGGGGGAAGTTTGGGAAAATA 

GTCCCCCTGTTCCAAGMNTGCCTTTGMTTTTTAAAAACATTTTTGGCT 

Sequence 1862 

CAATNTACAACGCCATGTNCACCCANATGTTCCAGACTAAGCGCTGNTTTCGACTGGCCC 

CCACCTTNAGCAACCTGCTCCTGCAGCCNACCACCAACCCTCATACCTCGGNCAGCCACA 

GGCCTTGCGGTCAATGGGGATGTAGACAAGCCTTCAGAGCCAGCCTCTGAGGAGGGCTCT 

GAGTNGGAGGGGAGTGAGTCCAGTGGACGCTCCTGTCNGAATGAGCGCAGCATCCANGAG 

AAGCTTNAGGTCCTGATGGCCNAAGGNNTGCTNCCTTGCTGTGAAAGTCTTNCTGGACTG 
GCTTCNGACCAAC 

Sequence 1863 

NGGAGTCGACCCACGCGTCCGGCCGCCAGAACACAGGTGTCGTGAAAACTACCCCTAAAA 

GCCAAAATGGGAAAGGAAAAGACTCATATCAACATTGTTCGTCATTGGACACGTAGATTC 

GGGCAAGTCCACCACTACTGGCCATCTGATCTATAAATGCGGTGGCATCCGACAAAAGAA 

CCATTGAAAAATTTGAGAAGGAGGCTGCTGANATGGGAAAGGGCTCCTTCAAGNTATGCC 

TGGGTCTTGGATAAACTTGAAAGCNTGAGCCGTGAAACCGTTGGGTATCACCATCTGGAT 

ATTCTTCCTTTGTGGGGAMTTTTGGAGNACCAGGCAAGTTACCTATTGGTGGACTTATT 

CATTTGGATGGCCNCCMGGGACCACCAGGAGGACCTTTTATCAAAAAAAACANTGATTA 

CAGGNGGACATCCTNNAGGCCTGGACTGGTGGCTGGTCCTGNATTGGTTGCTGGCTGGGT

GGTTTGGATGGAATTTTGAAGGCCTGGGNTATNCTTCCCAAAAGAAAT 

Sequence 1864 

GCGTCCGATGGCGTGNTGTCTCACAGAAAGTTCTCCGCTCCCAGACATGGGTCCCTCGGC 

TTCCTGCCTCGGAAGCGCAGCAGCAGGCATCGTGGGAAGGTGAAGAGCTTCCCTAAGGAT 

GACCCATCCAAGCCGGTNCACCTCACAAGCCTTCCTGGGATACAAGGCTGGCATGACTCA 

CATCGTGCGGGAAGTCGACAGGCCGGGATCCAAGGTGAACAAGAAGGAGGTGGTGGAGGC 

CTGTGACCATTTGTAAGAGACACCACCCATGGTTGGGTTGTGNGGCAATTGGTGGGCCTA 

CCGTTGGGAMACCCCTCGAGGCNCTCCGGTACCTTCAAGACTTGTCTTTTGCTGGAGCA 

CAATCAGTTGMTGAMTGGCAAGAAGGGCGGTTTNTTATTAMGTMTTTGGCCATTAA 

AAATTCTAANGAAAGGAAAGGGCCTTTACCAAAGTTACCTGCAAGGAAAATGG 

Sequence 1865 

CCGGCCGGGCTGGGTCCCAGCACNTGACCCCAGCTGCACTGCTGTACAGGGCTGCNCGTC 
ACCCGAAGTCAGAAACGTGGCATCTCATCGGAAGAGGAGGAAGGAGAGGTAGACAGTGAA 
GTAGAGCTGACATCNAGCCAGAGGTTGGCCTCAGAGCCTGAACATGCGCCAGTCACTATC

TACCTTCAGCTCAGNAGAATCCATCAGATGGGGAGGAAGGCACAGCTAGTGAACCCTTCC 

CCCAGTGGCCACACCTGGAAGTTGGCTAGCACCAACACNTGATGAGTCCGGGCNCAGNAT 

GAAGCNGNGTTCTGATGAACATGTTGACTCCCAAGGGGCTCAAGAAAATCCCCACNTGGG 

ACNCCACTCTCCTTCAGGAGGGTCATACCCTGGGCCCTGGAACCCAGCTCCCTGGCCCA 

Sequence 1866 

TTGCGCCCCCGCCGGTGAGCGCGGGGAGCGCCGCAAGCCCAACGCCGGGGGCAGCCCCGN 

TCCGGTGCGCCGCCGGCCGGAGGCCTCGCCGGTGCAGAAAAAGGAGAAGAAGGACAAGGA 

GCGGGAAAACGAGAAGGAGAAGAGTGCCCTAGCCCGGGAGCGCAGCCTCMGAAGCGCCA 

GTCGCTGCCCGCCTCCCCACGTGCCCGCCTCTCTGCCAGCACCGCCTCTGAGCTCAGCCC 

CAAATCCAAGGCCAGGCCATCCTCTCCCTCCACATCCTGGCACAGGCCTGCCTCCCCCTG 

CCCCAGCCCAGGGCCAGGCCACACTCTGCCTCCAAAGCCACCGTCCCCCCGAGGCACCAC 

TGCATCCCCCMGGGGCGGGTTCGGAGGAAGGAGGAGGCAAAGGAGAGCCCCAGCGCCGC 

AGGGCCCGAGGACAAGAGCCAGAGCAAGCGCAGGGCCAGTAACGAGAAGGAGTCAGCAGG 

CCCAGCCTTACX)GGCACCTTTGGCGGNGCCTTGGCCAACCCAAGCCCGGCCCAAAAGGAG 

CMGCCCCCGNGGAGACCCCTTACAGACCTGGTTTCTTTGACTTAACCCCAAGCCCTTGT 

TCCCCGGTGGACCCTAGCAAANCAATGGGCCGGNACCACAAGANCGAGAAGAAGCCCCTT 

GGNTNTTGGCTTGAAAGCGGCGCCAGGNCCCGGGAACAACCGGAAGCCCCANGAGCAAGA 

ACCGANGCTTTANGCAAAAAAGGACAAAGCCAATTGCAAAAGGACAACTTGCACCGGAAG 

GCCCAGGC 

Sequence 1867 

CCCCGCGTCCGTTTMGAGTTGCATATTTTACTTTATTTTTATTAMTTAAAAG 

TCTGGCAGCGATTCCAGAACAGGGTAAGGAGGTTCCTCACAGGGGTCAGAGAAGAGCGGA 

GAAAGACAGACTGACGGAGACTGAGACACAGGAGAGAAAGGACAAGGTTAAGGGAGAACT 

GTATCTGATGAACACACACAGCCGGCTCCATGGCGGGTGACGGGGAGCTCACATCAGCCC 

AATTTCTCCTCCCCGGCACCCGAAGTTCAGCGGTGGAGCAGTATGTGGGGGCGGTTAGGA 

ATCAAGAGACCCTCCCTTCCCCACCCTAGGTCCTTTCTCGGCTTGGTCGTGGAGCACAGC 

ACATACCAGAAAAAGCCAAGGGCAATGGAGGGGCAGGGAAACCGGGAGTATATGTACACG 

GGGAGGGGAGAACAGAGCCTTGGAGGTCGGCCTCTGCCAGAAGGGAAGTGGCTCACACTT 

GCATTTGNMCACTTGGCCAGTGGGGGATGGGGGAMGGMTTGCCCCTTCCTTTTGG 

Sequence 1868 

CCCNCGCGTCCGCACACCCTTCCTGTACTCAGTCCTCAGTTTGCCTGGTGAGAGAGCAGC 

CTCCTCCCGTGTGCTCTGCCAGCTGGACCCAGACTGGCCATATTACCAGTGAGACCAAAA 

AGATGGAGGTGGGGAGGTAGCTCTGAGGTCTGGGAAACCATTCCAGCTCCTGCCAGTTTT 

MCTTGTGTTTMTTCCTGGCACAGTTGTCCTGGAMTGCCTTTTTCTCTTGCCTGGGAA 

CCACTAGAAGGGGATGTTGTCTGTGTTGGCCAGGGCCATGCAMTTCAACATCTTGTTTC 

TGCCCTTCCCCCGTGTAGCTGAGGCTAGGTGTTGGCATTACCCAGTGCTTGTTCTTCAGA 

GAGCAAAAGCACTGCTCGTCATGTCTGAAATTTAGTGAGTGAGCTCACCCACTAGGCTGG 

TGTTTCCTGCCCGTGGCTGCACATTGGAAGCACCGGGGCACTTTGAGAACTACAGATGCC 

TGGGTCCCAGAGCATCTAAGGTGCTCTAGGGTGTGTCCAGGACACAAGCCCTGGTTGAGG 

ACCACTGCTATATTGTATGGCCTCTTTTAAAMAGTTMTTTTACTTGGAMTGATTTCA 

MGCTACAGAAAAGTTGCAAGAATAAAAACTGTACAAATGAGGCTTAAATATTCTTTGGC 

CAAATCACCTATTAACATTTCGTTCCAAAAAAAAAAAAAA 

Sequence 1869 

GGGCAGCGCCTCCGACATGAAGGCTGAGCTGTCGCAACTTATTAGCGACCTGGGCGAGCT 

CAGCTTCGGCAACGACGTGCGCACCCTGCAGGCCGACTTGCGGGTGACGCGCCTGCTGTC 

AGGCGACAGCACGGGCAGCGAGAGCTCCATCGAGGGCGGGGGCCCTGACGCCACCTCCGC 

CACCGCCGGGGACTCGTCCCGCCAGGCCGACGGCGCCAGTGCAGACGAGCCCCACTCGGG 

CTGAGCTCCTCCGCGCGTCGCCGGCGCTCCACCGTGGCTACCCATCCGTGGTCCCGACAA 

CCTCCCTGTCCCTTGCCCGCCCCCAGGAAGGGGGAAATGGGGCATTTGGGGCCCAGACCT 

ACACTTGGAGCCCAGGTCCAAGCGTTCCCCGACCGCTTTCCCTACTNCCGGNCCCCGCTC 
CCGCCCCMGMCTTTTGGCTTTTTTGCGCGTGGGGATGCGGGGAGATTTGAGANGGGNA

AACCCCCGCCAGGAAGGAAGAGAAGAGGCACCCCTTTGGGAATGCCGGTGAAGGGAAGGT 

TGGCTGAAGTTCCTAAGTTTMGGCCTAAGGTGCCTTGGNCAGGTTTCCTGTTTGTGGGG 

GAMCTTGGGNCTTGAGGAGGANGGGGTTAMTTTCTTCTTCCAACCCCTGGGAAGCGGG 

CCTTGCCTTGGANNTGAAATTTAANTNAAAAAAAAAAAAA 

Sequence 1870 

TCTTCATCATGGGTGCCAGCATCCACCTGGTGGGTGACTCTGTCAACCACCGCCTGCTCT 

TCAGTGGCTACNAGCACCACCTGTCTGTCCGTGAGAACCCCATCATCAAGAATCTCAAGC 

CGGAGACGCTGATCGACTCCTTTGAGCTGCTCTACTATTATGATGAGTACCTGGGTCACT 

GCATGTGGTACATCCCCTTCTTCCTCATCCTCTTCATGTACTTCAGCGGCTGCTTTACTG 

CCTCTAAAGCTGAGAGCTTGATTCCAGGGCCTGCCCTGCTCCTGGTGGCACCCAGTGGCC 

TGTACTACTGGTACCTGGTCACCGAGGGCCAGATCTTCATCCTCTTCATCTTCACCTTCT 

TCGCCATGCTGGCCCTCGTCCTGCACCAGAAGCGCAAGCGCCTCTTCCTGGACAGCAACG 

GCCTCTTCCTCTTCTCCTCCTTCGCACTGACCCTCTTGCTTGTGGCGCTCTGGGTCGCCT 

GGCTGTGGAATGACCCTGTTCTCAAGAAGAAAGTACCCGGGTGTCATCTACGTCCTGAGC 
CCTG 

Sequence 1871 

CCGCGTCCGGTTTGTTCGTTATAGGTTTAATAAGTCTATTGAGGAAGACCTACTCCTGTG 

TGAATCTTTGCAMGTMTGCTACCGGTGMGAAATATTCAACTGNATCAACAGTTTTAT 

GCAGAMCATGAMTTGMTGGGAAAAATGTGTTGATGTTTGTAGTGATGCTTCTAGGGC 

AGTGGATGGGAAAATTGCCGAAGCTGTCACCTTAATAAAATATGTGGCTCCCGAAAGCAC 

CAGTAGTCACTGCCTATTATACAGACATGCACTGGCAGTTAAAATAATGCCTACATCTCT 

AAAAAATGTGCTAGACCAGGCAGTACAAATCATCAATTATATTAAAGCTCGACCACATCA

ATCCAGACTATTAAAMTTTTATGTGAGGAAATGGGTGCTCAGCACACAGCACTTCTTCT 

AAATACAGAGGTGAGGTGGCTTTCTCGAGGTAAAGTTCTTGTAAGACTTTTTGAACTTCG

TCGTGMCTTTTGGTTTTCATGGATTCTGGCTTTTCGACTATCTTGATTGGTTAACAA 

TTCATCTTGGCTGCTMGACTTGCATATCTTGCAGATATTTTTACTAAATTAAAATGAAA 

GTTMTTTGTCAATGCCAAGGNAAAAATGTGACCCGTTTTACNAGTATTTGATAAAATGT 

CGTCATTGGTAAGAAAATTGGAAT 

Sequence 1872 

TCGACCCACGCGTCCGCGGACGCGTGGGTGTGACTGGCACCCAATGCCATGCCTTTATGG 

TCACTTGGTAGTATAMGGCATGGCATTTGTTGTGACTGGCACCCAATGTTTGATTTTTT

TTTTMMCTATCCAATTAAAATTAAGGTCTGGGAGTGTTCTGTTTCCCATTCTTTAATA 

CTCACCTCCTCCCAGACTTTCTACACCTGTTGCACCTCAGGCAGAGGATGTTCTGGACCT 

CCCCCTCTTGGTCCCTACTAGAGACCTCTCAACAGATCTGTGGGCCCAGTCATTGGGTTT 

TATCAGTGCTTAATGTGMCTMGTTTTTTACTTCCACAGAATACAAGCCACTACCTTCT 

GACCTCCCCACCCCCCACCAACCCCCATCTTTTAATATGCTGTGGGGCATAGAACTCCGG 

MTGACCAGCATGATATTTTCAGAGTCTTGTCCCCGGGGTATTAGCACCTCTTTTTGAAC 

AGGGAATTGATTCAAGATTGGACATGGTCTCCTCTGATTATCAGGTACTGGGGCTGAGGG 

CATTAAAAATAGTAAGCCTCCCTCTCGTCCCCTGCCTCAAGAAATTGCCTCTTATTTATC

AACATCTTTTTCCTC 

Sequence 1873 

CCNCGCGTCCGGTGTTCCTCCCTGAGAATTAGTGGTCAGACATTGCAGAGGGCATCAGAA 

GCGGGTAGATGAAATAGCGAAAGGAAACAGGCTAGCAGACCAAAGAGCTAAGTCAGCAGT 

AAGAAGGCCCCAAGGTCCCAAAACACTTGAGGCCCCTCTGATTTGGGAGGGCTACATAAG 

GGAAATAAAGCCTCAGTATTCCCCTACAGAGATAGAATGGGCCACCTCTCGAGGTATACT 

TTTCAACCCTCAGGATGGTTACAATCAGAAGATTGCAAAGTACGCTTGCCAGCCTCCAGC 

CMTGGMGATTCTTAAAATCCTCCACTGAGCCTTTCACTTAGGAAAGCATAAGACATCA 

GTGCATCCAAAGATTGTTCTCAGGAGAAAATCTACTAAAAATGGTCAAATAGGTTGTTAA 

TACTCGTGAAACCCTCTTAAAAATAATCCCTTTAACAGATGACTTCTTCCCCACCACAAT 
CAAAGGA 
Sequence 1874



ACGCGTCCGATGACCAGGCTGCCCCGCCTCCTGGTTCTGCCCAAGTTCTCCCTGGAGACT 

GAAGTCGACCTCAGGAAGCCCCTAGAGAACCTGGGAATGACCGACATGTTCAGACAGTTT 

CAGGCTGACTTCACGAGTCTTTCAGACCAAGAGCCTCTCCACGTCGCGCAGGCGCTGCAG 

AAAGTGAAGATCGAGGTGAACGAGAGTGGCACGGTGGCCTCCTCATCCACAGCTGTCATA 

GTCTCAGCCCGCATGGCCCCCGAGGAGATCATCATGGACAGACCCTTCCTCTTTGTGGTC 

CGGCACAACCCCACAGGTGAGCCTGGAACCCATCACGTTCCACATCCTCCCACCCATTCT 

TTCTCTCAGGAACTAGTCCCGACAGATGCAGACATCCCTCTATCCCTGAGAGGGCTCTGG 

GCAGGGAACCCATAACCCTACCCTGCTTCCTGTCCCAAGAGGAGGC 

Sequence 1875 

AGTCGACCCACGCGTCCGCCCACGCGTCCGCCTTCTTCTGGGCACTGACTGCCCTTCTGG 

TCGCTTCAGCTGCTGCCTTCCAGGGTCTTCTGCTGCTGTTGCCGCCACCACCATCTGTAC 

CCACAGGGGAGTTAGGATCAGGCCTCCAGGTGGGAGCCCCAGGAGCAGAGGAAGAGGTGG 

AAGAGTCCTCACCACTGCAAGAGCCACCAAGCCAGGCAGCAGGCACCACCCCTGGTCCAG 

ACCCTAAGGCCTATCAGCTTCTATCAGCCCGCAGTGCCTGCCTGCTGGGCCTGTTGGCCG 

CCACCAACGCGCTGACCAATGGCGTGCTGCCTGCCGTGCAGAGCTTTTCCTGCTTACCCT 

ACGGGCGTCTGGCCTACCACCTGGCTGTGGTGCTGGGCAGTGCTGCCAATCCCCTGGCCT 

GCTTCCTGGCCATGGGTGTGCTGTGCAGGTACACAAGGACCCCCAGCCCCTGTGCGGGTG 

GAACTCA 

Sequence 1876 

TCGACCNCGCGTCCGGTCTTCGCAGGTGGCCCTCGGGCCCGCAGCCGCTGGGTAAGGGTG 

ATGCCTAGCCTGGCTTATTGCACCTTCCTT1TGGCGGTTGGCTTGGNGCGAATCTTCATC 

TTAGCACATTTCCCTCACCAGGTGCTGGCTGGCCTAATAACTGGCGCTGTCCTGGGCTGG 

CTGATGACTCCCCGAGTGCCTATGGAGCGGGAGCTAAGCTTCTATGGGTTGACTGCACTG 

GCCCTCATGCTAGGCACCAGCCTCATCTATTGGACCCTCTTTACACTGGGCCTGGATCTT 

TCTTGGTCCATCAGCCTAGCCTTCAAGTGGTGTGAGCGGCCTGAGTGGATACACGTGGAT 

AGCCGGCCCTTTGCCTCCCTGAGCCGTGACTCAGGGGCTGCCCTGGGCCTGGGCATTGCC 

TTGCACTCTCCCTGCTATGCCCAGGTGCCGTCGGGCACAGCTGGGGAA 

Sequence 1877 

ACCCCGCGTCCGCCCTTAAGAGACAATGATTGAGAAAGAGCCATGTGGCTTGGCTCTAGA 

MCGTCATTATCATTAGGACCATCAGATTTTAGATTAAGCTGCTATTGAATTAATAAAAT 

CCCAATGAAGCAGAGTTATAGGGATAGATTTATAGCTGGCAGAGTGGTATCAAAGGAGAA 

MACAGTGAAAAAGCCMTTTCACTGGTTCGTTCMTCCAGCTTGTTGCTAATATTAGTT

ACCCTTGTTTTAATGACAGAGAGTGGCTGGAATCTGTAGCTAGGGGAGGGGCAACACTGT 

TAGATGTGAGGAAAGGAAGTGCCAAAAATGCCTGGACAGATGGCTTGTCCCAAGGCCAGG 

ACACACACTTTAAMTCCAACATTCACCTAAGCAAGTAATTCTTAAAGATCTTACAGAAA 

CGCAGAGTCAATTCAGGTTTATAAAGGAAGGCTTNAGGGGAGAGAGGAAGGCCTGGGGGG 

CCTGGACGAAAGAGGCCTAGGACCTGAAGAGACTCCAGCGAGTCTTCGGGAAGC 

Sequence 1878 

AGTCGACCACGCGTCCGCAGCATCGTCCGAGACTTCCCAGACTCCCGGCCAGCCATCGAG 

GACCTCAAGTACTGCCTGGAGAGGACGGACCAGAGGCAGCAGCTGCTCGTGTCCCTCAAG 

GCTGCCCTGGAGACTCGGCTCCTGCATCCAGGCGTCAACACGTGTGACATCATCACCCTC 

TATATCTCTGCCATCAAGGCGCTGCGCGTGCTGGACCCTTCCATGGTCATCCTGGAGGTG 

GCCTGTGAGCCTATCCGCCGCTACCTGAGGACGCGGGAGGACACAGTGCGGCAGATTGTG 

GCTGGGCTGACGGGGGACTCGGACGGGACAGGGGACTGGGCTGTTGAAGCTGTCCAAGAC 

CGACCCGGCGAGCCTGGAGACAGGCCAGGACAGTGAGGGATGACTCAGGCGAGCCAGAGG 

ACCTGGGGTCCCGGACCCTGTGGATGCCCGATCAGGGGAAAGTCGAGCTCCAGCC 

Sequence 1879 

GTCCGCACMTTGAAAACTGGGAAMATAATCTCGTGGTTCGGTTTGTTAGTTTGAGACG 

CAGTCTCACTCTGTCGCCCAGGCTAGAGTACGGTGGCCCAATCTCAGCTCACTGCAACCT 

CCAACCTCCCAGGTTCAAGCAATTCTCTTGCCTCAGCCTCCCGGGGTAACTGGGGATTAC 
AGGCGCGCACCACCACGCCCAGCTMTTTTTGTGTTTTTAGTAGAGATGAGATTTTGCCA

TGTTGGCCAGGTTGGTCTTGAACTCCTGACCTCAAGTGATCCACCCACCTCGGCCTCCCA 

MGTGCTGGGATTACAGGTGTGAGCCACCGTATCTGGCTCTCATATTTTTTATATACAGA 

TTGMTATCCCTAATTTGAAMCCTGMATCTGAMTCTTCCAAAATCCAAMCTTTTTG 

AGCAGTTACATGATATTTGAATGAACTGCTCATAAGAA 

Sequence 1880 

GAGGAAACCAAAGTGCTCTGTATCCTCCAGTCTCCGCGCCTNCACCCAGCTCAGGAACCC 

GCGAACCCTCTCTTGACCACTATGAGCCTCCCGTCCAGCCGCGCGGCCCGTGNNCCGGGT 

CCTTCGGGCTCCTTGTTGCGCGCTGNTCGCGCTGCTGCTCCTGCTTGACGCCGCCGGGGG 

CCCCTCGCCAGCTGCTGGGTCCTGTCTCTGCTGTGCTGACAGAGCTGCGNTGCACNTTGT 

TMCGCGTTTACGCTGAGAGTAAAACCCCAAAACCGATTGGTAAACTGCAGGTTGTTTCC 

CCGCAGGCCTCGCAGTGCTCCAAGGTGGGAAAGATGGTANGCTCTCCCTTGAAAGAACCG 

GGTMGCANAGTTTATGTCTGNNACCCCGGGAANGCCCCCTTTTTTCTTAAAGGAAAAGG 

TGCATCCCAANNAAAAATTTTGGGACCAGNTGGGGAAAACCAAGGAAAAAAA 

Sequence 1881 

GCGTCCGCCCTGGCTCCTCCAGCAAGACCTCGTCTTTGCTTGTCCTGCTCAGATGCTGGT 

CATCCTGGGCATGTCCCCAGTGTGGACTCTGGACTGGGAAGGGGGCAGGCCCCTTTGGAC 

CTGCAGTTGGCCTCAGCAGAAGGCCTTGCCTTGTGTATGTGACTCCATATCCCGGGAGCA 

GTTGACCTTTGCCAAACACTTTACAGTTCTGGAGGAGGAGGTAACATAGATGCCTGGGCC 

TGATGGTGGGGCCATACCCATGTGTCGCCTCTCACTCTGGCAGCCTCAGAGGCCCCTTGC 

TGCTGGCTCCCATCTCCCTCCCATTTGCAGACCAGGAAGGAAGAGCAAGCTGTACAAAGG 

GAAGCAGAGCCTGGGGTGGGGTGTGAGCAGGGTGACCCCTCATCTGAAAGGCCCAAACCA 

GGGGGGAAGCACCAGCCTTAGTGCAGCCCCCTCTGACCCCACCTTAGAATGGAAAGCCTT 

CACCTGCAGCCCAGGCCTTCCTCCCCGG 

Sequence 1882 

AGTCGCCCCGCGTCCGGTGATTCCAGGGTGCAGMGGGATTCATATTCCCAGAACGCTTT 

MGTGTACACCTGCAGGATAAAGAGATACCGGTTACATTATTAAATGATTCTAGGGATTC

ACTGGGGGATATTTTTGTTGCTTTTACTTTCATGGTTAGAGCTACAMGAACMGTGATT

I I I I I I I I I I ICTCCCTTCCCCATTCAGAAACATTATACATTGGGCCATTTTTCTTTCTC 

CCAAAGAAGATTCATGGATAGTCAGACTGAACTGTGTGCAACAGGAAAAGTCAAAAGGGA 

AAAGGCAGCTGATGAGGTTCATGGTTACATGTTCTACATCATGCAGAGTAGCTTGAAATC 

TAGTCTGGAGAAAACTGGATCAAGATTCTAGCCCACTGGAGTTGCAAGGAATGAGAGGCA 

AAAATTCTAMGATTTGGGTTATATTTTCAACTTGGGGGACAGAGAGAAATGGAGAGCAG 

GAATTACAGTTCCAACAAACATCATGATAGTC 

Sequence 1883 

CCACGCGTCCGACTAGTTCT^ 

AATTCACTTTATTTTTCTTGNATAAAAACCCTATGTTGTAGNCACAGNTGGGGCCTGAGT 

CCGNTGCACGGAGACTCTGGTGTGGGTCTTGACGAGGTGGTCAAGAGNAACTCCTNGATA 

GGGAGACTTGGGTGAATACANTNTCCTTCCANAGGTCCGGGGNGTTCATGGTATGCTGTA 

NGGTCCTTAAAAAAATGGG 

Sequence 1884 

GTCCGAAAAAATGATAATGTGCATAAAATTCAACCCAGCTTTCAAAGTCCAGTCAAAATA 

TCAGAMTCATGAGGTCCAATGGATTTTGTTTAGCAAATACCGAAACAATAGTTATTGAC 

CACAGTATACCAAATGGAAGAGACCAGCACCTGGGCGTGGACCCAACAGAGCATTTATTT 

GAGAATGGCAGTGAGTTTCCCTCAGAGCTGGAGGACGGGGACGACCCAGCAGCCTACGTC 

ACCAACCTGTCATATTACCACCTGGTCCCCTTCGAGACAGACATTTGGGACTGAACCTCT 

CTATCAGGCCTCCCCCGCCTCAAGCTGTTCACTGCC 

Sequence 1885 

CGCGTCCGCACAACAAGACACTCCAATTGTGATTTGAGTTGAGGATCTCTGCCTGCCTTC 

CTGCCGTCCTTCCTTCTTCCCCGATCCATGCTACTTTTAGGGGCTGCGGAGAGCAGCAGC 

AGAGCTGAGTAATGATACAGGGCACCACGGAGAGAAAGTAGAACCATTTCACTCCTGGGA 
AGATGGGGTATTTCCCACTTCCAGCAACGAAATAACAAATGAAAAGTTGCATACTTATTG

ATGTATTGTATGAGCCAGTAGCATTTTATGTACAAAACAGAAGTCAATGCAACAGTATGT 

ATGTGTGCCTGTGTGTGTATAAAAATAACCATTGAAGCTAACTTGCTAATGTACTTAGGC 

MGCCACTTCCCATCTCTGGGCCTCGTCTTTCCTCCCTCTAAAATCAAAGAGCTGAATTA 

TGTGATCCTTGAGGTCTCTTCCACTTATAATACCAACTGTCTTGTCAGACTGGCAAATTA 

TATTGGCCTCTCCTTATGTGGTGGGTTTTTTTGGGAGGGNCATAGTTNCTTATACACAGG 

ACACCTGCATNATCNAAGGGC I I I I I I I I CTAAAAAAAAAAAAAATG 

Sequence 1886 

CGTCCGCTCCTGAGTAGCTGGGATGACAGGCGTGCACCTGGCAGCTTTTTCAAAGTGTTG 

ATGGTAATCTGAGGCAATCTAAGGGAGTCATTTTTTAAGTGACTTTATACAGAAAGATTG 

GTAAGAGCCAAGGGGTAGAAGTGGCATAAATGTCTAAAGCAGGGAAGTGACAGGGACTTT 

CATTGTTCTTGGCTGAGGAGAAGCGGGAGTGGCTGATGGAAGCACCTAAATGATGCCTTT 

GTCTGTGGGAAGGCAAATGATGCCCCAGAGCTCTAACCAAAGGTTTTGCAGCCGCCGAAA 

AACAGGAAAGTTGGGAAGCGGGGGTAGGACTACACTGAATCATTAACAGTGCTGTAAACT 

ACCCATGTGGCCATTAACMTGGACCTTTGGGGGAGTTTTCCTAAACGATCACTCTGGA 

Sequence 1887 

CGTCAAACACCCGCAGGCTGTCGATTTCATCATCAAAGAAATCAAGACGATAAGGGCAGC 

TCACTCCCCATCGGGAAGAGATCCAGCAACGCGCCGCGCGTGGCGTATTCGCCGTGCTCC 

ATCACCTGGTCAACATGGCGATAACCGGCGCTGTCCAGTTGGGTTCGTAATGCATCTCGT 

GACAGGCGCTGACCTTTTTTCATCACCAGCGCATGACCGTGGAGAAAACTGTGTGGGCAA 

ACGCGCTGCATAAGCGTATTCACCGGAACAATCAGTACGCCACGCTGCATCGTCGGTAGC 

TGGTAAAGGGTGGAAAGGCGCGAGGAGATAATGTCCTGATGAGGCGAAAAACTGTCGTAG 

GGAAGAGTTTCCCAGTCCGCCAGATTCATCACCATTTGATCGGTAAACTGGCTGATTTCA 

TCATGCAAACGCAGAGCATTTTGCATATCTGGTGCAATGAGTACCACCGGACCGGCGTGA 

CGTTCGGCA 

Sequence 1888 

CGCCTCCGTTTATTTTTATGCCCCTTTTTGTGGATAA^ 

AATTTTMGTGACTTTCTCCAGGTCATGAAGATTCAATGGGTAGAATTGAATCAGAATTG 

AAATGTTCCAGATTCATATTCTTGTGTGTGTTTGATAAAATTCATGGCTTCCAAAGTAAC 

TGAA CACTTCCTTTGGGCCCTTGGAGGGAAMTCCATATTTTT^ 

TTTTAGACATCTGGCAGTTCTTTGMCTTAAACATATTCTCATGGCCATAGTTCCAAATT 

MGCCCGACGCAGTTGCTAAAAATCTTGCTGCACTGTTGAATACTAATAATGCAACATTT

ATTGGATGTTTTTGCATTTTGATGACCTTCATGATTCATTTATMGTCTTTGTAAGTGCT 

TAAGTGACCCCCTCACTAGTGAAMTMTAAATGTTCTATATCATTTATTATTATTTGTG 

TATTCTCTACATGATATA i I I I I I I 

Sequence 1889 

CCGCGTCCGGGAGGATGGACGTACTGGTGTCTGAGTGCTCCGCGCGGCTGCTGCAGCAGG 

MGMGAGATTAMTCTCTGACTGCTGAMTTGACCCGGTTGAAAAACTGTGGCTGTTTA 

GGAGCTTCTCCAMTTTGGAGCAGTTACAAGAAGAAAATTTAAAATAAAGTATCGACTGA 

ATATTCTTCGAAAGAGTCTTCAGGCAGAAAGGAACAAACCAACTAAAAATATGATTAACA 

TTATTAGCCGCCTACAAGAGGTCTTTGGTCATGCMTTAAGGCTGCATATCCAGATTTGG 

AAAATCCTCCTCTGCTAGTGACACCAAGTCAGCAGGCCAAGTTTGGGGACTATCAGTGTA 

ATAGTGCTATGGGTATTTCTCAGATGCTCAAAACCAAGGAACAGAAAGTTAATCCAAGAG 

AAATTGCTGAAAACATTACCAAACACCTCCCAGACAATGAATGTATTGAAAAAGTTGAAA 

TTGCTGGTCCTGGTTTTATTAATGTCCACTTAAGAAAGGA 

Sequence 1890 

CGCCCCGCGTCCGCTAATTATAAGCTTTACMGTATTTATTTTATAAGGCTTAGACAGAA

TTATTGGAGTTTTAAATTAAGTGTATTGGAAAAGAAAGGATGGTATGTGTATGAAATGTT 

AAGATCCTACGCAACACTGCTATTTTTTTCCTTTMTATTTGTGCTGCATAACAAAAGCC

ACTAGACTGTTACTGTCTTGTCTGTCCATGTGTTAACAGCATTTCTTAATGATGTATATA 

TGGAGTGGTCTTCMTCATAGTGMGMTTTAMGAGAAAGTCAATTGTATTGGCATTTT 
TMTAAGAACAAMTTAGTTCGTCTMGGGGACTGGCTGGCCACATATTTGTTCCTTGCC
CATATGCTTTCTACTTCTTGTTCTTATTATGAAATTATGAATTTGAAGCCTCTGAAATGG 
TGATCAGTTTTCAACATCTTTCAAAAACAAAATTACTA 
Sequence 1891 

GCGTCCGCGGCTGCTGTGTGTGAGCAGTGGACACGTGAGGGGGGGGTGGGTGAGAGAGAC 

AGGCAGCTCGGATTCAACTACCTTAGATAATATTTCTGAAAACCTACCAGCCAGAGGGTA 

GGGCACAMGATGGATGTMTGCACTTTGGGAGGCCAAGGCGGGAGGATTGCTTTGAGCC 

CAGGAGTTCAAGACCAGCCTGGGCAACATACCAAGACCCCCGTCTCTTTAAAAATATATA

TATTTTAAATATACTTAMTATATATTTCTAATATCTTTAMTATATATATATATTTTAA 

AGACCAATTTATGGGAGAATTGCACACAGATGTGAAATGAATGTAATCTAATAGAAGCCT 

MTCA GCCCCCATGTTCTCCACTGAAAAATCCTCTTTTTTGGGGGGTTT^ 

TTTTTGAATTTGCACTGGACGGNGGACCGTCAGCCATGTNCAAGGATCCCCAGGGGGGGG 

GNNGTCAAMTGGCTATTGGAAAATTGGGGTGGAMTGNATGCCTTTTTCACTTTTTGGA 

TAMTAAAACATGTAAAMAATGNTTTCAAAAAAAATTAATTAAAATTAAATTAAATTCC 

NNAAAAAAAAAAAAAAA 

Sequence 1892 

AGAGGATTCCCAGGGTTTCCAGGGGGCCAAAGGAGACAAAGGTTCAAAGGGTGAGGTGGG 

TTTCCCAGGATTAGCCGGGAGCCCAGGAATTCCTGGATCCAAAGGAGAGCAAGGATTCAT 

GGGTCCTCCGGGGCCCCAGGGACAGCCCGGGGTTACCGGGATCCCCAGGCCATGCCACGG 

AGGGGCCCAAAGGAGACCGCGGACCTCANGGCCAGCCTGGCCTGCCAGGACTTCCGGGAC 

CCATGGGGCCTCCAGGGCTTCCTGGGATTGATGGAGTTAAAAGGTGACAAAGGAAATCCA 

Sequence 1893 

TCCGCCCCGCGTCCGCTTTTTCCNAACAAGGAGCATCCAAAGACACAGTGACTTGAGCTA 

TAGATAGTMAAATCATACGAGAGTTGAACTGAGTCAGGTTTAGGAAGCAAGTTTGGTTG 

CATCAATTMGCAGGCTCTTTTCMTTGACTGATGCTGGGGCCTTCAGTTTTATTCTCAG 

TATAGATTGCCAGTATTGTTAAGAGTATCCAAAGGCCTTTCTAGATGGAGACAGAATAAC 

TGACTTGAACATACAGTGTGCCTGTAAGTGTCCAGGCTCAGAGCTGGTGAAAACCCTTCT 

GTTGGGCGTGTGCAGGGTTAAACTCCTGAAGTAACTTGTGAGGACTTCAGTGCTTGCTGG 

TGTCCTGGGCAGCACCATGAATGCCTTTACCAAGACATGCCAAGTTGGATCCCCCGAATG 

AAGCAAGAGTGGCTTGTGGGTGTGACCCTTGCTCCCTGCTACACAGAAGCATCGCAAGGG 

CTGCCTGTGTNGGTTTCCAGATGAAGGGTCTTGGGTCCCGGAAGCTTTGTGGTTGAGAGC 

TCAAGTGGGACC 

Sequence 1894 

GTCACCACGCGTCCGCGGACGCGTGGGCGCACGCCGGCGGCGCGGAGGCCGGCTCTGCGC 

TTCGGGCCGCCCCCTCCCCCCACCCCGCTCACACCCGGCACTTACTTCGGCTGTCTCCGC 

TGCCCTCCAGCGGAGACGCAGCTCCTCAGGCGCCCGGCGGTATTTGTTGGGTCGGCGGCG 

TCAGGGATTCGCAGTGGCCTGTGGTCGGCGTCGTCCGGCCACTGGTGCGCCCCCGCGGCA 

GGCAGAGCTCACGCTCCTGTCCCCCGGCTGGTCCGGGGTCTGGGCGCCGCGTCGACGGCG 

GCTCCGCAGGACGCGCAGACCGGGCCGCAGCCCATGCC 

Sequence 1895 

NCCCCGCGTCCGATTTMATGCCCTAAATTTTAAATTCATACCTTTCCATGATTCAAAAT 

TCAAAAGATCCCATGGGAGATGGTTGGAAAATCTCCACTTCATCCTCCAAGCCATTCAAG 

TTTCCTTTCCAGAAGCAACTGCTACTGCCTTTCATTCATATGTTCTTCTAAAGATAGTCT 

ACATTTGGAAATGTATGTTAAMGCACGTATTTTTAMATTTTTTTCCTAMT^ 

CATTGTATGTCTGCTGTGTACTTTGCTATTTT^ 

GATGGAATGMTTTGMGTTCCCAGGGCTGAGGATCCATGCCTTCTTTGTTTCTAAGTTA 

TCTTTCCCATAGCTTTTCATTATCTTTC 

Sequence 1896 

CGACCCACGCGTCCGCCCNGCGTCCGAGGGGCAACAGCAGAGCCTACAGCAGGGGGCACA 
CTCCACCGGNTCCAGCCGCCTGCACGACCTCTACTGGCAGGCCATGAAAACCCTGGGAGT 
CCAGCGCCCCAAGTTGGAGAAGAAGGATGCCAAGGAGATCCCCAGTGCCACCCAGAGCCC

CATCAGTAAGAAGCGGAAGAAAAAGGGATTCTTGCCAGAGACGAAGAAGCGCAAGAAACG

CAAGTCAGAGGATGGCACGCCAGCGGAGGATGGCACACCTGCAGCCACCGGCGGGAGCCA 

GCCCCCCAGCATGGGCAGGAAGAAGAGGAACAGGACAAAGGCTAAGGTCCCAGCCCAGGC 

AAACGGGACGCCAACCACCAAGAGTCCAGCCCCTGGCGCCCCCAC 

Sequence 1897 

ATTATATACTTCTGAATGGCACCTTACTTTTTGGAMCAMTCTTCTGTTATTTACAAAA

TMTMTTTTTAAAAAACATAAAAAAAAAATCCAAAGCTGCTCTCGATAATAGTCAACAT 

TTGCATATATATGGMTTTCTTACTTTTTTTCTCCCAMCTCTAT^ 

TMTGTTTGTGTATTTCATGTATM7TGTGATCTCMTTATAAAAGTTTAATTCAGCATG 

TCTTTGAGCCAATATAATTACTGCACACCCACTAAATTGGGATCAGCCATTATAAATAAT

GTAGTTTTAGAATAATAAAACATGACACATATATATATATATAAATATATAGTATATATT 

^® ^B^^^Z?^^^^^^^® "'^^^^!]^^®"'"^ CAG G AAG GTXTTTTTCTTTCTTC AAGTXAAAAA 

TTAH I n IGCCATATGTAATTTTGGTGTTGCAGGCTGGTCGAGAGGATAAAAAATGGAT 

TTTAAATCTGGGTACCGGATGGATCTTTCNGGNGGTTAAGAAACACAGGGGNTGNGGACC 

TTCCTTTTT 

Sequence 1898 

CCGCGTCCGAATATAGTATTTTTTAATTTTTGTGGGGATGGATTCTCAAATACTTGTGAT 

TTTAAAAGATTCTAAAGCTAAAACACMCTTGATTTTAAAAAGAATGATTCTCCTTACAC 

MTTATAMTATTTGCAGTMATATTTTCCTTATMTACTGTTTTGACCCCATTTAAAAA 

GTATTAGATTATATTCCTTTGATCCAATGAAAACTGAACCTTATAAATGGTTAGCTGAAA 

GTAGACCTTATTCTTGTCCTTCTTTAGAAGAGTAAAGATTTGTCCTAGGGAAGATGGCTG 

ACTTCGGTTCCCAACATGCCGTATGCATTTAGACTGTAGCTCCTCAGCCCTGTGGACACA 

AAATTTGGACAGCTTATTAGGNTACCGTTAGCAATGCTGGACCGGTTTCTTCAACACTAA 

AGANTTTCACCGTTGNAACAGATTTCCTGGTTCGTCTNATGGGGNCTGGTAAAAATGGT 

Sequence 1899 

GCCCTGGAGGCTACTTGTAAATCCTTAGAAGAAAAGCTGGATCTGGTCACGAACAAGCAG 

CACAGCCCCATCCAGGTTCCCATGGTGGCCGGCTCCCCTCTCGGGGCAACCCAGACGTGC 

AACAAAGTGCGATGCGCTGTGCCTGGGCGTCGGCAGAACACCATTGTGGTGAAGGTGCCG 

GGCCAAGAAGACAGNCACCACGAGGACGGGGAGAGCGGCTCGGAGGCCAGCGACTCTGTG 

TCCAGCTGTGGGCAGGCGGGCAGTCAGAGCATNGGGAGCAACGTCACGCTCATCACCCTG 

AACTCGGAAGAGGACTACCCCAATGGCACCTGGCTGGGCGACGAGAACAACCCC 

Sequence 1900 

NCCACGCGTCCGCCCCGCGTCCGGGCCCGGCGCGCCTGCTCTGGGCTCTCCGCGTGCCGC 

ATCGCTTTCTTTTCTTCCTCTGGAGCAGCTATGGCGGCGGCGAAGACCCTGAACCCCAAG 

GCCAAGGTGGCCGGAGCGCAGGCGGCGCTGGCGTTCAACATTAGCGGGGCGCGGGGTCTG 

CAGGACGTGCTGAGGACCAACCTGGGGCCCAAAGGGACCATAAAGATGGCCTTCATCCCA 

GMTMTCACTGMGGATTTGAAGCTGTGMGGAMAGCCCTTCATTTTTTGGAAGAAGT 

CAAAGTAAGCAGAGAGATGGACAAGGAAAC 

Sequence 1901 

CCACGCGTCCGCCCCGCGTCCGAAACATGAGGTTCTCTCTACTGGTCCTCTTAACTGTGG 

TGTTGAGGCTTATATTTGTGTAATTTTTGGTGGGTGAAAGGAATTTTGCTMGTAAATCT 

CTTCTGTGTTTGMCTGAAGTCTGTATTGTAACTATGTTTAMGTAATTGTTCCAGAGAC 

AMTATTTCTAGACACTTTTTCTTTACAAACAAAAGCATTCGGAGGGAGGGGGATGGTGA 

CTGAGATGAGAGGGGAGAGCTGAACAGA 

Sequence 1902 

GTCGCCCCGCGTCCGCCCTCCCTGGCMATMTATAATAACCCGTGMTTTTCAGGAATT
TAAAAATTANGCi Mil ICCACTTAAAGGAGAAAAATATTTGGGACTAGCAGCAGAGGCA 
GTAAGAGATGTGAACCTTGGTGAGCTCTGATACAGTGAGAAGAGATTATACTCATGAAAG 
AGAATGTTAGTGTTACAGAGAAGCAGCCGATAGCAAATCGACTGTAGAGACTTGGCGGCG 
GTGGCATTGCCCCAGGTCGTCAGCAGTGTGGTATTATCTATGAGAACTTGAGCGACAGAG 
TATTTCTTGATGMTTTATAGATCATTTGAGA

TTGM I I I ICAAATAAGTAGAGACTATTTGTAAAAAACGAGGAAAGGGAAATGAAATGGG 

GCGTGTTTGATAGCMTAMTTTGGTTTCTTTTTAAAGAATTCTAAAAAGGGTCTGAGAC

CCTGWAGCATTMTTTTTTGAGTC^ 

TTCTCT 

Sequence 1903 

GCGTCCGCCCCGCGTCCGGGAAACCCCCTTCGATGACCTCCAGAGCCTCCCAAACGACGT 

GATCTCTTCCCTGAAGAACAGGCTGAAAAAGGTCTCCACAACCACTGGGGATGGTGTGGC 

CAGAGCGTTCCTCAAGGCCCAGGCTGCTTTCTTCGGTAGCTACCGAAACGCTCTGAAAAT 

CGAGCCGGAGGAGCCGATCACTTTCTGTGAGGAAGCCTTCGTGTCCCACTACCGCTCCGG 

AGCCATGAGGCAGTTCCTGCAGAACGCCACACAGCTGCAGCTCTTCAAGCAGTTTATTGA 

TGGTCGATTAGATCTTCTCAATTCCGGCGMGGTTTCAGTGATGTTTTTGAAGAGGAAAT 

CAACATGGGCGAGTACGCTGGCAGTGACAAACTGTACCATCAGTGGCTCTCCACTGTCCG 

GAAAGGGAAGTGGAGCAATTCTGAATACTGTAAAGACCAAAGCAA 

Sequence 1904 

CGTACGGGGTGCGGTTGGCGGCGGCGGCTGGGCCGGGGGCTGCCGGCTGCGCTCGGGCCG 

TGCGCGGCGGCCGTGCGGNCACGCCATGGACTTCAACATGAAGAAGCTGGCGTCGGACGC 

GGGCATCTTCTTCACCCNGGCGGNGCAGTTNACGGAGGAGAAATTTGGCCAGNCTGAGAA 

GACTGAGCTTGATGCCCACTTTGAAAACCTTCTGGCCCGGGCAGACAGNACCAAGAACTG 

GACAGAGAAGATCTTGAGGCAGACAGAGGTTCTGCTGNAGCCCAACCCCAGTGCCACGAG 

TGGAGGAGTTCCTGTATGAGAAGCTGGACAGGAAGGNCCCCTCAAGGGTCACCAACGG 

Sequence 1905 

CNCGCGTCCGGTGCATCTTGCCCATTGATTTCTAAATGTATTAACTACTTAAATTAATCC 

TGAATCTTTTCCCAGGCTTAAGTGGGATMTGTTTTATTGTAGATGCATATTTCCTGGCT 

CTACCCAGTCTTTCTTTGAAGACTTTATCATCCTATTTTCTGMTCCAGTGGCTGACTTT 

AATCTTCTCTGGAGGAACTAGATAATTTCTAGACTAATGCTTACACTCATGATCCAGATT 

GTAATTTCTGAACTCCTTCTTCCAAATAGAATCAAAACAAGAAAGGGGAAAGCCTCTCAA 

AGCAACTGTGCGTTAATMTGAMCACTCTTTTTTTCTAATCCAAGGAGGGTTTCATACT

TTTTCTTAGTTTCTTGCCCTCTTCCCTTCTGATCMTAATTGTAATAGGGAMTTTGCM 

TTGTGCCMTACTCAGATTCAATACTGMCTACTTTCTTGCATTGGAATTCAAATTCCAA 

GGTTAACMCTAGCTGTATGTTTCCAAMCAATCTTATTGGATATGGATTTTCTTAGGGG 

GAAGGTTCCAGAAATGATTT 

Sequence 1906 

GANCAGGCTCAAGAGCAACATGGAGGTCTGCACTTAATCGCTCCTCTCCGGGGGCGGCCA 

TACCGAGGAGGCGTCTCTTCCGTGCAGGCAGGCTCTCCTGGGGACCTCAGAGATTCTCTC 

CAGCGGCAGCGGAAAACGGACAATGGGTGGATTCGGGTCCAGATTCTGGTAGGAGGGAGT 

TTGGGATCGAGATCTGGAAAAAAGCACTAGACTGGAAGAGGACGCGATGGAGTCGGAGCC 

GCTGGCGGGGACAAAAACCAGAGGCCGGGGAAGGCGCCGGTGGGAGGCAAGGCACGGATG 

GACTTTACCTGCGCACGCGTCGCAGCCATCTCCGCGCACAGTGGTGGCCACCGCGACTGG 

TGCTGAAGTGTTGGCCGCGTGCCGGGCGCTCCGCTGGGACCCGGGTTGCTGGCCCTGAGT 

CTCAGCTTTCTCATCTGTACGGTTGGGACAAGTACAGTAACCCTCTCCCGTCAAGACGGG 

CC 

Sequence 1907 

GTCGACCNCGCGTCCGAACATCGTCAACTACGGCATCCCAGCCCACCGTGACATCGACGA 

GTGCATGTTGTTCGGGTCGGAGATTTGCAAGGAGGGCAAGTGCGTGAACACGCAGCCTGG 

CTACGAGTGCTACTGCAAGCAGGGCTTCTACTACGACGGGAACCTGCTGGAATGCGTGGA 

CGTGGACGAGTGCCTGGACGAGTCCAACTGCCGGAACGGAGTGTGTGAGAACACGCGCGG 

CGGCTACCGCTGTGCCTGCACGCCCCCTGCCGAGTACAGTCCCGCGCAGCGCCAGTGCCT 

GAGCCCGGAAGAGATGGGACGTGGACGAGTGCCAGGACCCGGCAGCCTGCCGCCCTGGCC 

GCTGCGTCAACCTGCCGGGCTCCTACCGTGNGAAGTGTCGCCCGCCTTGGGTGCCCGGGC 

CCTCCGGCCGNGATTGCCAGCTCCCGAGAGCCCGGCCGAGCGTGCCCCGGAGCGGGCCGA 
ACGTGTGCTGGAACCACGCGGAAGAGGACGGNATGTGCCCTTGCCCCCTGGCCCGGGCC
Sequence 1908 

ACCACGCGTCCGGGCGGCCCGGCCAGGCCCCCGGCACTTCCTCGTCCTCGGCCCGGGTGC 

CCTGCCCCCGTCCAGGAGCCCTAGGAGTGCTACGGGGGGCCGGAGCCTTGCCCGGGCCGC 

TGCCCCGTCCCTGGATTCGGGGCTGGACGCAGCAAGCAGGNGCGCTGTGTCCCCAAGCTC 

CCCGTCCTCGGGGGAGCTTTTGGAAGAGTCCCAGATGGAAGCGACCAGGCTCCGGCAGAA 

GGCAGAGGAGCTAGTGAAGGACAACGAGCTGCTCCCACCACCTTNTCCCTCCTTGGGCTC 

CTTCGACCCCCTGGCTGANCTCACAGGAAAGGACTCAAATGTCACAGCATCTCCCACAGN 

CCCTGCATGCC 

Sequence 1909 

ACGCGTCCGGAAGGGTGTTACATGTGTCTCTTCAATACCTTTGGTTTTGGGAAGATCTCA 

GGAACGGCCTGCCTCACCGTCTATGTACAGCCCATAGTATCCCTTCACTACAAATTCTCT 

GAAGACCACCTAAATATCACTTGCTCTGCCACTGCCCGCCCAGCCCCCATGGTCTTCTGG 

AAGGTCCCTCGGTCAGGGATTGAAAATAGTACAGTGACTCTGTCTCACCCAAATGGGACC 

ACGTCTGTTACCAGCATCCTCCATATCAAAGACCCTAAGAATCAGGTGGGGAAGGAGGTG 

ATCTGCCAGGTGCTGCACCTGGGGACTGTGACCGACTTTAAGCAAACCGTCAACAAAGGC 

TATTGGTTTTCAGTTCCGCTATTGCTMGCATTGTTTCCCTGGTMTTCTTCTCGTCCTA 

ATCTCAATCTTACTGTACTGGGAAACGTCACCGGAATCAGGACCGAGAGCCCTAAATAAG 

TCACACAGCACCCTTGAAAGGGGATTCCTGGNCTACTTGGATTTGGCACAAGAGAAAAAG 

CAGGAGGGAAAGG 

Sequence 1910

GCGTCCGCTAGTTCTAGATCGCGAGCGGCTGCCC I Mi l I CTTTTC l ITTTTTTTTTTn 

TGAGACACAGTCTCACTCTGTCACCAGGCTAGAGTGCAGTGGCACGATCTCAGCTCACTG 

CAACCTCTGCCTCCCAGGTTCAAGTAATTCTCCTGCCTCAGCCTCCCAAGTAGCTGGGAC 

TACAGGCACGTGCCACCACGCCCAGCTMTTTTTGTATTTTTAGCAGAGATGGGGTTTCA 

CCACATTGGCCAGGATGGTCTCGATCTCAACCTCGTGATCCACCCACCTCGGTCTCCCAA 

AGCGCTGGGATTACAGGCGTGAGCCACCGCGCCAAGCCAAGGTCTGCATTTTTCTTTAGA 

ACTCAGAACACCCAATAGTCCTAGGCCCCCATCCTCGCATGGCAGCAAGCTAAATAAGCA 

TNTTCCCACTGCGAGTTGGGG 

Sequence 1911 

GTCCGCTGAGAATGGATAATCTCACTGCAGGTATTCCATAATAGGCTTGGATTAAAACCA 

GGGACAATGGGCAGCAATAGGCCATAGGTTAAGCAGCAGCAACTAGTCACCACTGGACTG 

TCTTCTTCTCCCCTTCCCCATATCCCACATTCTCCTAAACATGATGTACGTGTAGCAACA

GTCTTTTAAAGTCAGATGGTCAGACTAATTATTTTTACAATTTAAGTGTAAGTGATGTAC 

ATGAATGGACTCTGTGAATCGGAAAACTTACGTAACAGCAGAGAATACGTATGTTATATG 

GAATAACCTGAGTTG AAGGTACAA I I I 1 1 I I I CCAGCTCTTTTATTCCTTTAACTGCTTA 

ACAAMGAMGAGTCTCCAMGTTTMAAMCCTTTGAAAMTATACAGCTTGATATTAT 

TTACATAAAATATGAATCCAGGTTCCAATATCAAACAAACATTGCTATGTCAGAAACACA 

GTGGAAGGCAGGAACGTAACTCACTGCCTTTTAGAT 

Sequence 1912 

CCCGCGTCCGCTCTTTTCTTCTCTNTAAAGTGAATTATTCC I I I I I I I I GTTTTATGTAA 

CGTGTATATATTCTTAGTTTTTCTTGAMTCATTGTAATGTTMCTTTGTTGTTTCAAAT 

ATCTTGGTGATTGCTTCATTATCTCTTCMCAMAAAMCCTTTAATTTTGCCATTGAAA 

CTGTAGMCTATGCCATGCTTTTATTAGAAGCAGTGCTCTGTGTTAACAACAAGAATGGT 

GTMTTAGMTTGGGATGTGGATATTTACTGTATGACAACACATTTACAGTTCTGTAATG 

CMGGATGCAGTTTAAAAATGTGAAGTAGTGATGGTTTTTGAMTMGCTTTAAAATATA 

GGGGATCTTGAAGGCTCCCTGGGGTAACTATTTTATAACTTAGATAAAATGGCTAGTCAT 

ATCTGTGTGGTTTGNMAGGTTATTTTTTTAATA 

AGAMTGAGCCCAACTATTTAAATTTTAAAACAGTAAAAACAAA 
Sequence 1913 

CGACCCACGCGTCCGCTTTGAAGCAGGAAAAGACACGTCTCTAGAGCAACATGGAAAGGA 
TCCAGTATTTGTTTTCAGGCAAMGCMGCCTCCAAMGCCCCTTGGTTCTTCTCTGTGT

GTCTCGCCCCTTCTGCCTTGTACACCAGACTGCCGGCAAGCAAGCGGGGAGGTGGCGAAC 

AGCTGGGATCCTCCAGTGAAAAGTTTGCTCAGCGCGAGAGATGTCAAGAGGCTGGGATTG 

TCGCCTATCGGGAGGTGGGGAGCTCTCTCACTCACACTTCCTGGGAAATGAAGGAGAACT 

AATAGGCAGCCCGCATTTGCAGCCAGCCAACCTGGGTGGTATTCCTTGAAGTGAAGCGCT 

CATAAATCCCTTCTGGGGCCAGAGCCGGTTTCACAAGCTGCAGGATTTGGGTGGCCTGCC 

TGCTCTTTGGGAGCGGGCTTGGGAAGCTGAAGAAATTGAATCAGGACCTCAGGCCCTTTA 

CAGACTCCC 

Sequence 1914 

CGCGTCCGCCCCGCGTCCGCTCAGAAACCCCTGCCCTTTCCCTTCAGAAAACGATGGCAG 

GCATTCCTCTGAGTTTAGAAGCAGAGACTCACTCCAACCCAAACTAGCTGGGAGTTCAGA 

ACCATGGTGGMTAMGAAATGTGCATCTGGTCTCTTCTGTTGTTTTTATTTCATATCAG 

ATTAAATTTCTTTACCATGTTGGCTAAGTCTAAATATTAGAGATGAGGCTGTGCCTACTC 

CCTGGCCAGCTCTGCTGATAGCCTATGATGGGTTCCAATGGGAAATGACTCTTTACTATT 

AAAAGACAAGGAAAGCTCTGACTTCGTACTTCTCTGATGAATGGCAATGTAAATGAACAA 

GGCTCCATGTGACTGGAGCATGGMGTGAATGCTACTTTCTTAATTTAATCTGCCCTGTC 

CTACCTGCTCCTCTGATTGTTAGCCATCACATAACTTATTGAATGCTTGCCATGTGCCAG 

GCACTGTGCTGAGTGCCATACATACATTTCATTTAATTATCCAATAATCCTACTTACTA 

Sequence 1915 

CCGCGTCCGATTCTNTAACATCTCTGTGAGGMGGAATTTTTATCCTTATTTTACAGATG 

AGGAAGCTGTTTGGAGATAATTTAAGTGACTTGCCTGGGGAATCTAGCCAGTAGTAGAGT 

ACTGATTAATCAGGTGCTGACATCTGCTCTGCTTTGTGTATGTAATTCAGCAGTGCTTCA 

AAGATCCAAGAAGCTGTAGCAGATCTCAATACACTCTCCTATAAAATTAGTGAATAATCA 

CCATGACAMATTGGTATGGCGGMCAGTCATTATACATTATTTAGACTCATTCCTTCTT 

CCAGTGCCCTTATGATTATTTCCTACCTTTACCATTGG 

Sequence 1916 

CCNCGCGTCCGCCCCGCGTCCGCCCCGCGTCCGATATTGCTTCAGAAAACTGAATGTGTA 

TGTCGGTCATATTGCCTTTATAACCATGCTAATATCTATGCTTTATACATACTCAAACTT 

GCCTTGCCTTAAAAAAATACATACTACATACTTAAATCAGGAATTCTAGCCATCTCACAG 

AATACCAACTAAAACTAAGTGCATTGAGATCTGAGATTGGTAAACCCAGATTCATTTACC 

ACAGCTGTAATTAAGTTTTTAGAMCTATTCTCTTTTTGGGGAAATCCATTGMGTT 

TTCTGTTATCTTATTAGMGAAAATGATGTTGATATGTGTTTCAGATTTTCCATTTGAAA 

TCTTATAATTAATTTGATTTATTTACTGTAAGTAGGAGGTATNAATGACACTCTTAAATT

GGAAGGAGGGGTGTTTTAGTGTCTGTNTTAGGTCAAAATTAGTGATTCTATTTTTATCAA

AAGTTTTATCCTGMGTTTCAGGACCACTCTTCCTTAATNAACTTGTTAATGGGAAGCGA 

GCCTTATGAACATTTAAAAAT

Sequence 1917 

CGCGTCCGCAGCAGTAATCCTTTMTAACTGGCACGAGCACTTTATTCTTCTGGTGAGCT 

CCCTGMTATTTATTTTTCTGATTATAMTTTTCTATATTAGTAGCATTTT^ 

ACTTCTTCACTATAGAGCATTTACTTTTAGTCTCTAGATGTATATTTTGGAATGCTGTAC 

TTGGCATMCATAGATTAAAATCATMTGCATGACTAAAAACTCCTTGGATTTATTTCCC

ATTTTAAAATTTTTAGCGGTMGTTCAGATTTATAATCTTTCTCTAGACTTCCATGG 

GMTGTTGCCTGCTGAAGTAGCAACCTAAAAAGTATCCCCTGCTTATGCTTCTCCAGTTG 

GCCCTCCATGTCCATAGGCTTCGCATCTGTGATTCAGCCCACTGTGGGTCAAAAATATTT 

GGGGAAAA 

Sequence 1918 

GCCCGCGTCCGCATCCCTTGTATACCATCGTAGACTTCATACTGGAGAGAAACCTTACAA 

ATGTGMGAATGTGATGMGCTTTCAGTTTCAAATCGAACCTTGAAAGACATAGGAGAAT 

TCATACTGGAGAGAAACCTTACAAGTGTAATGATTGTGGCAAGACCTTCAGTCAGACATC 

ATCCCTTGTATACCATCGTAGACTTCATACTGGAGAGAAACCTTACAAATGTGAAGAATG 

TGATGAAGCTTTCAGTTTCAAATCAAACCTTGAAAGACATAGGATAATTCATACTGGAGA 
GAAACTTTACAAGTGTAATGAATGTGGCAAGACCTTTAGTCGGAAGTC
Sequence 1919 

NGACCNCGCGTCCGCGCTCCGCTGCCNGGGGCGGGAGGGAGGAATGGTTGCTTCACGCCC 

CGGGGGAAGAGACNGGAAGCTCGGCTCTGGGTTGCGGGCCCCGGGGTCTCCGCGTGGGGC 

GCACCGTCCGACCCGCCCCTCCCGGTGTGCAGCGCCCCGCACCGCCCCGCCTTGCCTGGG 

AGAAGCCCGGCGGGACGCGCCGGGCTGGAGTGGGCGGTTATAGGCTTTGAGCTAGGCCGC 

TTCCGGGAGGCGGAGCTCACACCCCATTTCCTTT 

Sequence 1920 

GTCCGTTCCTTGATTCCTGGAAGTCCAGTGGGTTCTGCAGCTGAAAAAGCCCTGGGTCCC 

AGCAGCAGAGAGACAGGACAGAGGGGATGCTTGGGCGGGGAGGGACGGTAACCTGCAGAA 

CAGATTCCATTTTTATAGMCGAGTACACGTTTGCTAAMCAGTCCTGCTTTCCCAGACT 

GGATTCCCACCACAGGGACAGTCGGAACTCAGGACTAGCTCCAGCGACATCTTTCCTCCG 

AATTCMGCCTTCTATCACMTGTCAAAACAGCTATTTATAMGCCATTTTCATTGTACT 

TGATMCAGCACGAGTCCCAMACTTTTAGAAATAAAATAGGACATTGGCTTGATTGAAA 

AGAGGGACTTTTTAAAAATTGTTCTTTCGTCAGAMGCCTTTTGGATGACT 

Sequence 1921 

GCGTCCGAAAAAAATAGCATTATACCTCTTCCTTGTCTCAACCGCCATGAAAATTCTGAA 

CACTCCMATTCAGTTGAATAATCCAAAACAAAATTTATAAGTATAAAATAATTTTACTT 

CTTATAGTAATAGTATACTTTAAAAAGCCTCAGGGTATATTATCTTCTAAACAGCTACAA 

TTCAGTGCAGCTACATTAACCAACTATGTTCTCTAGTTGAGAACAACTAGGCCTATTTCA 

CTGCTGTGTAGCCTCAGTGCCTAACATGGGTGCCAAATAAATATTCGTAGAATTACACTG 

AATTGTAAAAACCATTCGTTTTTGTTTACAATTGCCAAAAATCTCAAAAGGCCCTGTATT 

TATGTMTTCTTTGAMTTATTATTTTATTTTGATTTCTCAGTTATTGACTGGTGGGGTG 

TGACTTAGTCATMGTACTCMTATTATAAAAACCTCAAATAATTGACTTGGATTTTACA 

CAACATCCTTCCCTTTTCTACAAGTTAAA I Mill IACC 

Sequence 1922 

TTGGTATTCTTGGCTMTTTCTTAGCTACT^ 

1TAGMAAGTTTTAAGGTTGCAAAGTTATCAACACTGGGGCAGAGGGTGGAGAGGCCAAT 

GCGGGTAGAAGGAGGCAGTTATGTTTATATTGAAGGTGAAAI I 1 1 I CTTTCATTTAG AAT 

GGAAAACATCCCCAAATGTATCATTATAAACTAGTCAGCCTTGACTACACAAAATTGACC 

TTTMGTTGCTTGAGAAAAACACAATGCAAATCGTTCAGAAGGGTCAACATCCTTTGGTG 

CTAAMTCTTGTGTATGTTTTCAGMTGGCTTTTTCTGTATGTTATAGAATAATCACTAA 

AGGMAGGTAGTTGAATTTAAAGTCATGAAGCAAGACTCTTTMTTCAGTTATTTTAAAC 

AAGMTTAAAACCCCMACATCCTTGGCAGGCTTTGMGCACACAGAATTTTCTAGNATT 

TCTTATTA 

Sequence 1923 

CNCCGCCGGAACAACAACAGAAAGCTGTGTTTGTC I I I 1 1 ( CTCTCAAATATATCTCCCG 

TATGAGATTTCAGGTCCCCATGTTTTCACCAAGCAATCTGCTATGTCAGCCAACCCANCA 

TCACTTTCTACAGGAGGTTATGATTTTTGCCATTTACTAGAGGMGATGTTTTATGAAAT 

CMGTTGGGGTTTGAATTCAGGTGCAGTCATCAGTTCTTTAGGGGCTGCMTGTTTTAAA 

MMAATMGTCATCAGATTTTAAGMMAAGTGATGATTTCTTATTGATATTTTTGTAA 

CAGMTATAGGCTCTTAACTGAAMTCCAGMCCAGMACATAAATCTTGAGTTTCTTTT 

CATGTACATAAAAAGCAATAGCCGTTTTAGTATAGGATAGCCCTGAGCCCAAAAAGTAAT 

AGAMTTTTCTCTAGATATTTTAATACAGAGAGTGTATAGACTGACTCTAAGTTAATAAA 

TGTGCAAAAATATCTTAAAACCATNCCCTNCCCTTTATTTCAAC 

Sequence 1924 

CCNCAAAAAGGAACCAGAGGCCACTTGTATATATATGGTCTCTTCAGCATTTATTGGTGG 

CAGAAGAGGAAGATTTCTGAAGAGTGCAGCTGCCTGAACCGAGCCCTGCCGAACAGCTGA 

GAATTGCACTGCAACCATGAGTGAGAACAATAAGAATTCCTTGGAGAGCAGCCTACGGCA 

ACTAAAATGCCATTTCACCTGGMCTTGATGGAGGGAGMAACTCCTTGGATGATTTTGA 

AAGACAAAGTATTTTACCGGACTGAGTTTCAGAATCGTGAATTCAAAGCCACAATGTGCA 
ACCTACTGGCCTATCTAAAGCACCTCAAAGGGCAAAACGAGGCAACCCTGGAATGCTTCG
TAAAGCTGAAGAGTTAATCCAGCAAGAGCATGCTGNCCAGGCAGAAATCAAGAANTCTGG 
TCACCTGGGGAAACTAT 
Sequence 1925 

CCACGCGTCCGTCCGCCfCCAGCAGCATCCGCTTCAGCAAGGCCTGCCTGAAGAACGTCT 

TCTCGGTCCTACTCATCTTCATCTACCTGCTGCTCATGGCTGTGGCCGTCTTCCTGGTCT 

ACCGGACCATCACAGACTTTCGTGAGAAACTCAAGCACCCTGTCATGTCTGTGTCTTACA 

AGGAAGTGGATCGCTATGATGCCCCAGGTATTGCCTTGTACCCCGNTCAGGCCCAAGTTG 

CTCAAGCTGTAAGCACCATTTACGAGGTCATTCCTCCTCTGACAAGGCCTGGCCAGCCGG 

GTGACATGAATTGCACCACCCAGAGGATCAACTACACGGACCCCTTCTCCAATCAGACTG 

TGAAATCTGCCCTGATTGTCCAGGGGCCCCGGGAAGTGAAAAAGCGGGAGCTGGTCTTNC 

TCCAGTTCCGCCTGAACAAGAATAGTGAGGACTTTNAGCCGCCATTGAT 

Sequence 1926 

GCGTCCGGTMGTATTTTGGMTTCMCCCTCGMTTATTTTTTCTCATTTCAGCATAGT 

GATAGGGGATGCMTGAGGCTTCATTATTTTTTATGACCTGCCCCTCATTTGCTCTGATG

TTCCCTAAATTCTGTAATCATATCATMCTTTTGTTATGAATAGAGAGGAATGGGCTCAC 

TGAAACCTGACACTAGAAATTGGTGGGTGATGCTCATAACTGCAAACACTTAGCTTATTG 

AAGTGCCTCTATTTACATGTTCTTTAGTTATMTATGTATTTTTCTAACAGAAATACACG 

TCTGTAATTGGTATATATTATACTTTGTATGTGTCACAACAAAAGCTAAACAGAGGCTAA 

AGTCTTTAGCAGAGAAGAATGAATTN 

Sequence 1927 

AACTGTTTGGGAAAATACGTTGAGGGAGAGAAGACCTTGGGCCAAGATGCTAAATGGGAA 

TGCAAAAGCTTGAGCTGCTCTGCAAGAGAAAATAAGCANGACAGAGGGATTTGCTCTGGA 

CAGANATGGAAGAGCCNGGGAACAGAGAAGTGTGGGGAAGAGATAGGAACCAGCANGATG 

GCAGGGGCAAAGGGCTCAAGGGTGAGGAANGCCNGTGGGACCCCACAGANTATGGGGAGA 

TAMGGACATTGCTTTGCTTTTGGTGGCACCGTAAGCTCCTTGACTGTCTNCAGCACCCA 

GAATCTCATTMMGCTTATTTATTGTACCTCCAACCGGCTTGTGTGCAATGGGGGTCTT 

TTTGTGGAAAATCAANGAGCANACAGGTTTTCATGTGTACTGTCACCACGTGGGATGNGA 

ACCAGATNGCATGGAANCAAGACGCTAAATGNAAGAGGGCCATAANGGNTGGGATTTCCC 

AGGCNCCTTAAGAACAGCTTGTC I I I I I I l I i I I l CCTTTCCAAAAAA 

Sequence 1928 

CCGCCGGTMGTTAMGACTTTMGGACATTCAMGTTTAAMTAGTGTTCAAATTGCAA 

MTTTGGCAATCTTCATATAAATTGGTTTCTTTTCTMCTmCAAAMC 

TGTCAATTATAGGAMACATAGTTGGAMTGTMTCATCCAMGATCATTTTTAAAATGA 

AATTTMTTAGCACATATTGAACATTTGACTTMTTGTTAMCCCCAGTTTTGTl^ 

Nil I AATCAGATTTTTGCACACTGATTAGTTTTTGTGTTGTGGCTTTTGTTGCTTTATT 

ATTCAAGGI I I I I I I I I I I I I I CTTCCCCATGGGGGAGATTGTCTTCCAATGTTTAACTA 

CGTTTAAATAMTAAAAATTGAATTTTATTGhrrCATTTATATAAAATCTGATCCTTGATG 

TAATTTCCAATACAGTTCCAATTTTATGGCTTTATMTTACMTGATATTTTCTTC 

ATAMAACCAMGTAAACATTTAAATGGGGAMCTGATATTTTTCATTTATATGAAGTAT 

NAAGCCCTCTACTGGGGTCNTTATTGGNGAATCATNCTGCCTTCAAANTGGTTTCAAAAN 

TGGGTAGAMNAAAACC I I I I I I I I I G 

Sequence 1929 

CCGCACACCCTAAAGAAAATAAGTATCCCAGTCGACATCAGTGACAGTGATATGATGCTG 

AACATCATCAACAGCTCTATTACTACCAAAGCCATCAAGTCGGNGGTCATCTTTGGCTTG 

CAACATTGCCCTGGATGCTNGTCAAGATGGTACMGTTTGAGGAGAATGGTCGGAAAGAG 

ATTGACATAAAAAAAAATATGCAAAGAGTGGAAAAAGATACCTGGGAGGCAATCATTGGA 

ANGACTTCCTGTGTCTTGCGTGGAGTCATGATTAACAAGGATGTGACCCATCCACGTATG 

CGGCGCTATATCAAGAACCCTCGCATTGTGCTGCTAGGATTCTTCTCTGGAATACCAANG

AAAGGAGAAAGCCAGGACTGACATTTGAGATTACACGAGAGGAGGGACCTTCACCCCGAA 

TTCTCNCAGAATGGAGNGAAGGAAGTACATCCCAGCAGCTNTGTGAGGGACCATTATCCC 
Sequence 1930

I I 1 1 I i II 1 I AC AACTCAAAAAAAG AMTCTTTTAATAAAAATTACTCATAAAAATCCTA 

ATAMTTTTAAAGAGCAGNGATATTCCTTATTACATTTATAAMGAACATTTGGNCCTTT 

TACAAMAGATCCCTTTTAATTNAMTACNTTTCTTATTTACAGATTAAACATAAAATAT

CATNTACAGTTGCAMGCATATTGCACATTACAGAGCAAAGCATTNGNGTATTTCCGNAA

GTTTTCCCAGAGTTCCCAACTCTATAC I I 1 1 1 I I I GTAAAAAGATTTACCCTTGCTTATG 

CAAAMTAAMTAAGAAATGCNANCTGNCGGTTTTGCTATTTAAAAACTANAANCCAAAA 

TAMCCTNTTAAMATATTATTCCTCTGCCTTGCANAAAAGGAAAGTGAAGAGGGGTNTT 

ANAAATCAGNGGGGGTTNCCACCAGNGTCNCTTGATAATTTT 

Sequence 1931 

CGTCCGGGGAMCTTTATATGGTTGGGGATAAGAATTGAATGCAAATTAATGAACTAGAT 

TTGCATTTATGGAGTTACCTCATCATGGAGTTACCTTGGTCTGTCCCACGTCAATTAATC 

TTGNTTCCTTATTTTCATAGACCATCCTCTAGAACAGTGTTTTCACACTGTGTATCATGA 

TCCTATMTGCAGGGGTGTCCAAGCTTTTGTCTTCCCTTGGCTACACTGGTNGAAGAATT 

GTCTTGGGCCACACATAAAATATACTAACACTAATGATAGCTGATAAGCTAAACAAACAA 

AAAMTCACAAAAAATCTCATAATGTTTTTTTGAGATTGTGCTGl Mill I AGTTGTGTT 

TTCAATGGAGAAATAC 

Sequence 1932 

CGTCCGGCGCGTTCGTGCGTCCTAGTTCCAGTACATGCGTGGAGGGTTTACGGCAGCGTG 

TTCTGATTCTTTGCGGGACGGCGAGCGCATTTGTGCTTTGCCCGCCGCGGCCTANGAGGC 

CTTTTGAGGCCGCGTAGTCGGTGTTTTTGAACTGACTCTACAGCTTCTGGCAGGCCGTGC 

GGCGCCCTGACCCGGCCTCACCATGTTGGTGCTGTTTGAAACGTCTGTGGGTTACGCCAT 

CTTTMGGTTCTAMTGAGAAGAMCTTCMGAGGTTGATAGTTTATGGAAAGAATTTGA 

AACTCCAGAGAAAGCAAACAAAATAGTAAAGCTAAAACATTTTGAGAAATTTCAGGATAC 

AGCAGAAGCATTAGCAGCATTCACAGCTCTGATGGAGGGCAAAATCAATAAGCAGCTGAA 

AAAAGTTCTGAAGAAAATAGTAAAAGAAGCCCATGAACCGCTGGCAGTAGCTGATGCTAA 

ACTAGGAGGGGTCATAAAGGAAA 

Sequence 1933 

AGGGAGCCGCCCCGCGTCCGCCCCGCGTCCGCGGACGCGTGGGCTAAAACCCATCAGGCA 

AGATCACCACGCA1TGANATATTTTCATATCAAGATAMGTCGCACATTTTCCACAATAC 

ATTGCTAAMTAMGAGGAGAAAGGCTTAGGMGTTTTTTTGCAGAGAGTGCTGGTAAAG 

AATTGAGCMGTTTGCTATTGTATTGNAATGTTTCTCTCAGGTTTGNTCTTCCTATCATG 

GNNGGTATTCCATGAATAATTGAGATCAGCCCTATGTAAGGTAAGATCATAATATGGGGG 

ACAAATGG 

Sequence 1934 

GCGGACGCGTGGGCTCCATCTGAGCTCTTGGGTGACCAGGGTGCATTGTCAATGAGGGTA 

ATATTTTGAAAGACATCTTTATTATGAGCAGTAGGTCTCAACAGTGGGCTTAAAATGTGC 

AGTAMTCATGCTGTAMCAGATGTGTTGTCATCCAGGTTTTGTGCCATGTCTAGAGCAC 

AGGCTGAGTAGATTTAGCATMTTCTGMGGACCCCAGGATTTTCAGAATGATAAATGTG 

CATTCGCTTCCACTTACAGTCACCAGCTGCATTAACCCCTAACAAGAATCAGCCTGTCCT 

TTGTAGCTTTGGAGGCAGGCATGAACTTCTCCTAGATGGCATCTTCCAAGAGGGCTATTT

TTGTCTACATTGAAATTCTGCTTAGTGTAGCCACCTGCTTCAATGATCCTA 

Sequence 1935 

TNGACCACGCGTCCGGGCGCCNTCTCCCAGCAATCAAGTCTTGCTCCCTGGCCTGCCCTC 
CGGCACCCTGGAGCCCCGCTTTCCACACAGCCTTGCTCCTCCCGGCCTGGCTTGCTGCCT 
GCAGGCCTCTCAGGGTCCTTCACCGTCTCTCCAGTCTCGGATCCGGACCCGTGGATCTCA 
GCCTCAGACCCTCCTCTGGCCCCGGCCTTGCCCTCGGGCACGGCCCCTTTCCTCTTCAGC 
CCTGGGGTCCTGCTCCCCGAGCCAGAATATTGTCCTCCTTGGAGGTCCCCAAAGAAGGAG 
TCTCCCAAGATCTCCCAACGTTGGAGGGGAAGTCCAAGCCCAGGGGGAACTTGACATACC 
ACCAGGTACATGCCCCCAGAGCCGAGAAGGATNCGGGGGGCAAGACCCCCANGCCGTAGG 
GNGTCCTCCCTTGGGTCCTCCTGGGCCATCTCTGTGGGGAAGGGACAANACTCGCAACAA

NCCACATTCTAANGATGANGGCCCTTTTNCCCT 

Sequence 1936 

CCNCGCGTCCGGAAAATATCCNAGGTTGTACGCAGCAGTGGAAGTTGCTCTCAAGGGAGT

GGTATTTTACACTATGCTCATGGCGACAGTCAGCAAACTCACCTGTTGAAGCAAGGAAGA 

AGCTCCATGGGCACTGGTCTCAGTGGTGGGAAACGTCCTAGTCAGGAAGAGGACACACAG 

AGTATTGGTCCTAMGNCCAGAGACAGAGCACTMTTAGGTAMTATTTTAGAGCTGTAT 

TTCTTGCTTTAGAAGAGTATATMTTMCATAAATTAAGATAATTTCAAAAATGGAGCAA 

ATCTCTATTTTCAMCCAGAAMTCTTGAGGCATTAATTTTTAAGCMTTTTTACAAACT 

CAGTTAATTTTTGGTCAAGAGACATGCATCTGTACTGGAGAAATTGTTGCACCAAGTTTT 

ATA7TCATCTGAACCAATGC 

Sequence 1937 

CCCCGCGTCCGCCCCTTTNCTCCCTGAGGACTCCACAGAAGATGGTATATTATGGGAAAC 

CTTCTTGTAAAAACAACTATGAGAATTATATTGACATAGTGAAATATGTGTTCAGCGCTT 

ACAAGAGAGAGTCCCCTCTCATCGTCAACACTATGGGATGGGTTTCAGACCAGGGGCTCC 

TGCTTCTCATTGATCTGATCCGATTGCTGTCTCCCAGCCACGTGGTTCAGTTCCGCTCTG 

ACCACAGTAAATATATGCCAGACCTTACCCCGCAGTATGTAGATGACATGGATGGCTTGT 

ACACAAAAAGCAAGACCMGATGAGAAATCGACGTTTCAGACTCGCAGCATTTGCAGATG 

CTTTGGMTTTGCTGATGAAGAAAAAGAGAGTCCAGTTGAGTTCACTGGACATAAACTGA 

TAGGTGTTTATACMGACTTTGCATTCAGAATAACTCCAAGAAATAGGTAACTAACCCTC 

ATTTGGCTGAAGAATTATTTTTCCTTCCGTCGAAAAAGACCTGCCATTCTCA 

Sequence 1938 

GTCGACCCACGCGTCCGCGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGCTCTCGC 
ACTCTGTTCTTCCGCCGCTCCGCCGTCGCGTTTCTCTGCCGGTCGCAATGGNAGAAGAGA 
TCGCCGCGCTGGTCATTGACAATGGCTCC 
Sequence 1939 

ACCACGCGTCCGGGCGCCAGGCTAGGGCGGCCTGGCCACTGAGCCGGGGTGCAGTGGCAG 

CGGGAGAGTACCTGGCGATGGCGATATGAGCGGTGCGGGGGTGGCGGCTGGGACGCGGCC 

CCCCAGCTCGCCGACCCCGGGCTCTCGGCGCCGGCGCCAGCGCCCCTCTGTGGGCGTCCA 

GTCCTTGAGGCCGCAGAGCCCGCAGCTCAGGCAGAGCGACCCGCAGAAACGGAACCTGGA 

CCTGGAGAAAAGCCTGCAGTTCCTGCAGCAGCAGCACTCGGAGATGCTGGCCAAGCTCCA 

TGAGGAGATCGAGCATCTGAAGCGGGAAAACAAGGGTGAGCCGGCGCGGGGCCCTAGGCC 

GGCCCTGCCTCCCCAGGCACACTCAACACTGCCGCTCCCGCAGCACAGAAACACAGCCAT 

TCAACTCCAGCACACGCCTGGGCTCAGGGGGAACACAGGACGATCTCCATTACAAGCTCA 

TAATGAATCAGACATCACAGAAGAA 

Sequence 1940 

CGCGTCCGTGAAGGCCAGACCGAGAGGTGCCAGAAGAGAACAAACCTCCATCCAGACACG 

CGGGCGGAAAGGCTCCAGGGGTCCAGGGCCAGATGGCGCCGCCTCTGCCCGACTCAGAAA 

GAGAGAAACAAGAGCCGGAGCAGGGAGAGGTTGGGAAGAGGCCTNGACAGGCCCCANGCC 

TTTGGAGGAGGCGGGTGATCTTCCTGAAGATCCCCAGAAAGTTCCAGAAGCAGATGGTCA 

GCCAGCTGTCCAGCCTGCAAAGGAGGACCTGGGGCCAGGAGACAGGGGCCTGCATCCTCG 

GCCCCAGGCAGTGCTGTCTGAGCAGCANAACGGCTTGGCGGTGGGTGGAGGGGAAAAAGG 

CCAAGGGGGGACCGCCGCCAGGCAACTCCGCCGGNGACACAGGGCAGCCCGCANAGGACA 

GCNACCACGGTGGGGAAGCCTTCCCTCCAGCGGAGAAGCCGGCTTCAGG 

Sequence 1941 

CCGCGTCCGCAGAAACATATGTGTAGTGTGCTGCAGCATAAGATGGAAGAACTTAAAGAA 

GGCCTGCGGCAAAGAGATGAGCTTATTGAGAMCATGGCTTAGTTATAATCCCCGATGGC 

ACTCCCAATGGTGATGTCAGTCATGAACCAGTGGCTGGAGCCATCACTGNTGTGTCTCAA 

GGAAGCTGCTCAGGTCTTGGAGTCAGCAGGAGAAGGGCCATTAGATGTAAGGCTACGAAA 

ACTTGCTGGAGAGAAGGAAGAACTACTGTCACAGATTAGAAAACTGAAGCTTCAGTTAGA 

GGAGGAACGACAGAAATGCTCCAGGAATGATGGCACAGTGGGTGACCTGGCAGGACTGCA 
GAATGGCTCAGACTTGCAGTTCATCGAAATGCAGAGAGATGCCAATAGACAAATTAGCCG

AATACCAATTT 

Sequence 1942 

CCGCGTCCGCTCGCCTGCCCCGGTGCACCCAGTCCGCTCACCCAGCCCAGTCCGTCCGGT 

CCTCACCGCCTGCCGGCCGGCCCACCCCCCACCCGCAGCCATGGACGCCATCAAGAAGAA 

GATGCAGATGCTGAAGCTGGACAAGGAGAACGCCATCGACCGCGCCNAGNNAGGGCCGAA 

TCCGACANGAAGCAANGCNTGAGGACCGCTTGCAAGCANGCTGGAGGAGGAGCAGCAGGC 

CCTCCAGAAGAAGCTGAAGGGGACAGAGGATGAGGTGGAAAAGTATTCTGAATCCGTGAA 

GGAGGCCCAGGAGAAACTGGAGCAGGCCGAGAAGAAGGCCACTGATGCTGAGGCAGATGT 

GGCCTCCCTTNACCGCCGCATTAGCTGGTTTGAGGAGGAGCTGGACCGGGCCCAAGGANC 

GCCTGGCTACAAGNCCTGCANAAACTTGTANGANGCCNNNNAANGCCGGCCTNATNATTN 

GCCNNGAAGAGGGAATTNAAANGTTNTTNNAAAAACC 

Sequence 1943 

GTCCGGTTAGTTTCTGCATTATNAGTNAGCATAAATAATAAATCCAGAAAACGTGCTGTA 

TTTGTTGTTGTTTCCTCCATGGGCTTTCCCGCCATCTAATTTGATATAGACTTCATCTCC 

CGGCTCCAAATGAAGAACCACACTGTTACTGGCATAGTCGTAATTCTGATCAGCATCTTG 

GGCAATTGCACTAGCACGCACCTACGTGAAACAGACAAGAATTAGGATGCGTAAATGAGA

ATTCTCAAGTTTTCTTTTTTGCCATTCATGTAGCATCACAAGCTGACTTGCTGCCATAGT 

ACAGAATTTAGCATAGCAAAG 

Sequence 1944 

GCGTCCGGCTGCGGGCGGTCGGGGCTCCGGGCCGGGGCGGCGGCGCCATCTTGTGCCCGG 

GGCCGGTGGGGAGGCCGGGGAGGGGGCCCCGGGGGGCGCAGGGGACTACGGGAACGGCCT 

GGAGTCTGAGGAACTGGAGCCTGAGGAGCCCCCGGCAGCCAAGAGGAGGAGGAGGAGCCG 

GGACTGGTCGAGGGTGACCCGGGGGACGGCGCCATTGAGGACCCGGAGCTGGAAGCTATC 

AAAGCTCGAGTCAGGGAGATGGAGGAAGAAGCTGAGAAGCTAAAGGAGCTACAGAACGAG 

GTAGAGAAGCAGATGAATATGAGTCCACCTCCAGGCAATGCTGGCCCGGTGATCATGTCC 

ATTGAGGAGAAGATGGAGGCTGATGCCCGTTCCATCTATGTTGGCAATGTGGACTATGGT 

GCAACAGCAGAAGAGCTGGA 

Sequence 1945 

CCCACGCGTCCGGCAAACCGGGAAAGGAGAGGATCCCGGAGCCGGTGAGAATTCTCTGTT

TTTTCTCTACCATCCTn'CCAGGCCTTTTCCTCACCTAATGAGTCGTAGAGACGAGGGCC 

CAAAAAGTCTGTAAAGGTGGCTGGTGAAAGATTAAGTGNTCCAAGGGCCCTACATTCCNG 

GANGNGGTTCGGGATAAAAGAGAACTAGTCNTGGGAACAATGTAAGTGGGAACNTNAAGG 

NANNGGNAAAGCGGCCNATAAAGGNGNNCGGAGGNCCCAATGGNANTAAAGCGGACCCTG 

TGTAGGTATAGAGTTGAGTCAAGTGGAGTCACTGCCTCTTGTCCCTCTTGGTCAGCGTGA 

TGGCCAGAGGCCTGGGGGCCCCCCACTGGGTGGGCCGTGGGGACTGCTGANCCTGGGCCN 

ACTTGG 

Sequence 1946 

ACGCGTCCGGCCGGGAGTGGTGGTGGGCACCTGTAATCCCAGTTACTCGGGAGGCTGAGG 

CAAGAGAATCTCTTGAGCTCAGGAGGCAGAGGTTGCAGTGAGCTGAGATTGCGCCACTGC 

ACTCCAGCCTGGGTGACAGAGGGAGACTCCGTCCCAAAAAAAAGAAAAAGAGAAACAGCT 

GTCACCTCCCGCAGGACCCAAATCCTCTCTCTGAGCACCGTCATCCACCACATGGCTNGG 

CCTNGNTTCCAAGANCNAGTCNANCCTTTNNNGNCTTANTTNNAGGTNGANNCCGCNNNT 

TTCNNCCCAAAGGAGACAGCCCCTGCTCCTAGATGCCCTTGGCCTCCGCAGTGCAGCCCC 

CAGGTGTCCTGACTGAAGCANAGGCCNTAGCCCCAT 

Sequence 1947 

NCGCGTCCGAAGTGGATGAMATTGGTACCATGGGGMGTCAATGGAATCCATGGCTTTT 

TCCCCACCAACTTTGTGCAGATTATTAAACCGTTACCTCAGCCCCCACCTCAGTGCAAAG 

CACTTTATGACTTTGAAGTGAMGACAAGGMGCAGACAMGATTGCCTTCCATTTGCAA 

AGGATGATGTTCTGACTGTGATCCGAAGAGTGGATGAAAACTGGGCTGAAGGAATGCTGG 

CAGACAAAATAGGAATATTTCCAATTTCATATGTTGAGTTTAACTCGGCTGCTAAGCAGC 
TGATAGAATGGGATAAGCCTCCTGTGCCAGGAGTTGATGCTGGAGAATGTTCCTCGGCAG
CAGCCCAGAGCAGCACTGCCCCAAAGCACTCCGACACCAAGAAGAACACCA 
Sequence 1948 

CGCGTCCGAGTATTTGAAGTGATGCTGGCTCAGACCGCTCCCACTATGCAAAATGTAACC 

CATACATGGATTCTCCACAATCAATAGG7TTTGCTGAACTGCTGAGCCTTGGCAGTGGGC 

GCCTTTGAGGCTTAGMGTGCCNNAAGNNTCTNNCANNCNGNTNNCATGCTTTNTTTTTT 

TGNCGCACTNTTANCAGTCANCAAANAAAATCCNGGGGNGNTNNTNANCCCCNAAGGCNC 

NNGNTNCCACNGTTCAGTGAAACTTGCTTCAGAGGCAGAAAGAAGCAGGTTTCCAAGCAA 

CMTCAGTGCTCCACACATGCATGCATATGCGCTAGMCTTCTATTTGATCAGTTGCATG 

AAGGAGCTAAAGCTCTTGATGTAGGATCTGGGAAGTGGAATC 

Sequence 1949 

CCACGCGTCCGGAGAGAATGGGCCGCGCGCGCCCGCGGCCGGGGAAAGCCTGTCGGGAAC 

CCGGGAGAGCCTGGCCCAGGGCCCCGACGCCGCAACCACCGACGAACTCAGCTCTCTCGG 

GTCTGACTCGGAGGCCAACGGCTTCGCCGAGCGCCGCATCGACAAGTTCGGCTTCATCGT 

GGGCTCGCAGGGCGCCGAGGGCGCGCTGGAGGAAGTACCCCTGGAGGTGCTGAGGCAGAG 

GGAGTCCAAGTGGCTGGACATGCTCAACAACTGGGACAAATGGATGGCCAAGAAGCACAA 

AAAGATTCGTCTGCGGTGCCAAAAGGGCATCCCGCCTTCTCTGCGGGGCCGTGCTTGGCA 

GTACCTGTCAGGAGGCAAGGTGAAGTTACAGCAGAACCCTGGAAAGTTTGACGAGCTGGA

CATGTCCCCTGGGGACCCCAAGTGNTTGGACGTGATGGAACCGTGACCTGCNCCGGCAGT 

TNCCATTCATGAAGAGTTTTTGTGTCCGGGGGGG 

Sequence 1950 

NGCGTCCGGCTTTACAACGGGCAAATACTGGAAACCATCGGAGGCAAACAGCTCANAGTC 

TTCGTATATCGTACAGCTGTCTGCATTGAAAATTCATGCATGGNGAAAGGGAGTAAGCAA 

GGGAGAAACGGTGCGATTCACATATACCGCGAGATCATCAAGCCAGCNNANAAATCCCTC 

CATNGAAAAGTTAAAACMGGATAAGCGCTTTAGCACCTTTCTCAGCCTNCTTGAAGCTT 

GCANMCTTGAAAAGAAGCTTCCTGACACAACNCTGGAGACTGGGACATTATTTGTGCCA

ACCAMTNGATGCTTTTAAAGGGGAATGACCTAGTTGAANNAAAAAAGGAAATTTCTTGA 

TACNGGGACCMAAAMTGGCTCNTTTCAAAMACAATTCCATTTCCTTTATTTNACCCC 

TGGACCACCCCAAGGGMGTTTTTTTCMTTTGGGGAMAMGNGGAATTTTTGG 

Sequence 1951

CAATNCTTGAACCAGGAAGGCGGAGGTTGTCGGGGGCTGAGATTGCACCACTGTGCTCCA 

GCCTGGGCGACAGAGGGAGTCCCTTTCTCAAAAAAATAATAATAAAAATAAAGATGGCAG 

TAGGAAGGTTTCAGCTTGAGATGCTGTCTTTTCTTCTGTTTTTATGCATAAATACAACGA 

AGACGGGAGAGGAGATGGAAAGCAAAGATGATTAAGTGAAATAATTGTGGGAAACAATAG 

AGGGATAGACTTTGCTTATAGGGGATGTGGACAGAGCAGAAAAATGGGAGGAATGGGGAG 

GATTCAGTTAGAGAAGGAAGAAACCGGTCCAAGGGGCTGGGGCTTTAGGCCCTGGGGCCT 

CCAGTGCCCGTATMGGCTGTGGCAGMGCCCTGCCCATTTCCGTTCCTTTCACTCCCTA 

TNTTCACCCTTACACCTCCCAAAAAC 

Sequence 1952 

CCGCGTCCGGGTTTTTTGGCCTTATTTTC 

GATTTCATTCATTGMGMTCMTACATATTTTGTTTCAAMTGmGAAACAAM 

TAGATGGTAGACTTTTATTAAAACATATATGGATGTGGAAAGCACATATATTAATGCAGT

CATCCCTTTTCAGGTGGGAAGAGAGCAAACCAGTTGATTTTTTMTTCATCCTTAGTACA 

CAGAGMTATCTTTTNCTCAAGGAATATACCCTGGTTGGAGCTTTAMAMGMTGGTTT 

TGGGM CCATT TCATTTTTCCCAANAAGGTTGCTATTCTTGGGGTAANTGGGGNATACCN 

GANTGNTTTTTCCNGGGGGGTTTTGGTGGTGGGGGGTAAAATTGGGGN M I I I I I 1 1 1 

Sequence 1953 

GCGCCGAGGGGTGTAATGCATTNGCAGCAAGAGCTATGAGAAATAACTTTAGACATTATT 
TCATTGMCCTTCCCAACTGAMTTATTTTATGATGTTATAACATGGATAGTAACTCAAG 
TAGCAATMGTTACACAGTTGTGCCATTTGTGCTTCTTTCTATAAAACCATCACTCACGT 
TTTACAGCTCCTGGTATTATTGCCTGCACATTCTTGGTATCTTAGTATTATTGTTGTTGC 
CAGTGAAAAAAACTCAAAGAAGAAAGAATACACATGAAAACATTCAGCTCTCACAATCCA

MAAGTTTGATGMGGAGMAATTCTTTGGGACAGMCAGTTTTTTTACAAQAAACAATG 

TTTGCAATCAGAATCAAGAAATAGCCTCGAGACATTCATCACTAAAGCAGTGATCGGGAA 

GGCTCTGAGGGCTGl TTTTTTm I GATGTTAACAGAAACCAATCTTAGCACCTTTTCAA 

GGGGTTTGAGTTT 

Sequence 1954 

TCGCCCCGCGTCCGGTTAAAACTGCCTCTTTAGATGTGGATGCCTTAATGCTGTAACACA 

TTTGAAAACATTGGCAATACTTMGTTGCTGCCATGATTACAGATGGAATTATTGGCTAC 

CAAAGAGACGCAATTGATGATGAGAAGCATGATTCTTGCTTCCATATAACCAAAGTTAAT 

CTTMTTGCMTTTGACTCCGTTTCCTTGGTAGGGATAGACTTTCTTCAGATTCCAAGTG 

CTCTCTTAMTGGCAAATTAAGTTAAAGAATACTACTGCTCCATTCCCCTCACTTATTCT 

CCAGTTAATTGCTTGTCAGTTCCATTTCMGAAAGCAGTGGATGTTCCAGGTTTGATTCA

GTTTTCCTGTGCACACTATTGCCAAA1 TTTTTT1 J AGCAAAGATTCTGCACTGGAACGTA 

GACAGTTGGAMCAGTACTACCTACTAGAGGGTATGGGGTTTCTCTTTCTCCCCGCTTTC 

ACCTCTTTCTTTTCCAATTC 

Sequence 1955 

CCNCGCGTCCGGCTCTGCCCAGTCACCCGGTCTCCTCCGGCTTCCCTCCGGCCAACAGCG 

CGCTCAGGCTCGCCTCAGGCCCCTCCAACGGAACAGGAGTCGAGGGGCAGTGAGGCCGGG 

ATGCGTGCGAGCGCGGGGCGCGGCTGGCGCTGGGCCGTGGGGGCGGGGCGGCGTGCGTGC 

CAGCGGCCGTGCGATTCTGTGAGGCCTGCTCTGCGCCGGCGGGGAAGCGCGGGCGACGCT 

ACGCTGGACGTCCTTACCTGCCGGAGGAGAGAAAGTGCCGTCAGCTGTAGGGG 

Sequence 1956 

GCGTCCGCGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGCGGAC 

GCGTGGGCGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGAGAATGTGCTATACCCC 

AGAAATTCCATCGATCTGTCATGGGCCCCAAAGGTTCCAGAATCCAGCAGATTACTCGGG 

ATTTCAGTGTTCAAATTAAATTCCCAGACAGAGAGGAGAACGCAGTTCACAGTACAGAGC 

CAGTTGTCCAGGAGAATGGGGACGAAGCTGGGGAGGGGAGAGAGGCTAAAGATTGTGACC 

CCGGCTCTNCAAGGAGGTGTGACATCATCATCATCTCTGGCCGGAAAGAAAAGTGTGAGG 

Sequence 1957

CCACGCGTCCGCTGGAGGCACTGGACGAGATGCTGACGCACGACATCGCCAAGCTCATGC 

CCCTGCTGCGGCAGGAGGAGCTGGAGAGCACCGAGGTGGGCGTGCAGGGGGGCGCTTTTG 

AGGGCACCCACATGGGCCCGTTTGTGGAGCGGGGACCTGACGAGGCCATGGAGGACGGCG 

AGGAGGGCTCGGACGACGAGGCCGAGTGCaGTGGTGACCAAGGACAAGTCCAAATACGACG 

AGATCTTCTACAACCTGGCGCCTGCCGACGGCAAGCTGAGCGGCTCCAAGGCCAAGACCT 

GGATGGTGGGGACCAAGCTCCCCAACTCAGTGCTGGGGCGCATCTGGAAGCTCAGCGATG 

TGGACCGCGACGGCATGCTGGATGATGAAGAGTTCGCGCTGGCCAGCCACCTCATCGAGG 

CCAAGCTGGAAGGCCACGGGCTGCCCGCCAACCTGCCCCGTNGCCTGGTGCCACCCTCCA 

AGCGACGCCACAAGGGCTCCGNCGAGTGAGCCGGGCCCCCCTT 

Sequence 1958 

ACGCGTCCGCCTGGCTAACATGGCCGAAAGGTCGTATTCTCCGGGGGAGGACGGAGGCCC 

GGAGAGGAGGGGGTGGAGTGCCTGGTTTTCCAGTCAGGCGGCCGGAGGGCAGCCCTCAAG 

AACGGCCCTGACCGCCCGCGGGGTGAGGGGCCCTTTCTGGGCAGGACCCGCCCCTTGGTC 

CCGCAGAGCCTTGGTACTTGGACCTGAACCTTGCTCCGAGAGGGAGTCCTCGCGGACGTC 

AGCCAAGATTCCAGMTGACTACTATCTTGACTTACCCCTTTAAAAATCTTCCCACTGCA 

TCAAMTGGGCCCTCAGATTTTCCATAAGACCTCTGAGCTGTTCCTCCCAGCTACGAGCT 

GCCCCAGCTGTCCAGACCAAAACGAAGAAGACGTTAGCCAAACCCAATATAAGGAATGTT 

GTGGTGGTGGATGGTGTTCGCACTCCATTTTTGCTGTCTGGCACTTCATATAAAGACCTG 

ATGCCACATGATT 

Sequence 1959 

CCACGCGTCCGGGGGACATGGTGGTGAGCAGGACAGAATTCCTGTCCTCATGAGTCTTAC 
ATTCTAGAAGGAAGGAGCAGATAAAATCTAAATAAGGTTATGAGATTGTGACGAAGCGTC
CGATTACACCTCAACAAGAAGGCAACTGACAAACAGCCTTATAGCAAGCTCCCAGGTGTC 
TCTCTTCTGAAACCACTGAAAGGGGTAGATCCTAACTTAATCAACAACCTGGAAACATTC 
TTTGMTTGGATTATCCCAMCCTTTATTTTTCTTCGGTAGCACCAATGAAAAGA 
Sequence 1960

TCCGGTGCAGCCGGGCTAGCTTCTCCTGCATCTCCCGAACGTGCTCCAGCTGCTCAAAGG 

AGCNTTCCTTCCCTGGAACGAANCCCCTGNCTGAACCCTNNNAAAGAGNCCTGGAACCCT 

GGAACCCTGGAACCATGGNCCCAGCCNTNCCCCANGNACTGGGCCCTACAGCCCTATCCA 

CACACCCGCTGAAACCAGCCCCCAGGACTCACCGAAGGCCNGGGGGCGGCCAGAGTGGAA 

ATCATTGAGCAGGTTCAGGAGTCCCCCCTCCATCTNATAGACATCAGTCACCTCGGTCAG 

GAAGGAGTGCTGCAGGGGCGTGCCTGCAGAGCCGCTTCCACCTCCAGCCAGGACT 

Sequence 1961 

NGNCGCGGTTCTCGTCTCCTCCCGGCCGGCGGAGCGAGTGGAGGCTGCAGCCCAGCTCGT 

CTCGGCGCCCGCGTCGCCGTCNCGAAGCCCCCCGCCCCGCTTCCGCCGCGTCGGAATGAG 

CTCCCGGAAAGTGCTGGCCATTCAGGCCCNAAAGCGGAGGCCGAAAAGAGAGAAACATCC 

GAAAAAAAATCAAGCAGAAGATTGAGCTGCTGATGTCAGTTAACTCTGAGAAGTCGTCCT 

CTTCAGAAAGGCCGGAGCCTCAACAGAAAGCTCCTTTAGTTCCTCCTCCTCCACCGCCAC 

CACCACCACCACCGCCACCTTTGCCAGACCCCACACCCCCGGAGCC 

Sequence 1962 

CGCCCACGCGTCCNNCCCGCGTCCGGGAGAAGATGGAAGCAGTGCCCGACGTAGAGCGCA 

AGGAGGACAAGCCCGAGGGGCAGTCACCTGTGAAGGCTGAGTGGCCCAGCGAAACCCCGG 

TGCTGTGCCAGCAGTGTGGCGGCAAGCCTGGCGTCACCTTCACCAGTGCCAAGGGCGAGG 

TCTTCTCCGTACTGGAGTTTGCACCCTCAAATCATTCTTTTAAGAAMTTGAGTTCCAGC 

CTCCAGAAGCCAAGAAGTTCTTCAGCACAGTGCGGAAGGAGATGGCGCTGCTGGCTACCT 

CACTGCCTGAGGGCATCATGGTCAAGACTTTTGAAGATAGAATGGACCTCTTCTCAGCTC 

T 

Sequence 1963 

TCCGCGGGNGCGTGGGCGGCCACTCATTTGCATTATCTTAAATCACAAATAATTACTTAA 

TTNGCTGGAGTGTGTGCTTTGCAACTTTTATACCAGAGTAAAATTTGTATTTAAAACAAA 

AAATAAGAATGCCCATCACTAGGAGAAACACTCCTCACAGAAAACACACACACACACACA 

CACACACAATTTAAAMCTGAGTAAATTTAAATGTATGAAAGGCNCCACAAATTGATTTC 

MCAMTMATTTCATTTCTCTAGCTACTTATGTCTGGCCTTATTTTGAGCGTTACMTT 

TTATTGCCTTCAT7TGCCTATTTAGACTGATGTAGTTTGATATGATGAAGT 

Sequence 1964 

CTNTAGGGAGNCGACCCACGCGTCCGCCCCGCGTCCGGTTAGTTCTACCTGGTGCCCATG 

TTCTGATTGTGTGTGGGATTGCATGGTGTCCTGATTGCATCTAGGTGGAGCGGATGGAAT 

GTGCTGGGCCACTGTTGGGTGGAGAGCAGCACATTCTTACAGAGGAGATGGAGCGTTATG 

AGCATAGTATGTGGATAGGTATCTTCACCTGCCCGCCCCTGAGTCAGCCTCCTTGACTTG 

ATAGCTTGAAGMTCCTTTTCCACTGAAATAGAGGATAATTAATTGACACATCTGAAATC 

CCCAATCAATCAATCAAGAGAAAGGTAGAACTAAAAACTCCTTAACTTACTGTTGCTTAC 

ACCCCTGAAAGTCTGTTTTT 

Sequence 1965 

CGTCCGGCGCCCTCCAGCCCCGTCCGGGAGTCCCCGGCCCGCTGCGGTGCCGGGAGTACC 

TCCAACCCCCTGCNCCCCNNANGGAGGCCIMAGGGGCTTAGCCACCAGGGCTCGGAAGTGG 

GGGCCGAATCCGGTGCNNGNCNNNNNNNAGGNNAGCANAGCCGGAGTTGGGGAGACTGGT 

TGCTGAAAAGCCAGGAGTCAAAATGACTGAGCGCTTTGACTGCCACCATTGCAACGAATC 

TCTCTTTGGCAAGAAGTACATCCTGCGGGAGGAGAGCCCCTACTGCGTGGTGTGCTTTGA 

GACCCTGTTCGCCAACACCTGCGAGGAGTGTGGGAAGCC 

Sequence 1966 

CCCACGCGTCCGGTGCACTCAACAGAACGGCCCTTCAAATGTGAGAAATGTGAGGCAGCT 
TTCGCCACGAAGGATCGGCTGCGGGCGCACACAGNACGACACGAGGAGAAAGTGCCATGT 
CACGTGTGTGGCAAGATGCTGAGCTCGGCTTATATNNNNGACCACATGAAGGTGCACAGC
CAGGGTCCTCACCATGTCTGTGAGCTCTGCAACAAAGGCTTCACCACGGCAGCATACCTG 
CGCATCCACGCGGTGAAGGACCACGGGCTCCAGGCCCCGCGGGCTGACCGCATCCTGTGC 
AAGCTGTGCAGCGTGCACTGCAAGACCCCTGCCCAGCTGGCCGGCCACATGCAGA 
Sequence 1967 

CGNCCCACGCGTCCGGCGGTGCCGCGGGGATGGCGGGAGCCGGAGCTGGAGCCGGAGCTC 

GCGGCGGAGCGGGGGCGGGGGTCNAGGCTCGAGCTCGCGATCCACCGCCCGCGCACCGCG 

CACATCCTCGCCACCCTCGGCCTGCGGCTCAGCCCTNNNNCCNNNNNANNNATGGCNNGN 

TCAGGGGGCCTGGGGTCTGGGGACAACGCCCCGACCACTGAGGCTCTTTTCGTGGCACTG 

GGCGCGGGCGTGACGGCGCTCAGCCATCCCCTGCTCTACGTGAAGCTGCTCATCCAGGTG 

GGTCATGAGCCGATGCCCCCCACCCTTGGGACCAATGTGCTGGGGAGGAAGGTCCTCTAT 

CTG 

Sequence 1968 

GCGTCCGGGCGTGTAAACCAGCCGGAGCGGCGCGGCAGCGGCAAGGACCGCCGTGGCGCC 

TAGAGTAGCCGACCCGGGGGGAGCGCGGGGCGACGCTGGCTGCAGGGACCCGGTGACAGC 

GTGAGAGGTTCGCAGAGTACTAGGTTTTGACAAGCTTGCATCATGCGTGAGTATAAGCTA 

GTCGTTCTTGGCTCAGGAGGCGTTGGAMGTCTGCTTTGACTGTACAATTTGTTCAAGGA

ATTTTTGTAGAAAAATACGATCCTACCGATAGAAGATTGTTATAGAAAGCAAGTTGAAGT 

AGATGCACAACAGTGTATGCTTGAAATCTTGGATACTGCAGGAACGGAGCAATTTACAGC 

AATGAGGGGATTTATACATGAAAAATGGACAGGGATTTGCATTAGNTTATTTCATTNCAN

GCCCANNCCCAATTTTACGTTTAAAAAACCTGNNNNAAAAAMTTTTTTTGGNTAAAAAN 

CCNTNTTGTNTCCCCAAAANTTTTTTTGGGGNANAAAAANNGGCCCTCCNAAAAAAAAAA 

Sequence 1969 

CCCACGCGTCCGCACGCCAGTGCCTCCCTTTACCTACTAATGAGGCAAAACTTTGAGATT 

GGGAATAACTTTGCCAGGGTTAAAATGCAGGTAACAATGTCACTATCCTCCTTGGTGGGC

ACATCTNANMTTTTMTGMGAATTCTTAAGACNGTCTTTCTAAANNACTATTTNGTAC 

ATTATGCTTGAAGAANATNTGNGAATTGAGGGAAACA 

Sequence 1970 

GCGTCCGGTTTTCCMCTGCAGCTTTTTAATGGTTAACCTTCATCTA AI I I 1 1 I I I CTCC 

CACTGGTTTATAGATCCTCTGACTTGTGTGTGTTTATAGCTTTTGTTTCGCGGGGTTGTG 

GTGAGGAAGGGGTGATGGCATGCGGAGTTCTTTATCTTCAGTGAGAANNGGCCCTGCCCG 

CCTGAGAGCCAGCTTNCGCGTTGGAGGCACCGNGTTCAGAGAGCTGCTGAGCGCCACCCT 

CTACCCNGCTGACAGACAACACAGACCTGTGCCGMGGCTMNTTGNGGCTTTTTACGAC 

CCTACCCCACCCCCTGTTTTCAGGGGTT 

Sequence 1971 

CGTCCGGTGAGATTCTCCGTAATGGGCGGGGACAGAGTGCCCTGCAGGAGATTCTGGGCA 

AGGTTATCCAGGATGTGCTAGAAGACAAAGTGCTCAGCGTCCACACAGACCCTGTCCACC 

TCTATAAGAACTGGATCAACCAGACTGAGGCCCAGACAGGGCAAGCGCAGCCATCTCCCA 

TATGATGTCACCCCGGAGCAGGCCTTGAGCCACCCCGAGGTCCAGAGACGACTGGACATC 

GCCCTACGCAACCTCCTCGCCATGACTGATAAGTTCCTTTTAGCCATCACCTCATCTGTG 

GACCAAATTCCGTATGGGATGCGATATGTGGCCAAAGTCCTGAAGGCAACTCTGGCAGAG 

AAATTCCCTGACGCCACAGACAGCGAGGTCTATAAGGGTGGTCGGGAACCTTCCTGTACT 

ACCGCTTCCTGAACCACTTGTGGTGGCTCCTGACGCCTTCACATTGTGGCCATGGCAANC 

TGGTGGAGCCCTGGCTGCCCCCCAGCGCCATGCCCTGGG 

Sequence 1972 

GAGGGGAGGGAAATTTATTTCTCTGCTTTTCTATTATACMGTTGTTTACAGAAACTGCA 
MTTAAAAAATTACACTGGCATTTGCAGTCCTTAMATAMTTAAAAGTTCTCAACTTTT 
I I I I I I I I GC TAAAC ATTTTTTTMGTATGAG TCCTT TGTTTAAAAAGAAAN AGATTANA 
ACAGMAATATTTTCTATAMTAATACATGTATTTTGGTTTTAGTGCTCCCGCCC 
Sequence 1973 
GCCGACCCACGCGTCCGACATCCCCAGGCACATCCGGAAGGAGGAAGGTTCCT7TCAGTC

CTGCTCCTTCTGCANGNCCAAGAAATTCACTACCATGATGGTCACACTCAACTGCCCTGA 

ACTACAGCCACCTACCAAGAAGAAGAGAGTCACACGTGTGAAGCAGTGTCGTTGCATATC 

CATCGATTTGGATTAAGCCAAATCCAGGTGCACCCAGCATGTCCTAGGAATGCAGCCNCA 

GGAAGTCCCAGACCTAAAACAACCAGATTCTTACTTGGCTTAAACCTAGAGGCCAGAAGA 

ACCCCCAGCTGCCTCCTGGCAGGAGCCTGCTTGTGCGTAGTTCGTGTGCATGAGTGTGGA 

TGGGTGCCTGTGGG 

Sequence 1974 

CGCGTCCGCCGAGAGAAGAGGTACGGTCGAAGCCCGGAGCCAGGCCGAGCGGGAGCTGAC 

CAGGCTTGACTCGGGTACAGAACGAGGCACCAGTCCCCTTGCGAACCGAAGGGCCTCGCA 

GTGGATGGAGGAGGCCCAGCCCTGAGGTCAACGCCAACCAGGCTAGCCTGGCACGGGGCC 

TACAGGGTGGGTAGGCGGGCGTGCCGCAGCCGTCCAGGGCCTTCCCTCAGGTCCCGGGCC 

GAGGGGCCTACGCTGCGGCCCGGCAACAAGGCCCGACTCGGCCCCTCGGGACCAGAGCCC 

CACCCGATCGGAAGCGGATC 

Sequence 1975 

GGCCTCAGGGGGNAGNNATCCTGCAAAGACNNACATGAGCCCANAGGGGAAANAGAGNCA 

CCTGNGAGTACNNGCCTTTGGGNNTGACCTTGGCTCTCAGCACAAGATATTTACAGCCTN 

TGAGCTTGATATTCTAGAATTGNTACAGAGATAGCTCTGGAAAAGAAATAGACTAGAAGG 

ATAMGGGAAGGAATCATAGCTTATGMGGTTTTTACTCTGCATCAGACCGNTTTCTAGN 

NCTATGACTTAACGTCCTATAGGCTGTAAGGTTCTCTGCGTGAACACTTCTTTNCTGGCC 

TCCTTTCTGCCCCATTCCTNTTTAMCTCAGTTGCTGAGTTTATTATNCCCTGTGCATCC 

CTGGGCATTGNTCATTCACATATGGAMCMTTCANGGAGGACCCTTTGCCTANTTTNCT 

TTATCCTCTTGMTTNTGGATGGGAAMATNTTAANTNCTTTTGGCCGCCTGCANTNGGG 

GNAANTATTGGGCCTT 

Sequence 1976 

CGCCCCGCGTCCGCGTTGAACAGCTTTAAGAACAATGTGAATGAAATCTTAGCAACTTGG 

TTAGTAATCTGAAMGTCTATTMTGTATACTTGAAATTCTGTTTGTATAAAAATGCATT 

TTCCCCTTTATTTTAACACTGTGTAAAAGMCATTATGCATGTGAGTGGTTTGAGAATTA 

AATGGTTTAATACTCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

MMAAMAAAAAAAAAAAAAAAAAAAAAAAAANGGGGNGNCCNTTANNNTNNTTNAAAA 

AAAAACCCCCCNNNCCNCCCCCTGNACCCGNAANNAAAAAANNANNCCNTNGGGGGGGGA 

ANCCTNNTTNTTGNCCCTTAANAGGGGNCCCAAAAAANCNATTNCCNCCCNANTTTCNNN

AAMAANNCTTTTTTTCCCCNNNTTNNAAGNGGGGGGNNGGCCCANCCCCANAAGGGGTT 

TTTNAAAGGGGGGNAACCCCCNGGTCCCCGNCCCCAAAAAAAA 

Sequence 1977 

CGCCNCGCGTCCGGGCTGCCGTAACAATCGCCACAAACCTGGTGACTTAAATAACAAATA

TTTATTTTCTCGTAGCTCTGGAAGCCGGAAGTCCAAAACCAATAACTTGGCAGGGCTGTG 

CTCCCTCTGGGAGATCTCGGGGAGCCTCCTTGCTACATGTTCCGCTTCTGGGGACTTCTG 

GTGGTGTTTCCCTGGAGGGCCAAGGCCTCCGGGCGGCCCGTGCATTCGGTGAACACCTCC 

MAGAGCTCTGAGTCACMAGCCTTGGGAAACTTTATTTTATTCTTTCTAGGACATTATC 

AGTAGTCCCGAGGAGACATCAATTACAAAACAAAAAAAAAAAAAAAAAAAA 

Sequence 1978 

CGTCCCGACAGTCAGCCGCATCTTCTTTTGCGTCGCCAGCCGAGCCACATCGCTGAGGAC 

ACCATGGGGGAAGGGTGAAGGGTCGGGAGTCAACGGGATTTGGGCGGTATTTGGGCGCCC 

TGGGTCACCAAGGGGCTGGCTTTTAAACTTCTGGGTAAAAGGTGGGATATTGGTTTGCCA 

ATCAMTGACCCCCTTCATTTGGACCCTCAAACTACCATGGGTTTACAATGGTTCCNAAT 

ATGAATTCCACCCAATGGGCAAAATTCCCATGGGCACCCGTCAAAGGGCTTGAGAAACGG 

GGAAAGCTTGGTCAATCAAATGGGAAAATTCCCNATCACCATCTTTCCAGGAGCCGAGNA 

TCCCTCCAAAATCAAAGTGGGGGCGATGCTGGGCGCCTTGAGGTACCGTCGTTGGGAAGT 

CCACTTGGCCGTTCTTTCACCCAACCATTGGGAAGAAAGGGCTGGGGGCCTCATTTTGCA 

GGGGGGGGGAGCCAA 



Sequence 1979

GAGTGTAGTTACTTCGGTCTTGCCTCACTGGGAGGCCACGTTGGTGACGATGCACACGAA 

GCCCCGGTACTTGTCCAGGTTAACCATGTGCCCGTCCGATGTCCTTGGCGGAAAACTCGT 

GCATGGAGCGCGCACAGCGCCAGTCGTCCNGGGACGCGCACGGGTCCTGGGACTTGGGAC 

ATTCGNGGATCTCCGGTCGCAAGCTGCGGGGACAAGGCTTCTTCCGCCGGCGCCGCGCCC 

TCCTGCGCCTGCACGCTGGGGCCTTCCGCCGGCGAGGGGCTCGCGGCCGCTAGACTA 

Sequence 1980 

CGCGTCCGATCGGAAGTGGCGCTCGTGCATTCAACTTGTTCCCGCTCATGGAACCCCTCT 

TTAAAAAGACGGAGGGCACCTGTGAGCGCAGGAGCGAGCCTAAGGCCACCCAGCGGCAGC

GCCCGTGTCCTGGGCACTCAGCGTGCTGGGCAGAGCAGGTGCGATGGCCCCAGTCCTAGC 

AGCCCTCGCCCATGTCCTGTGCCCTTACATGGCTCCCGGACTGTGCAGGGAGCCGATACG 

TTTGCTGATAGCAATACTGGAACCACCCGGGTGCGATGGCAGTGAGGAGACTGCCCAGTG 

CCTTTGGGGCTGTGCTTGCAATAAAGAAAGAATTTCCTGGAAAGGCAGTCTGCAAAAGAG 

GGMCCCGGTGACTCAGAAAGACAGGATGTTTTGGTAATTTACCCCNAAATGTGCCATCC

ACCATAGTGCTTTTTCCTCTTGCCCTTCGGCTTGTTTGMTCTCACMTTATGGTATTTA 

ATTCTCAAAGAAATATGTATCTGTTAGCCCGNTTGGTGACACTTATACAGATGATTAA 

Sequence 1981 

CNCGCGTCCGGTCGAAGCGAGGACGTGGTGGGTCCTCTGGTGCGAAATTCCGGATTTCCT 

TGGGTCTTCCGGTAGGAGCTGTAATCAATTGTGCTGACAACACAGGAGCCAAAAACCTGT 

ATATCATCTCCGTGAAGGGGATCAAGGGGACGGCTGAACAGACTTCCCGCTGCTGGTGTG 

GGGTGACATGGTGATGGCCACAGTCAAGAAAGGCAAACCAGAGCTCAGAAAAAAGGTACA 

TCCAGCAGTGGTCATTCGACMCGMAGTCATACCGTAGAAMGATGGCGTGTTTCTTTA 

TTTTGAAGATAATGCANGAAGTCATAGGTGAACAATAAAGGCCGAGATGAAAAGGTTCTG 

GCCATTACAGGACCCAGTAGCAAAGGGAGTGTGCAGACTTGTGGGCCCCCGGATTGCATC 

CMTGCTGGCAAGCATTGCATGGATTCTTCCAGTATATTTGTAAAAAAAAAAAAAAAAAN 

NAAA 

Sequence 1982 

GCGTCCGTGGTAACGATTGGCCCTAAGAAGCCCCTGCCTGACCCGTGAGCATTGTGGAAC 

CCAAAGATGAGATACTGCCCACCACCCCCATCTTCAGAACAGAAGGGTGGGGAAGCCAGA 

GCCCGNCTGCCATGCCCCAGCCAGTCCCCACAGCATAACAGGGTCTCCTTGGCAAGCTGT 

ATTCTGGAGTCTGGATGTTGCTCTCTAAAGACCTTTMTAAMTTTTGTACACTGGACTT 

TAAAGTATTGNTCACAAGGGTTATGCAATTCCNNGNCANG 

Sequence 1983 

CCCCGCGTCCGTGACAATCGAGTAGTACTCCCGATTGAAGCCCCCATTCGTATAATAATT 

ACATCACMGGACGTCTTGGCACTTCATGAAGCCTGGTCCCCACATTAGGGNTTTAAAAA 

MCCAGGNATGGCMTTTCCCCGGGACCGGTCTAAAACCAAAACCCACTTTTCAACCGGC 

TTACCACGGACCCGGGGGGGTATACTTACGGGTCAAATGGCTTCTGGAAATCTGGTGGGA 

GGCAMCCACAAGTTTTCATGCCCATCGTCCTAAGAATTAATTCCCCTAAAAATCTTTGA 

AATAGGGCCCGTATTTACCCTATAGCACCCCCTCTACCCCCTCTAGGAGCCAAAANAAAA 

AAAANAANAAANNAAAGTGCGGCCCGCTAGACTTAGGTCTAGGAGGAAAAAAACCTTTCC 

ACACCTTTCCCCTGGAACCTGGAMCATMAAATGMTGCAMTTGTTTGGTTGTTAACT 

TTGGNTTATTGGCAGGCTTATAAATGGGTTTACCAAAATAAAAGGCCAATAGGCATCACC

AA AA ATTTTC AAC AA A AT AAAAG G C A I I ! I I I I I I i I AACTGGCATTTCTTAGTTTGGNG 

GGGTTTTGGTCCCAA 

Sequence 1984 

CCGAAAGAATATCTGTGTGCTTAGGGAGGAMCTTTTTGATCTGCAGAAAAGCCAGAAGA 

CATCTAGGACATCCATAAAAATTCATCAGAGAGCATTTTACTACTGAGCTGCAAAGGGAA 

AAACTTAAAATGGGATATGAAAAGTGAAGAAAGTGATCATAGGGAGAAAAACCATTTCAG 

ATGACAAGAGCACCTCAAAGGCAGCAGCCTCAAGGAGCAGCCATGGCCCCAGACTTGTCG 

CACGGATGCAGAAMCTTAATGGAGGAGGCTGAGGTCAGMTGGGMGAGTTTTTAAAAA 

ATAAAAAGGGGAGCTAATATGTGAGGGACCAAAAAAANNNNCANAAAAAANAAAGTGCCG 
GCCCGCTAGACTAGTCTAGAG
Sequence 1985 

MGCTTCCTGGTTCAMTGTGCCATTTCCCGGGTTGATGCTGCCACACTTTGTAGAGAGT 

TTAGCAACACAAGTGTGCTTAAGTCAGGCGGTAGGGAAATCCCTCACTAAAAGCAGGAAG 

AAAGGTTCCAATTCAAAAGGTGCCCAATGGATAGGAAGTCAAACAAGGAAAGGGTTAAAT 

GGTTNGGAAAACCACAAATCAGGGTGGTGGGATTTGGGTGCTTACTTTTGNAACAAAAAA 

GGGTCCCCCCTGGTGGGTCTTTTTGGTTCAAACAATTGGTACAAATTGGTAAGAAACTTC 

TTGTCCAMCCACTTAAATTTATTTTTGGTCTTTGGAGGTTm 

NGACTATTGGGAATCCCCGGCATGCCCTGGAATTCACTAAAAGCCAAAGGGGGTCTTGTA 

AGGCCAACGCTGCTTCTTCTGGAGGACTTTCCATTTCCTTTTCTGGATTGGGCAACACCG 

TGCAGGCTCATGGACAAATCTGGTAGGGATAACAAATTCAGTGGTGGGANTTTCCACTTC

TTTTTTCMGTCCTTTCATGTTAAAAGGAATTTTAAGAACACCCACATACAA 

Sequence 1986 

GNGTCGACCACGCGTCCGGAGCAGCAGCCATGGCCCTACGCTACCCTATGGCCGTGGGCC 

TCAACAAGGGCCACAAAGTGACCAAGAACGTGAGCAAGCCCAGGCACAGCCGACGCCGCG 

GGCGTCTGACCAAACACACCAAGTTCGTGCGGGACATGATTCGGGAGGTGTGTGGCTTTG 

CCCCGTACGAGCGGCGCGCCATGGAGTTACTGAAGGTCTCCAAGGACAAACGGGCCCTCA 

AATTTATCAAGAAAAGGGTGGGGACGCACATCCGCGCCAAGAGGAAGCGGGAGGAGCTGA 

GCAACGTACTGGCCGCCATGAGGAAAGCTGCTGCCAAGAAAGACTGAGCCCCTCCCCTGC 

CCTCTCCCTGAAATAAAGAACAGCTTGNCCGGATAAAAAATNAAAAAAAAAAANAAN 

Sequence 1987 

GCCACGCGTCCGCAGGGAACGTGATTAGTGAAAGGAAGATAAACGTGGATGTTACTCCAA 

AACTTCGTTTMTGMTGCTTAAAGMTTCAAATTTTATCTGCCTCTCTTGTAATTTGGA 

TCTCTTCTTAATGTACATAGTGCTMCATGAAGACCTTTTTCTGCACTATATGCAAACAG 

GGTMCTMCTAAAACAMGCCACTTTCMTCTTCAATCCTTGAAGGTATATCTAGGTTT 

ATGACAGTAATTGTGTTTACATTTTATGGTGCCTAGTATTGACAAMTGTTATTTCCCTA

CATTAMCATGACTCCATAGACCTTTTCATTTGTGGGGTTTTTATTTCCTATGATGTATA 

CTGCCACTAACCTTNCAAAAATTACTTAGTATTGCAAAGTCAGGGAATCATCAGGGAACG 

TTTAGCTTGGCCAAAATACTTGGTCTGGTTTTAAAAACCCTGTCNAGGTCTACCAAACCT 

GTTCAAGGTCTACCCAATTTTAAGGGGCAAATTTGGGGGNAAAAAGGAAAAAAT 

Sequence 1988 

GGTGTCGACCNCGCGTCCGCGAGTCCCGCGTTCTCTCCTTGAATCCACTCGCCAGCCCGC 

CGCCCTCTGCCGCCGCACCCTGCACACCCGCCCCTCTCCTGTGCCAGGCAAGGTGACCCC 

ATGGCAAGGCGCAAGCCAGAAGGGTCCAGCTTCAACATGACCCACCTGTCCATGGCTATG

GCCTTTTCCTTTCCCCCAGTTGCCAGTGGGCAACTCCACCCTCAGCTGGGCAACACCCAG 

CACCAGACAGAGTTAGGAAAGGAACTTGCTACTACCAGCACCATGCCCTACCAATATCCA 

GCACTGACCCCGGAGCAGAAGAAGGGAAGCTGTCTGACATCGCTCACCGCATCGTGGCAC 

CTGGCAAGGGGCATCCTGGCTGCAGATGAGTCCACTGGGAGCATTGCCAAGCGGCTGCAG 

TCCATTGGCACCGAGAACACCCGAGGAGAACCGGCGCTTCTACCGCCAGCTGCTGCTGAC 

AGCTGACNACCCGGGTGAACCC 

Sequence 1989 

CGTCCGACAACATTTGGCATNAGGGTTGTATCTGTTGGTGGAGGACACAACGCCAAAAGG 

MATGGGATTTCCTGGTTAGGCCTGCGGCTTGGCAGATGATTGTTATGGGAAAGACACTG 

AGTCTGTTTAGGCMTTTCTTCCTCTTTACTMTMAGTGTTCCTATTTTGAAAGCAATG 

CTGAGTTGTGGACATGTGTATAAACCGTAATGCTGTAAGTTAGGCCTCTCTTGTCTAGAA 

TCTCAGCCTCTTCATACTCTCTCCCCCTTTTTTGCCATATTATTTTCCTTATCACTAACT 

ATATATTTTACCTGTCTCACTTACTATCTGTCTCTCCTCTCAGAGTGTTACTCCAGGAGG 

GCAGAGATMCTGTTTAGTCCAGGCTGTGCCTTTCGGTGCTCAAAATAATGCCTGGGTGC 

AAAATAAATATATCTTGAATGAAATAATGAAGTAATCTTTAAAATGGTGCTCCGAGCATA

ATTTTCTATAGTAACACATAGTTTGCAATTAGTTGTCTCTTTCAGGGATAATGGAAATTG 

GTCAACATAAGAAAA 
Sequence 1990

NCCCCGCGTCCGAATTTGTTTTATGGATTGMTNATGTCTCCTCTAAAAAGATTTATTGA 

TGTTCTMCCCTCCATGTCTCGTATTGTMCCTTATTTGGAATAGGGTTGTTGCAGAAGT 

AATTAGTTACATTAAGATGAAGTCATGCTGGAGTAGGGTGAGCCCCTAATCTAAGAGGAC 

TGGTGTCCTTATAGGAAGAGGAAAATGCCACATGAAGAGAGAGAGAGATGCCCAAGGAGG 

TGTCATGCTGCAATGAGGGCAGAGATTGGAACTGTGCAGCTGCATGCCAAGGAACACCAA 

AGATTTCAGCAAACTACTGGAAGCTAGGAAAAAGCAAAGAATTCCCCTAAATATTTCAGT 

GGGAGCATAGCCCTGCCAACACCTTATTTCAGATTTCTATCCTCCACAACTATGAGGCAA 

TMGCTCTGCTGTTTTMGCTCCTCAGTGTGMCTATTTTTGTCATGGCAGCCTTAGGAA 

ACTMCACMTTTACAAACAGATTAAGTTCACTAATCAAAAGACAGCCCTTGCAATTGGG 

G I I I I J I 1 1 i 

Sequence 1991 

CCGCGTCCGGCGGGTGAGCCGCTGGCGCGCCGGGCCGGGCGGGGGATTGGCTGAGGGCGA 

CGCGAGAGAGGGGAGACCCGGACTGAGGAGAGGACGGGGTGGAGGGTCCCGGCCGGAGGC 

TAGCCTGAGGAGACCGGGGGCGGAGGGGAGACCCGGGCCGCGGAGGAAAGGGATGGAGGA 

GAGGAAGCCGCCGGGCGCCAGCGGGACCCCCGGGCTGAGGGGAGAGGCGCCCCAGGCCGG 

GTGAAAAGTGGCCGAGGAGACCTGGGCTGGGCTGGCAAGTCCCGGACCGGGGAGGAGGGG 

AGCAGCCCTCCGATGTGAGGGATCGCAGAGGAATGAGCTTCGTTCTGGATTAAAAAAAAA 

AAAAAAAAAAA 

Sequence 1992 

GTCACCACGCGTCCGCAGCGCACGCCCCNTGCCCTGAGAACAGGAAAGGGCCCGGAAGGG 

CTGACTCACCGGGCCGACNCTCACACGAAAATGGGATGCACTTTATTTGCTGGTGCAAAG 

GCAGGTGAGGGTGCCTNCTGNGTGACCGNTGGCCCTNCTGCCTGGNGGCGCTGAAGGGAA 

GGAGCCAGTGAGCCTGACCCCGGGAGGGGCCGTCCCCGTGTGCCGCGTTNGGCGGGGCCC 

CACGCGGCTCCCCCANGCCCGGGTCTGGGGCCCCAGGCTTTCCCCTGNCTNGNGGNCGNN 

CNTNCCNNGCTTTGGGNTNCTTGNNTNGGNNTTTTTAATGCCAGNNNTCANNACATAANT 

GCNTTNTGAAAGAGGTTCCAGCTATCACTTGTAACCATATATATATACATATATATTCTA 

TCTACAMGTGTTTATTTNCAAAGATNTTTCAACGGTGAATTCAGTCCCCGGCCGCCCTT 

NTGACCATCTGTNCCCNGTCCTTGTCCCCGCCCCGGGG 

Sequence 1993 

TCNCCNCGCGTCCTTGATATTTTGAGAAAAATCATGTGAGTCA I I < I < I CTGTTTCTCTT 

TTCTCTTAACGATTATCACTGTAATTCTGAATCTGAAAGGTAAAACAATTAGTCAAAATA 

TTATTGCCATCATTCTACCTGTGTTATGAMCTACTTATTCATAGTTMTTCTCATTAAC 

ACTTACATTTCCATAMGAAMCTCAAGTATTAATAAMGAGACTTTACTGGCTTAAGAG

GGCTGTGAMGATTTTTGATAGTGAATCATGACCCTAAGGGAGAGATTTGTGTGATAAAA 

GTATTGTATATMTAGATCAGCGATTTTTGTAAGGCAAACAGMTTTGTMGTTGGCAGA 

TCTTCCTMGTTGCAAAATGTAATGATGAGCTTGGTGGGAGAAGAATGAGTCGTTCTTGG

AATACCTATGTGCAGCCACTACCCATCTCAATGTCACCTTGTTTGCATTCTTGGATAGCT 

TGTATATGTAGTAGTTTGATGMTMTTTAAAGAAAAACACCTAAAATTTGAAAAATGAT 

TGTAGGGATCAAAAAAGGCAGATGAAATTAC 

Sequence 1994 

CACGCGTCCGCTGACCTGACCCNTTCCTGATCCNAGGCCNGGTGGTTGTCTTATTGCACC 

ATACTCCTTGCTTCCTGATGCTGGGCAATGAGGCAGATAGCACTGGGTGTGAGAATGATC 

AAGGATCTGGACCCCAAAGAATAGACTGGATGGAAAGACAAACTGCACAGGCAGATGTTT 

GCCTCATAATAGTCGTAAGTGGAGTCCTGGAATTTGGACAAGTGCTGTTGGGATATAGTC 

AACTTATTCTTTGAGTAATGTGACTAAAGGAAAAAACTTTGATTAAAAAAAAAAAAAAAA 

AAAAAAAGTGCGGCGGCCGGGCGCGCCGGGGTGGGCGGCGAGCGGAGCCGGCCGGAGCGG 

GCCGGGCA 

Sequence 1995 

TCCGACNAAGGAACAAAAGCGAAACACACAAACCAGCCTCAACTTACACTTGGTTACTCA 
AAAGAACAAGAGTCAATGGTACTTGTCCTAGCGTTTTGGAAGAGGAAAACAGGAACCCAC 
CAAACCAACCMTCAACCAAACAAAGAAAAAATTCCACMTGAAAGMTGTATTTTGTCT

TTTTGCATTTTGGTGTATMGCCATCAATATTCAGCAAAATGATTCCTTTCTTTAAAAAA 

AAAAAATGTGGAGGAMGTAGAAATTTACCAAGGTTGTTGGCCCAGGGCGTTAAATTCAC

AGATTTTTTTMCGAGAAMACACACAGAAGMGCTACCTCAGGTGTTTTTACCTCAGCA 

CCTTGC TCTTGTGTTTCCCTTAGAGATT^ 

I I I I I I AATAAAAATGAGTTGGAAAAAAAATAAGATATCAACTGCCAGCCTGGAGGAAGG 
TGACAGTCCAAGTGTGGAACAGCTGTTCTGAATTGTCTTCCGCTAGCCAAGAACCTATAT 
GGCCTTCTTTTGGACAAACCTTGAAAATGTTTATTTAAA 
Sequence 1996 

ACGCGTCCGMGAGGTATATCCCATACCTT7TCCCCAAATTCCCTTGTCATTAATTTTCC 

AATTATGCTTCTTCCAAGTGCCTGACCATCCAGCCAAACCATTGGCTACAGCCCATGAAT 

CAGTATACAATGGCGCTACTGGTCATTTCTCTTTCCATACAAAGTGCACAACCTGGTATA 

CTCTTCAAAGTTCTGCCCACTGGGAAGATTTGCCTTCACCACTGTCCTTCAGGGATAGGG 

GCTATAGAGCTGCAGCTGTCCACTTTCAGGTGGTGCCTGCATATCGTGCATAACCATCTG 

TGAACCAAGCCCTAGTCTTGTCTTCCTCTGTCAACTGATCATAGGGAACTCCCCATGAGG 

CCATCAGTGCAGGCTAAGGGAAAGAAGGCAAGGTGGCAGGAGTGGAGACCATGGACATTT 

GAGCTACTTTCTTAGGTAACTTACTTGTGTCTTCAGGACCTGCTCAAGCCAGATCACATA 

TATACCACTTTAATT 

Sequence 1997 

CGCCNCGCGTCCGCTACACTTAAGGATGAAGAGAGGAAAAATCTACAGTGTAGGCACAGA 

AATGCCTAGAAATGAGAGAAAAAAAAAACAGGAGCGCATACTGCCTCAGAAACACCAAGA 

GTAGTTTGGGAAGAAAGGAGTCAACAATAACAACTATCAAAGGAATGAATGGCCAACAGT 

GTAGCCTCAAGTAGGGTAGTGTGAGTAGTTGATAGACCAAAAATCCTCAGCAAAATACTA 

GCGAACAGATTCAACAGCATACTGAATGGATTATACACCATGGCCAAGTGGGATTTATCC 

CAGGAATACAAAGGTAGTTCAGCATTAGAAAATCAATCAATGCCAACACACCTCATTAAC 

AGAAAAAGTAAAGAACTGCATACATGATCCCACAATTGATGCTGAAAAGGCATGACAAAA 

TCCAACATGCTTTCATGATTTAAAAAAAAAAAAAAAAAAAAAN 

Sequence 1998 

CCACGCGTCCGCACACTTGACCCCAGAGATCACGCCACTGTCAGCTGCCCTGGCTCAAAC 

AATTGCCCAGGGAATGGCACCTCCACCTGTCTCCATGGCTCCTGTGGCTGTATCTGTGGC 

TCCTGTGGCCCCTGTGGCTGTATCGATGGCCCAACCCTTGGCAGGAATCACAATGAGCCA 

CACCACCACTCCCATGGTGACTTACCCTATCGCTTCCCAGAGCATGCGCATCACGGCCAT 

GCCACACTGATGGGGCTAATGGACACTCCCCTGGTATAGCCTCGCAGGGCTGGGGTCAGG 

GGGCCCTTGCCCACTCACCTAGCCTTCCCCATCCCTGTCTGAAGGGCTCCCTTGAGAACT 

AGGACAAGAGACTACAAGGAGTATGTCCTGAGGAGGGGTTGGGATGGTGTGGTTTTCTCT 

CACCTCCCTTTTATGAGGGTCCTCTTGTCCATCTTCAAGCCTCACAGTGGGGGGCTT 

Sequence 1999 

NNGGCAGGAGAGGTTCAAATGCATTGCATCAACCTACTATAGAGGAGCTCAAGGTAATGG 

GGCTGGGTGAAGTGGGGTAGGTGGGTCTCAGAGTGCACATGGCTTCTCATATGGAGCTGG 

AAGGATTGGGGAAATGAGCAGTAGTGTCTTCCCTGTCAACCTGGGGCTGTTTNTGCCACT 

CTTCCAGCCATCATCATTGTCTTCAACCTGAATGATGTGGCATCTCTGGAACATACCAAG 

TAAGTGAGCATCCTGCAATATAATGGGAGGCTCCG 

Sequence 2000 

ACGCCTCCAGCCTGGGCAATAGAATGAATGAGACTCCATCTCAAAAATAAATAAATAAAT 
AAAATACTGAGAAAAAGAATCTTTATTGTTTCCTG

GCTCTTTTCCCTTTGACTCTCGCCGCCTAGATTTCGGTACCAGGACCACACATTTTTAAA 

GATGCTCCTCACCGCCGTACAGCTCCTGTACAGCCCAGAAAGCTCCGTGCGCACGAAGCT 

CATCCAGCTCCCGGTGGTCTACGTGATGCTCATGCAGCACTCGCTGTTTCTGCCGACTCT 

ACTGACGTCTGACGGAGAGGAGAGCCCGGACAGCCAAGTAAAAGGTGACCCCTCACCCCA 

GCCCTTCCATTCCGTATCCAGATTTTATCTCCTGATTTCCTTA 

Sequence 2001 
CCCGCGTCCGCCGAGATGGCGCCCCTGCACTCCAGCCTGGGTGACAGAGTGAGACCTTGC

GTTTAAAAAAAAAAAAAAAGGAAAAGGAAAAAAAAGTTCTACTGTGGGTAAAATGCTATC 

ATGCTATCAAACAGCATCACATGCTACAGAGAAGTCTTGTGAAAGGATGTCAGTGTGGCA

MCTTCATTGTCTTTTAAGAAATTGCCACAGCTATCGTAATCTTCAACCACCACCACCCT 

GATGAGTCAGTAGCCATCGTTAACGTTGGCGTAAGACCCTCCACCAGCAAAAAGATTGTT

AGGAI I I 1 1 1 1 AGCAGTGATGTATTTATCAATTATACATAATGCTATTGTACACTTCTTA 

ATAGACTATGGTATAGTGTAACCATAATTTATGCACTGGGAAACCAACAAATTCATGAGA 

TTCGCTTTATTGTANTTGCTCTGGAATTGAGCCTGCCCGTGTTTCTTGAGGTATACCTGT 

GTATTAGGTATGTTCCTTGTATATAMACCCCATTTTTTAGGGTAGGAAACTAANGCTTA 

GAGTTTGAGCCAA I I I I ICCCATA 

Sequence 2002 

CGTCCGCACTGTTCAMCCCTGTCATGCTTTAAAACTGATGCGAGATGATTTTGTTTTTT 

GCATAATCAATACTTAAGGGTGCAATCAACTGTTAGTAATTGTGCAGTANAGTAAAGCCC 

TGTGGTGTATCMCTACTAGTTAAGAGTCTCAGTTGATTTCTGTAATGTTTGACCTAATA 

ATAGCCCGTTTCGTCTCTGACCCAACAGAGGAAGCACAGATCAAATCACCTTGGAGTGGT 

CACCAGGGGGACAGGGAGCCCCCCACCAATGTATCAATGGGTGATTTATGATGCCTTCTG 

CCCTTTGGCGAGTGAATGGGTTTCCCATAGGGGAAGTTNGGCCTCCCTCCGTGAGCTTTG

GAAAATGl I I I ICTMTAGACACAGGGGAGGCCAGTTTTTGTTTTNANAGCMTTATTTT 

TTC CCAM TTCNTCTGTTTTGGGNGGTNGGMCCTGNGGGGCCCCCGGGGTTTCTGGTTT 

TCCTTTTTN CNGCN GGAAAATTCTCCTGGCTAAAANANTCCC I II I 1 I I I I 1 NGGTTTNC 

CANMG CCCTTTTTATAAATGCANNATANTGAATNGCTTGGGGAACNNAAAATAAANTTT 

rrm >c ccantcaaaaaaanaaaaaaaaaaanaaaaaaaaaanggcggggccccttana 

TTTTTTTAAAAAAAAAAAACCCCC 
Sequence 2003 

TTNCCATCCCTCAGGTGCTGAGAACCAAAGTGCCTAGAGAAGGGCAGGAGGAGGATGACG 

ACGATGAGGAAGACGATGCTGACGAGGAGGCTCCCMGCCCGACCATTTTGTTCAGGACC 

CTGCAGTGCTGAGAGAGAAGGCAGAAGCCAGGCGCATGGCCTTTCTCGCCAAGAAAGGGT 

ACCGGCATGACAGCTCAACAGCAGTGGCCGGCAGCCCCCGAGGCCATGGGCAGAGCCGCG 

AGACAACCCAGGAACGCAGGAAGAAGGAAGCCAACAAGGCGACAAGAGCCAACCACAACC 

GGAGAACCATGGCCGACCGCAAGAGGAGCAAAGGCATGATCCCATCCTGAGACCTGGTGC 

AGGGCCAGTGGGGAGGCAAGC 

Sequence 2004 

TNCACATTTTCCTCCTAAGCACATGGGTCATTCTCAAGGAGAGACCATATGTTAGGTCAC 

AAACATTCAAAAAATTGAAATATTATCAAGCAACTACTCTGACCACAATGATGTAAAACT 

AGAMTCMAACCMGAGGMTTTTAGAAACTATAGAAGCACATGGAAATTAAACAATCT 

GTTTCTTMTGACCAGTGAGCCAATGAAAMATCMGMGAAATTTGCAMTTGTTTGAA 

AGAAGTTATAATGGAAACACACTATACTAAAACCTATAAAACAGCAAAAGCAGGACTAAG 

AGGAMATGTATAGCTATTAMGTGCCTACATTTTTAAAAAATGAAAAAACTTTAGATAA 

ATCACTTATCTATGAATTTGAAAAAAAAAAAAAA 

Sequence 2005 

CGGGMGCTTTNTTTTAATTMGCTGAMCCGAAGCTTTMTTTTM 

ATGTGCCCATATTGGACAGTATAGCTCCAGGGTTTATTAAACCACCATTCCTGGTGATAG 

GATAGAGCCCAGCACAAACTAATCTGGGCAACAAATCAAAGGGCACAAGTCGCATGTAGG 

TTCCMACTCAGTACATTTTAGAGAMAGGAGTTTGATTATGTAGTAGAAGGAAGAACTG 

CCTGGTGpGACTCATGATCTTCCTTTAAGAGCAAGGCTCMAGACCTGGGGAGTTTTGAT 

TTGATGCTATGATGTCTCCTGGGGCTCAGAATATTATGAAATGAGGAGTGAATCTCTGAG 

TGAAAAGAACTCAAGCTGCTTGTTGCATTGCGGAATGTCTCTAAGAGGTAGAGAGGCGTC 

TGTTAAGTGGCTTTGTATGAAGGTTTCAGAAGGTAAGATGAGCC 

Sequence 2006 

NCATTTMCTGTAATAGATATAGAACGTATCCTTCAATAACCAACAACAAATGCTTGTTT 
CCAACAGGCTCTGATGGTTGTATATANGGAAACCTTGATACCAATACGTCCAGCTGACCC 
AGAAAAAAATCAGAGCTTAGTTAAATCACTGCTGCTCAAGGCTGTTGTATCTGGNNACGC
TCGAAATGGAGTTGCACTCACTGCCCTGGATCAGGATCACGTCGCAGTCCTAGGAAGTCC 
ACTAGCAGCTTCTAAGGGTAACTGACATCCAATTCATTAAGAAGGCCTTTGTATGTTTCC
TrCACGGTGTACCTTTTAGTCTTCCTTTTNTTTTTCATTtCATTATTACCCAGACTAAT 
Sequence 2007 

GTCCGGCCCTGCACTAGGGACTGGGGAGGACATGGAGAATAATATATGTACCAGCACAGA 

CAGACAGGCAGCTTCTGGAGCCAGACTGTCTGTGTGATCTTGGGTAAGTAATCTAGCCTA 

ACCATAGAGATGATAACAATTGTACCTGCCACAGGCTCATGGGGCTATTGTGATGGTGAA 

ATGTAAAGATGf AAAGTATAGAGAAAGGATGTAAAACTGCCCGACATATAGTAAGGGCCA 

TTTAAAAGATGCCTGTTATTGTCCCTCTTCTTGAGAGCATATTAAGGGCTCAAATCTAGC 

CAGAGTCTGGCGGGTACAGAGGTAAACAAATAATGGCAGGTTGGTTAAGTACTGGACACC 

AGTGATGTCCAAGGTCTCTAGGATGCCCTGGAGAAATAATGAACTCTTCTAAGATTGTTT 

GGAAGGGGATTTATAAAGGACATCTAGGCTGGGTTTTGAGTGAGGTATGGGGAGTTCGCT 

GGTCAAGAATGTANGAATTACAAAAGCANAAAGTGCCTTATTTNAAAAAAAAAAAAAA 

Sequence 2008 

NCCCCGCGTCCGGAAAATATTTTAGAMGCACCTTGAAGATTAGTATTTTTATGTAACTT 

CTGTTGGAGAGATGTCTTCAGGAGACTGAAGTAGAAGAGCGACTGTCAAAATGGAAAGTC 

CCAGAGACATCCAATTTATGTAAATCAACATCACCTGAATTCAGAATCTCATCCAGATTT 

CAACAAAGACTTCTGAATGCCAACAAAGAAGAGGACTGAATTTACAGACTCTCACTCTAA 

CMTATATGCTGTTCMTTTGAAAACAGAATAAAATTATTTTGGCAGAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

Sequence 2009 

AGAGTGTNCAGCTGGCTTTCCTCACTTGGGAAAAGGGGTACTGCCAGTCTAAGCAGCCTC 

CTCTGTACTCAGCCAGGACACCCAGCGCGTGGGACCTGTTTGTGTCTGTTTTGCTTCCTT 

GGGAACGGCACAGTCACTCACCCTGCCATTTGCGGAAATGACCTGGTGCACTTTGACTGT 

TAAGCAATGCGTTATTGCTGTAGTCAAGGTTAGTGCAAGCAAGGAAACATTCCCAGTAAG

GTATTTGTTTCCATTTTCTGTCTGTGCTTCTGTCAGAAACTTGCTAGGACTTTAGTGGCC 

AATAAAAAAGAAATTCTAGCTTGATCGACCCACGCGTCCGGTAATGATTGATGAGGACAT 

GACTGCCCGAATTAGCATGGCTGATGTCAAGTTCTCTTTCCAATGTCCTGGTCGCATGTA 

TGCACCTGCCTGGGTAGCCCCCGAAGCTCTGCAGAAGAAGCC 

Sequence 2010 

GCGTCCGACAAACATAATGATCCGTGACAGTGACCTTAATATATAAATGTGTGTGTTGTG 

ATTGTTCTACTGACCAGCCATTTCCTGTCTCCCTCTTCTTATGCCTCCCTGTTAAATTAG 

ACACAACAACATTGAAATCAGACCAATTAATAACCCTACAGTGGTCTCTGAGAGAGGCTA 

ACAGCTAATCCTCTTGTGCTAAACAGCCAAATTGTGAATGCAAAGGAAAAGTTCTCAAAG 

GAAATTAAAAGTGCTACTCCAGTGAACACATGAATGATAAGAAAGTGAAACAGTCCGGGC 

GTGGTGGGTCACGCCTGTAGTCCCAGCACTTTGGGAGGCTGAGGTGGGTGGATCACCTGA 

GGTCAGGAGCTCAAGACCAGCCTGGCCAACATAGTGAAACCCTGTCTCTACTAAAAATAC 

AAAAATTAGCTGAGTGTGGTGGTGCACACTTGTAATCCCAGACACTCGGGAAGCTAAGGC 

CGAGAGAATCACTTGAACCCCAGAGGTGGACATTGCAAGTGAGCTGAGATTGTGCCACTT 

GCACTTCAACCTGGGTAACAGAGAGAGACTCTGTCTCAAAAAAAAAAAAAAAA 

Sequence 2011 

GTCTATTAACATACGGAGCTGAATCTGCCGCAGCTNGAAATGCTCAAGAACCAGCTGGAC 

CAGGAAGTGGAGTTCTTGTCCACGTCCATTGCTCAGCTCMAGTGGTACAGACCAAGTAT 

GTGGAAGCCAAGGACTGTCTGAACGTGCTGAACAAGAGCAACGAGGGTATGGGGTAGGCG 

GGTGAGGGTAACCTAAAGTGGCGAACCTGCTTCTCTCGTCCCACCTCCTAACCCAGTTTT 

TCTTACCTGAAACGAGAAAATCCATTACATATCGTATACCGCTTCATGAACCCTTTGCAT 

GTT GCCTG CCTAGMTTGAAAAGTACAGGACATTCCTCTGCTCCTATTGCCCCTGTTTCC 

GTTCTTTTCACACTGTCTGTGGGTGCTGTGCCCTGTTGGAACTCTCTTTAACGTCTTACC 

GTTGGAGCCGCTTACCTTCCCAGGTGTTGTCTTCATTGGCTTTCACAAGGGAAAA 

Sequence 2012 
NGGCAGCTGCTTTGTCTGGGAGGGGGCTTTGTGTCGAGTCTCCCTGAATGAGCAGGGCTG

GCGACAGTTGTCAAAACACATGGTGCTTGGTCAGAGCCCCCGTAGAAGCCCCTTGTCCTC 

CGCATGGCCTCCGCCTGCACCCGGGGCGTGGAATGJGCTCTTGTGTGTCCCTGGCTGTCT 

GCTTGCTTCTACACTGGCCCCTGCAGATGGAGGGGGTGGGGTACAGGGGTTCCTATAAGA 

AGCAGACACTTGGGGTTTTTCCCAGGCCCTGTTGCAGGAGGGTGCGTGGGCTGGTTTCCC 

TGAAGGCGCCTGGGCCGTGTTGGTGTTAACTGATCTGAGATCTTCTGTGGCCCTGATGTC 

TATGAGCATGCCCCAACTTGCANGGGGCTGAGTAGCCCGGGCACCACCAGGAGGCTTGCG 

TGCCCTGTGCTTGGGTGTACCCATGCCCTGTCAGCATCGTTGGTCTGTTAGGGGTCAGGG 

ACTTCGGCTTCTTGTTTAATACCCTNC 

Sequence 2013 

AAGTCGGACGCGTCTCAGTTCCTTAAAGCTGTTGGTCCAAGGCTACCCTCAGTTCCTTGC 

CACATGGGCTTCTCCAACAAGGCCATTTACATTAACAAAGCCAATGAGAAAGAGAGTTAC 

CGGGCATGGTCGCTCACACCTCTTAATCCCAGCACTTTGGGAGCCCGAGGCGGGTGGATC 

ACTTGAGGTTAGGAGTTCGAGACCAGCCTGGCCAATATGCTGAAACCCCGTATCTACTAA 

AAATACAAAAAAATTAGCCTGGCATGGTGGTGGGCACCTGTAATCCCAGCTACTCAGGAA 

GCTGAAGCAGGAGAATCACTTGAACTTGGGAGTCGTAGGTTGCAGTGAGCCAAGATCGAG 

CCATTCCACTCCAGCCTGTGCAACAAGAGCGAGATTCCATCTCANACAAAAAAAAGATNA 

ANNAAAAAAAAAAAANT 

Sequence 2014 

TCCTATTCCTCTCTTGAGTTTATGCACATCTCTATAMTCATTAGTTTTCTATTTTATTA 

CATAMATTCTTTTA GAMA TGCAAATAGTGMCTTTGTGAATGGATTTTTCCATACTCA 

TCTACMTTCCTCCATTTTAMTGACTACTTTTATTTTTTAATTTAAAAAATC 

GTATCATGAGTAGGTCTTACATCAGTGATGGGTTCTTTTTGTAGTGAGACATACAAATCT 

GATG T TMTG TTTGCTCTTAGAAGTCATACTCCATGGTCTTCAAAGACCAAAAAATGAGG 

TTTTGCTTTTGTAATCAGGAAAAAAAAAATTMTGAACCTTAAAAAAAAAANANNAAANA 

ACANGTGCGGCnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnGCAl I I I I I I INACTGCANTC 

TAAGTTGGGGGGTTGGNCCAAACTCATCAANNNATCNTAATCATGGCTGGGATCCCCNGG 

GTCCCGNGCCT CGGAAATT AM1TNCCTTTTTTCCGGTTTTCCTTGGGGTTTT4 

TGNGGNGAAAN'I 1111111 I GGGGGGGGGGGGGG 

Sequence 2015

CGCGTCCTGAGAGGTCAGGCCGGTCCTGGGGGCAGCAAGCCCGGCCACACTCCCCACCGC 

GACCGGGGCTCTGGGCTCGCTTCCTGCTTCAGTTTCCCCAAGCTCCNGATGAGACTCCGC 

TACTACCACCACGTCGATAACGCAAACCTAGAGGGACTCAGGGTAAACTGAGGCACTCAA 

ACTGCCGAGGAGCTCCGCCTCCCGAGAGACATTTAATCCGGGGGGATTTGCAGGAAACTT 

CTAAATTAAGGGTAGCGGCTGCTGCAGCTGAGGGGGGGCACACCGGTCCCTGCGCCCGGG 

CAGCTGCCGTGAGCTCACGCCCCGAAATAGCCCCAGGGGCCCCAGCCGCAGCTGCCACTG 

GGTCCGGCTGTCACTCAGAGGAAGCACGGAGCCCCCAGCCCAAGGGTCCCCTCCCCTTCG 

CATCGCGGGGTTTTTCCAGCCGACCGTCGGCCACTTTTTCCTCCGACNGCTGGCAGGGAA 

GAGGGGGATTGGGGGCCGGGACCCCAAGGGAGGCGGTCCCCAATGGGTGGGCCAAGGG 
Sequence 2016 

ACGCGTCCGCACACCCCCAGGTGCCCGCGCTGGCCCCCCAGGCGTGGTGGCCTGCACGGA 

GGGGACCACTTACGTCTGCTCCGTCTGCCCAGCAAAGTTTGACCAAATCGAGCAGTTCAA 

CGACCACATGAGGATGCATGTGTCTGACGGATAAGTAGTATCTTTCTCTCTTTCTTATGA 

ACAAAACAAAACAACAACAAAAAACAAACAAACAAAAAAGCTATGGCACTAGAATTTAAG

AAATGTTTTGGTTTCATTTTTACTTTCT 

ACATGAAGAACTGTTT1TTGCCTGCTGGTACATTACATTTCCGGAGGCTTGGGTGAATAA 
TAGTTTTCCCAGTCTCCCTCGGATGGTGGCCTTAAGGCCTGGTAGTGCTTCAAGAGGTCC 
ACTGGTTGGATCTCTAGCTACTGGCCTCTAMTACMCCCTTCTTTTACAAAAAAATCTT 
TTAAAAAAAAAAAAAAAAAA 
Sequence 2017

CGCTCCGCAGCATCMGAGTTATTTTTCTGACTGTCMGTTTCMCATTCAGGTCTGTCC 

CCAACAGGCACCACACCGGGGTGGACTCCCTGTGTAACTTCTCGCCACTGGCTCGGAGAG 

TAGACAGAGTTGCCATCTATGAGGAATTTCTGCGGATGACCCGGAATGGTACCCAGCTGC 

AGAACTTCACCCTGGACAGGAGCAGTGTCCTTGTGGATGGGTATTCTCCCMCAGAAATG 

AGCCCTTAACTGGGAATTCTGACCTTCCCTTCTGGGCTGTCATCCTCATCGGCTTGGCAG 

GACTCCTGGGACTCATCACATGCCTGATCTGCGGTGTCCTGGTGACCACCCCGCCCGGCG 

GAAGAAGGAAGGAGAATACAACCGTCCAGCAACAGTGCCCAGGCTACTACCAAGTCACAC 

CTAGACCTGGAGGATCTGCAATGACTGGAACTTGCCCGTGCCTGGGGTGCCNTTTCCCCA 

ACCAGGGTNCNAAANAACTTTGGCTGGGGCAAGAAATNAAANCATATTTGGTCCGGAAAA 

AAAAAAAAAAAAAAAAGGGCCGGCCCTTNAACTAAGTCTAAAANAAAAAAACTTTCAAAA 

CCTTCCCCTGGAACCTGAAACATAAAATGNATNGCAATGGTGGGNGGTNAACCTGGGTTA 

ATTTGGCANCCTTTTTANTTAAAANTTGGGGGGGGGGTT 

Sequence 2018 

CGGATCAAGACCATCCTGGCTACAGTGAAACCCCGTTTCTACTAGAACTACAAAAAATTA 

GCCGGGCGTGGTGGCAGGCACCTATNGTCCCAGCTACTCGGGAAGCTGAGGCAGGAGAAT 

GGCGTGAACCTGGGAGGTGGAGGTGCAGTGAGCCAAGATGGCACCAGTGCACTCCAGCCT 

GGGCGACAGAGGTAGACTCTGTCTCAGAAAAAAAAAAAAAAAAAAAATCAGTCACTGGAT 

TTGGGCCCACCCTACCTNCATATGACCTCATGTTAACTTGATGACATCTGCAAAGACCCC 

ATTCCCAAAAAGGTCACCTTCACCAGTMGTNGGGGGTrAGGACTTGMTATAGCTTTTT 

GGTT GATGTAATTCMCCCACAGCACTGCCTTTTNCATTCCATGTTATGTTTTTTGAGAT 

TTTTGAGATTTGCCAAATATATGAAGCTATAMTTATCAGNGAAMTAMTMTTTCAAA 

ATNTMGCTGTTGAAAACTCTAAATTATTTTTAAGCCTTTAAM 

GACMGGNCCCGNNNGGGCTTCAATGCCTNTAATCCCCANCACTTTTGANANGGCTGATT 
GGGNGNGNGGATTCCACCTTGAGGGTTTAGNGGNTTTCAAAGACCAAGCCTGGNCNCTAN 
CGGNGGTTGAAAACCCCTGGTNNTTCTNACCCTTNAAAAAAAA 
Sequence 2019 

GTTTTT TTATT TCTACTGTCAMTGATGTGCAAAACCTTT^ 

GCCMGTTTTATMTCCTTAMTCTTAATGTTCCTCAAAGCTTGGATTAAATACATATGG 

ATGTTACTCTCTTGCACCAAATTATCTTGATACATTCAMTTTGTCTGGTTAAAAAATAG 

GTGGTAGATATTGAGGCCAAGAATATTGCAAAATACATGAAGCTTCATGCACTTAAAGAA

GTATTTTTAGAATMGAATTTGCATACTTACCTAGTGAMCTTTTCTAGM 

ACTCTAAGTCATGTATGTTTCTCTTTGATTATTTGCATGTTATGTTTAATAAGCTACTAG 

CAAAATAAAACGAGTTGACCCACGCCGTCCGGACACAAGAAAGGAATATAATTCATACAC

TATTGCATTTTTAATAAATCTTTTGAAATTTGCAGMTTMGATT 

GGTTAAATGATMTTG MTGT AAATATTTAAGATGCAGCACCATATTTTATAACCCAGCT 

TTAGGCATTTCTTCATATTTTMGGGAMCCCCCCACCTCCTTCTTTTAANGGCGCTTCT 

TGCTCTCTGAMTGCCCTGCTAMTGCCTTCTCTTMTTATTTGGAATAANGGTAGGTTT 

TTGGGGGAAAATTTTAAAAAAAAAANGGGGGGGNAAAAAAAA 

Sequence 2020 

AAliii \ NATAATCTGMTTATCACMAAAGATMGGATTTTTAAAGTTATTTGGAGGGA 

GTGTACACATTGTTTATTTAATAGTGAGGGCTATTATACAAGCNGGTCNATGTAAAATAA 

TCCCTTTATATGTATGAGCATAGTTAATTTGGTAAACAACAGACAATTACATACTGTGAT 

CATAAGGACTTTAGTATCAGTTACCATATAGCAGGTACTCTTTAGTCAGGATATACCTAT 

ATAGGTGCTAMTTAAAAATCAACCTTATATCTCAAATTTACTTCCTAGTAGAGTGTAAC 

GCTGCCATAATTGCMGCCTAATTATGGGGTTGTCCATACTGCAGTCCCATCAGTACTCA 

TACACCAAAGGGTCGGGCCTGTGAAGACTGGAAAAAGAATAAAATATCTTTTGATTGAGA 

TACTACAAGCAAAAATGACTTTCTGGCTACCATTACTGCAAAGAACAAAACTCAACTGAA 

AACAATATGTAGCAATTAAAAAGTCTCATTATCTTCAAAACACAC 

Sequence 2021 

CCCGCGTCCCGGAACTCTTACCCATAACCCTAATGATGCAAGTCATATGGGGGAACACTT 
TGTAMTGGTCAGGATAAAMCCMATCTGGGTGCCAGATCCCAGCACTACTTTTTATTA

CTGGAGAAATGGGGGGGATAGAAAATTCTACTTTGAATTATTTAG I I I 1 1 I I I AAAGAGT 

GGGTTGTGTTTGTGCTTCTCCCACCTTTCAAGCATTTATAGAACATGCTGCCCCACATAC 

AMGTCAAGACCACTTACTTTTATGTGACACTAGTAGTTTGGGGTTAATGGTTTGNGTAA 

AGAACAAGCTGCATATGAGTAAAGGTTACCCCAACCCNCAGTGAGGANGAAAGATGTTCA 

CATACTGGGMCTGTCCTGNCAMTMATNTGGCCCCTATTGGGCTCTGTTTTAATTNGG 

MGNGGGCAAMGTMCCTC7TGCTTTGGGGCAACTATTTGGNNTCAAAATTANAAACCT 

TTTAGACCCAAANTTNANNNNNNNNAAAANNGNNNNNNNANGGGGCCGGNCCCmTNGAC 

CTTAGTTTTTANANAAAAAAAAACCTNCCACACCTTCCCCTTGGAACCCTGAAACAATAA 

AMGGAAANGCCANTTGGGGGGGNGGGTAAACCTTGGTTTNATTGGCACNCTTTATAAAN 

GGNTTNCCAAAANMAAGGCAATTTGCCTTTCCCCAAANTTTTCCCAAANAAAAAAAGGG 

Sequence 2022 

ACCACGCGTCCGGTCTGCAGAGGCCCGGGCCTGGGCACAAAGGGAGAGAGGCCTCCATTG 

TCCCGCAGGGGCCAAAATGCAGACCGTGCATCCCCGGTGACCTCGGGGACCGTNCTCTGA 

TCAGCAGGATTTTCTTGGACTCTGGGGTCCTTGTCCTGCTCAGGCATCCCTGCCCTGCTC 

TCCTTGAGGGCCCTCAACACTATCTTCCCTGGACACAAGTCTGGGGACAGCCGGGTGTTG 

AGGACCCCAAAGGGGTGACTACCTGCTCCTGGGCCCCACAGAGTCCTTGTGCTCAGTGTA 

GTGGCTGAGCTGGGGGATGCCCTGGAATTCGGAGCACACAGCACTGGCTTACTGTGGTAC 

CTGTGCAGTGAAATTGGAGACAGAATCACCAGGATGGAACACAGGTCTTGCAGGATCACG 

GAAMCCTTTTAGAGTTGTCTTGACACCACtTGATGTTGAGTGTCCGGGTGTTTGTAGGA 

TGGCCTGCACTCAGTCCAGGGGCAGG 

Sequence 2023 

CGCGTCCGCTTGACCCTGTATTTTGGGAGTCGAACGGAGAATGGAAACTGAAAGTGGAAA 

TCAGGAAAAGGTAATGGAAGAAGAAAGCACTGAAAAGAAAAAAGAAGTTGAAAAAAAGAA 

ACGGTCACGAGTTAAACAGGTGCTTGCAGATATTGCTAAGCAAGTGGACTTCTGGTTTGG 

GGATGCAAATCTTCACAAGGATAGATTTCTTCGAGAACAGATAGAAAAATCTAGAGATGG 

ATATGTTGATATATCACTACTTGTGTCTTTTAACAAAATGAAAAAATTGACTACTGATGG 

GMGTTAATTGCCAGAGCATTGAGMGTTCAGCTGTTGTAGAGCTTGATTTGGAAGGCAC 

CAGAATCCGGAGGAAAAAAC 

Sequence 2024 

CGCGTCCGCAGACTTTCCCTCTGCAATAAATCCTGTAACAAAATTGCACTCGCACCCCTT 

ACGT1TATACAAATTTTTAAAATAAATAATAACTAGAAANGCAAGAGGAAACCAAACCCA 

AAATTATTAGAAGAAAAGAAATGATAAAGATTGGGGCAGAAATTAATGAATTGATACTAA 

AAATAGTACAAAAGATGGATGAAAG AAAAAGTTG I I I I I I 1 1 1 1 I I I AAAAGATAAGCAA 

AATCAACAAACCTTTAGCCGGACTGTAAAAGAGAGAATATCCATATGAATAAAATCAGAG 

ATGMAAAGGAGTCATTACATCTGATMCACAGAAATTCAAMTATTTTAGAGACTATTA 

TGAGCAACTATATGCAGTAAATTGGAAAACTTAGAAGAAATGGATGAATTCCGAGACACA 

TAAAAACATACCAAGATTGAACCATGAAGAAATCCAAAACTTGAA 

Sequence 2025 

AATTATCCTGGTGTGGTGGCGTGTGCCTGTAATCCCAGCTACGCTGGAGGCTGAGGCATG 

AGACTCGCTTGAATCCAGGAGGCAGAGGTTGCNTTAAGCTGAGACCACACCACTGCACTG 

CAGCCTGGGTGACAGAGCAAGACTCCGTCTCAAAAAAAAAATAAGCTATTGATGGGCTAT 

ATATTGTTAAGCTATAGGGTGTGGTTACAGTGTCCAGTGTAGCATTGNTCGATTMTTTA 

TAGCCTCTTGTGGCAACAGCAAGCAGNTTCCAGAGATGAATACACAGCTTGAGTAGGCAG 

GGGGGAGTAGGACATGACGGCTGCCTCATCTTAACACCTTGGGCCTGATAATTTAAAAGG 

ACTCACAGTCCTCAGATAMCATGACTTTCTTTTCTCATGAGGAAACANANAAGGTGGCT 

AAAGGGTATNTCTTTCTCTCATGATCCCAAACCTATCAGGTTT 

Sequence 2026 

ACCNCGCGTCCGGGTGCTGTACCAGACCAGAGGCCAGCTCCATGTCCTCCGCGTCGGCAA 
TGATACCCACTGCCAACCAACAAAAATTGGCTGCAACCATCCCCTACCAGGACCCGGCCC 
CTACAGGGTGAAGTTCCTGGTGATGAATGACGAAGGACCCGTGGTGAAACCAAGTGGTCC

AGCGACACTCGCCTGCAGCAAGCCCAGGCACTTCGGGCTGTCCCCGGCCCCCAGAGCCCG 

GGCACCGTGGTCATCATCGCCATCCTGTCTATCCTCCTGGCCGTCCTCCTCACGGTCCTC 

CTGGCTGTGCTCATATACACCTGCTTCAACAGCTGCAGGAGCACTTCCCTATCAGGCCCA 

GAGGAGGCAGGGAGTGTGAGAAGATACACCACGCACCTCGCGTTCAGCACTCCTGCCGAG 

GGGGCTTCTGAGGGGTTCCAGAGGGGCCCACGTGTCCCTCCACCTCCTCCCTGGCCCAGG 

CTGCAGAGCCTGAGCTGGGACACGCCCTGAAGCTTCTGGACCCTGAGAGAGATTGGTTCT 

Sequence 2027 

GCTGC TAGAAAAGGTTAGACTAACTGGGAGTTCATMTTTGTCTTGGAGATTTAAGAAAG 

TTTTTGAAGAGGTTTTCTGTGTTGAGATGMGAGGAAGATTGAGACAAAGCACAATGGAG 

AGTAGGGGAGAGAGCACTTCAGAATGTTTGAAGGTGCTAAGAGGAAGTGCTTGACACACG 

GGAGGAACTGAGAAAGAAGGCTAGGGCCATCGGCTGGAGGGACAGGACAGGTGGAGATGT 

MGCATGGGCCTCCAGACCATGGTMGGCCACTGGATCTTTACTTTAATGCATTTGTGCA 

TGTGATTAAAAAAAAAAAAAAAAAAAAA 

Sequence 2028 

CCCCGCGTCCGAAAAAGACATATGTACAAGTCTATTTCCTATAGCACTATTTGTAATACC 

TAGCACTATTTGTANTATCTAAAGACTGTAAACAACGCAGGTGCCCACAAAGGGAAAATG 

GTTTGACAAACTTACGAACATCCTTTTAAAGGAGTACTAGACAGGTCAAAAAGGAATGAA 

GAATGTATATATTACTATGGAGTGAATCTTCAGGGTATATGACTAAGTGAAAAAATGCAA 

GGTGAAGTANTATGTAACATATGGTACAGTTTACTTAAGAGAGAAATACAGATGTATATA 

CACATTACCTAAAGCAAAATGAGGACCCTACTGGGCTGCCATCCCAGCTGGACTGCTGCT 

GTGGAGCTCAGCATCAAGTACT

Sequence 2029 

CGCCCCGCGTCCGGAGAGAGCTGTCTTTGCAGTTACTAGGTTTCATCAMGTTGTTTTTT 

TCAGTATGGTAGGTTTAAAMTGGGGATACATTTTTGTTTTTATT^ 

TTTCTTAGGTTAGTTGGCTACTTAAATTTCTTTTTCTGAAAACTTTGTATTTA 

TTAAMTTCTTATTGACTTGCCTGMCTATTTGTAMTTACAGAAATTAGCCCTTTGTCG 

TATGTGTTGCAGGTGCTTTTCCAGTTTGCCAGTGGTCATTCATTTTGGTTTATGGTACTT 

TTGATAGACCAGAATCTTTGACTTTTATTTAGTGAGATTTATCCATCTTTTTCTG 

CATCTTGGTATTATAGCATTAATATTCTCTTTTCT 

Sequence 2030 

CCACGCGTCCGGTTGGGACGGCACCAGGCGAGGTGTTGAGTTGGCTCGGCTCAAGGTTCT 

TCGGGGTGTGAGCTGGCATGAGGACCTGTTGGAAGTGGGATCCAGGCCTGGNGCAGNCTC 

CCAGCTGCCTCGATTTGTGCGTGTGAACACTCTCAAGACCTGCTCCGTTTATGTAGTTAT 

TTCAAGAGACAAGGTTTCTCCTATCAGGGTCGGGCTTCCAGGCTGGATGGAGTGCCCTGG 

CGCGATCTCGGCTCACCGCAACCTCTGCCTCCTGGGTTCAAGCGATTCTCCTGCTTCAGC 

CTTCTGAGCAGCTGGGATTATGAAGGGGT 

Sequence 2031 

NCCCCG CGTC CGGCTCMGGAGGTGATTAAAAGMTGTGTMTTTAGTATTTTTAAGATG 

TTTAGAGTTTTMAMGTTACGTTGTTGCTGTCATTGTATTTAAATAGTNATAAAGAATA 

TAAACTGATATAAGTAA \ I I I I I I I AAATTCTTCAGATATAATGAAGATCTGGAACTTGA 

AGATGCCATTCATACAGCCATCTTAACCCTAAAGGAAAGCTTTGAAGGGCAAATGACAGA 

GGATAACATAGAAGTTGGAATCTGCAATGAAGCTGGATTTAGGAGGCTTACTCCAACTGA 

AGTTAAGGATTACTTGGCTGCCATAGCATAACAATGAAGTGACTGAAAAATCCAGAATTT 

CAGATAATCTATCTACTTAMCATGTTTAMGTATGTTTTGTTTTGCAGACTTTTTGCAT 

ACTTATTTCTACATGGTTTAAATCGACTGTTTTTAAAATGACACTTATAAATCCTAATAA 

ACTGTTAAACCCAAAAAAAAAAAAAAAAAAAAA 

Sequence 2032 

CGACCNCGCGTCCGGCGTTCTACCCTTCCGGCCCGTGTTCTATCCGCCGCCTCCACCTTC 
CATNCGGCGCCGGCTTTCGGCGCGACGGTCGCCGCGTTCCATCGTCGCGCGGCCCTTCGG 
GCGGAAGGCGAGGAGGACGGCTTCGGGGAAGCAGAATACGCTGCCATCAACTCCATGCTG

GACCAGATCAACTCCTGTCTGGACCACCTGGAGGAGAAGAATGACCACCTCCACGCCCGC 

CTNCAGGAGCTGCTGGAGTCCAACCGGCAGACACGCCTGGAGTTCCAGCAGCAGCTCGGG 

GAGGCCCCCAGTGATGCCAGCCCCTAGGCTCCAAGAGCCCCCAACCGGGACCCAACCCTG 

CCTCCCTGGGCTAGGCTCTGGCCTGGGCACTCACCCCCTGGCTTAGACACCTTCTCAAGG 

GCTGGCCTTCAGGGACCCCTGGTGGGTCTGCTGCCTGGGCCACCCTTCTGCCTGGGCCTN 

CCCTTG 

Sequence 2033 

CGACCACGCGTCCGCTACCTCAAGGNCCTGGGCACCGAGCGGGCCTACAAATCCGCACTG 

GACTACACCAAACGAAGTCTGGGGATTTTCATTGACCTCCANAAGAAAGAGAAGGAGGCG 

CATGCCTGGCTGCAAGCAGGGAAGATCTATTACATNTTGCGGCAGAGCGAGCTGGTGGAC 

CTCTACATTCAGGTGGCACAGAACGTGGCCCTGTACACAGGCNACCCCAACCTGGGTGCT 

GGAGCTGTTTGAGGCGGCNTGNAGACATCTTCTTCGACGGGGCCTGNGAGCGGGAGAAAG 

CTGTGTCCTTCTACCGGGACCG 

Sequence 2034 

GGGGGGGAGGGGNGNAAAAAAAAGNGANGGACAAANAAANAGAAAAAANAAGANANAAGA 

AAGNNANAAACANNNNAAAANACNNNGGNAANCGGAGAAAAGAAAAGAAANNNGNANANA 

GAANANNANGAGGGGNCAAGGNAGAAGAAAANNNGANAGAAGGGGNNAAAGGGGGNCGGG 

AGAAAAGAAGNAAAAAAAAAANAAAANNNGAAAAAANGGGAAAAATNNGGGGAAANNGGA 

AANNAAAGAAAAAAAAAANANCNGNGGNAAAAAAAAAAAGGANNNAAAAAANNGGGNAAA 

ANANGGNNAANGNANANANAAAAAGNGGGGGGGAGGGGGGAGGGNAAGGGNNGAAAAANN 

NGAAAGAAAAAAGAAAGAAANAAANGAANAANAGANGGAGANGNAAGGNAAAAANAANAA 

AAAGNNNNNAANAANANGAAAGGGNAAGNAAAANGGGNNNNAAAAGAAAAAAAAAAAGNG 

GNAAAAAAAAAAANNNNGGAAAAAAAAAANNNGGNNANNNANNCCAAAAAGAAAAAAAAA 

AAAAGGGNNNNAAGGGGAAAAAANAANAAANANAAAAAANGGGGAGGNGGCNGGNAAAAA 

AAAAANGNNAAAAAAAAAAAAAAANNNAAANAAAAANANANANNGNAANANNNANAAAAN 

AAAAAAAAAAAGAAANAAGNGGGNNGNAAAANACNNAANANNGANANGNANAAAANNGGG 

AANAAAANAAANNAAAAGNNANAANGAAAAAAA 

Sequence 2035 

CCCCCCGCGTCCGCGTTTTTATGTGTGTATGTACAAMCAAMTACCTTTTTGAAATTAC 

ATAAGTGATACATGCTTATTGTGAAAGAGTTGGATAATACAATATACTGTAAAGAAGATG 

AAAACCACTCATAAATTCCAACCTGTAAATACTTTCGTGTTCATATTTTGGCATCTAATG 

TATCCTTTATGTATATTTAAATATATATTTTATTCAAGTATAGGATCATGTACCTCCCTG

TTTTATAATTTTCTTTTTTAATTTTTCAGTGT^^ 

TACATCCCACAATGTTCATTTTTGTGCCTCATTATACCATGGAATAGTGACATCCTAGTA 

TGTTTAAGACATTCCTTATTAATGAACAATTAGGCTACTTCCMTTTTTAATTATAAAGG 

ACATTTTGAAGGACTTCCTTGTACATATTATTCTATTGCTT^ 

TTTCATGTAGTATATTTCCAGTTATTTAMTGAGAAAGTACATGGTTTATAACAAATGGT 

ATTTAACGTCCAAGTCACTGCTATCTAMGGGGTMTTTTTMGGTATAAMTAATTTGG 

CTTATAMAAATCGTGGGAAAAATATNCTAGAMTATTTAANGATTAACTTCTAAATTGT 

AAATTGGCATATTTAATGATAGAATTCAAAAAAAAAAAAAAAAAAAAA 

Sequence 2036 

CGCGTCCGGMGAAATTGTGCACCCTCCCAAMCATACAMGTTTAAAAGTTTGGATCTT 

TTTCTCAGCAGGTATCAGTTGTAAATAATGAATTAGGGGCCAAAATGCAAAACGAAAAAT 

GAAGCAGCT ACAT GTAGTTAGTMTTTCTAGTTTGMCTGTMTTGAATATTGTGGCTTC 

ATATGTATTA TTTTAT ATTGTACTT7TTTCATTATTGATGGTTTGGACTTTM 

AATTCCATAGTTTTTMTATCCCAGAAGTGAGACMTTTGAACAGTGTATTCTAGAAAAC 

AATACACTAACTGAACAGAAGTGMTGCTTATATATATTATGATAGCCTTAMCCTTTTT 

CCTCTAATGCCTTAACTGTCAAATMTTATAACCTTTTAAAGCATAGGACTATAGTCAGC 

ATGCTAGACTGAGAGGTAAACACTGATGCAATTAGAACAGGTACTGATGCTGTCAGTGTT 
TMCACTATGTTTAGCTGTGTTTATGCTATAAAAGTGCAATATTAGACACTAGCTAGTAC

TGCTGCCTCATGTAACTCCAAAGAAAACAGGATTTCATTAAGTGCATTGAATGTGGCTAT 

TTCTCT AMGTTACTC ATATTGNCCTTTGCTTGAATGCAATGCCCGTGCAGATTATGTGG 

CTGCTATTTTTATTTTCTGGGCATTACTTTNACACCNT^ 

TTCTTCACTGACTGGCAATGGNCCTTTACTGCAATAGGAAGAAAA 

Sequence 2037 

CCCCGCGTCCGGCTGGGCTTAAGGGATCTTTCCCAGGTAGCTGGGACTGCAGGCATATGC 

CACTGTGCCCAGCTGCTCCCTTAGTCTTGACTATGT AI rTTTTTTTTTI I GGTTGTTTTA 

TTAGGATAGAGTCCTAAGAATGGCATTACTGGGTTAATAGGTATGAACATTAGATCTTTA

ATACATACAGTCGMTTACTTTTCATAMGCCATATACCTTTTTATATTCCCACCGATAC 

TATCCCTATCACTMGTATTAACTTATTTTCATCTTTGCCAATATGATCATCCAAAGTGA 

GGCAGAGGTTGCAGTGAGCCAAGGTCACACCACTGTGCTCTAGCCTGGGTGCCAGAGTGA 

GACTGTGTGTCAAAAGAAAAAAAAAGGGGGGGGTGCCGGGTGCGGTGGCTCACGCCTGTT 

ATTCCAGCGCTTTGGGAGGCCGAGGTGGGCGGATCACCTGAGTTTGGGAGTTTGAGACCA 

GCCTGACCAACATGGAGAAACCCTGTCTCTACTAAAAATACAAAAT 

Sequence 2038 

GTCGACCCCGCGTCCCGGACGCGTGGGTCGGCCATGAACMGTTTTCAAGTATCAGTTGA 

TTTATGATATAGGCTTATCCATTTGGTTATAAAATCATATGTTTATTACATAATCATTGA 

CAAATAGTTTCTGTATAATAACTGGCAGAGTAGCTCTAAAACATATGCAAGGAAATAAAT 

AAAGAA AAAAC TTACMTAMGAGAGTMCTCATATTTTAACAGTTTTGTGAAAAAAATA 

G AAAAT ATTTTATGTAGCTTATAGTACATATATTTTTTTACMCAGMGAATCGCATTCT 

GATTTTCCATATGGATCATTTCCCTATGTTGCTAGACCAGTACACTGGCAACCTGGTCAT 

ACAGCTTTTCTTGTCAAGTTGAGGAAGGTCAAACCACAACTTAACTGAGTACTCCAGATG 

ACAGTAACTGACTTGAAGATGGAAAAATATCAAAATAGAACTTTATATTGAAAATCACTG 

CTTCCATAGATTGGCATTTTTAGCTATTACTATGACTTATATMGTTATACATATMTTT 

TGAAAATMCAACTAAAAGATGTATAACATAGCCAAAACTGCTTAAACCATCCATTTTGA 

CCACTTGTCTTGCAGNTAGT1TTGACA1TTTGTAGGTTAATGGATTCCAM7TGGTTTM 

GTGGGCCATCTCATTCTTCACTTTCTGGNMNCCACTCCATAGATTTGGCTTTTCTTCAG 

GAAMTT MGNT TCCTTTNCCTTTATTTGGATTGGANGGNCATTGGCCTACTGGAAAANA 

AATATGCCTTTTTAGGGTTAAAAAAAAAAAAA 

Sequence 2039 

GTCGACCNCGCATCCGACCACGCGTCCGCCAGACCTCACGTCAACCGGCTGCACCCCACT 

TTCCAGCCTGCGCCCCAGATCTGCAGCCTTCGCCCCTAGATACACCCGCCTGGTGATGAG 

GCGCTCCTCGCGTTCCTTCCGGGCTCCAGGTGTCCGTGAGCCTCCCTTCCGCCCTGGCCT

CCGGTCTCTGCCTTGCTCGTGCTTCTACCACCACCCTTCCCCTCCCAACCCGGTGGATCC 

TCTCGTCTCCCCCAGTCTCCAGTGCACCGGCTTTCCCTCGTCCTCTGCGCAGTCCATCTC 

AGCTCATCTCTCCAATTCAATGCCATCATCTCTCCTCACCATCTCTCGGTGCCCTGGAAT 

GTTTGCTGTCMGATGTCCCCTGTGAAACCCACAAACGCTTGCGATTTGGCCTCCTTGTT

TTATTTTGTGTAGTCCTACMCGTCTTGTTACTACCCCCTATTACAACACTTATAACTCA 

NAANGAAANAGNNNNNNNNN 

Sequence 2040 

CGTCCGCGGAGATCCGGCACACTGCGGACCGCTGGCGCGTGTCCCTGGATGTCAACCACT 

TCGCCCCGGACGAGCTGACGGTCAAGACCAAGGATGGCGTGGTGGAGATCACCNGCAAGC 

ACGAGGAGCG6CAGGACGAGCATGGCTACATCTCCCGGTGCTTCACGCGGAAATACACGC 

TGCCCCCCGGTGTGGACCCCACCCAAGTTTCCTCCTCCCTGTCCCCTGAGGGCACACTGA 

CCGTGGAGGCCCCCATGCCCAAGCTAGCCACGCAGTCCAACGAGATCACCATCCCAGTCA 

CCTTCGAGTCGCGGGCCCAGCTTGGGGGCCCAGAAGCTGCAAAATCCGATGAGACTGCCG 

CCAAGTAMGCCCCTAGCTTGAGTCGACCCACGCGTCCGATTTAAATATTTGTCCCATTG 

TTTGTGATTAGGATGTAAGCTTTGTGGMTGTAATTAACCCTGCTTTACGAAGTCACCAT 

ATTATAATAGGAAAAACACTGCCTAGGAGGCAAAGAGATCTGAATTCCAGTTCTGATGCT 

GCCACTGTGTAAGGAAGTAGTTTTATAAACCATGGGCAAATCATCTTGAGCTTTCTCATC 
TGTAAAGTTAGGGG



Sequence 2041 

TCGACCNCGCGTCCGAAAAACCAAAACCTGANTGAGATCTTGGAACCGNTGTGCGCCGGC 

CGNNCCTCTCCCANGGGACCANCCANCCCCGCGCGGTGGCCGACTGNATAGGCGGGACTG 

CGCTTCGAGGCTTAAGGACGNCAGATCGGAGGCATCGTGTGTTGTCTGTGCGGAGAAGCC 

AAAANNGTGATTACGTTTATTTGCMGACCGTTCATGTTGTTTTCAGTTCATGGTATGAT 

TAAMCCCGATCCTTTGTTACCATGCCCTTAGGTACGAMMATMTTGTTTNGATATTT 

GGCAGTCACCCAAAAATATCCAAAAAGCCATGAAACAGTANAGGTAAACAAGTANGAAGT

GAAAN TMTNTTCGTCCTTTGTTTTCTTTCTGGAGGTGCTCAAAACACCCTCTCAAACCA 

TTTTTCTCAGCATAGAACCAAGTGTGGNCNGGNTANCAGCTAATATTTACNAGGGNAGAA 

ACGMCCCTNGCGATATTTAGTCACTTTGTTNCCNGGGANCACANAAAATNTTGAACAAA 

CACATGAGAACTGTCACCGATCTCTGTATTGATNACCANGGATACCCGTGAATTTTATGT 

MTATTMTCTNNGGNAGGCANGANTNTTTNNTAGGTATTTGCCTTTTCCAAGGTGCNCT 

TTCCNTACCAAAGGAAAANGGTTATTTTAAAAACTTTTACCANAANAAAGGGGATGNCTT 

ATTTTTTGGTCCT 

Sequence 2042 

NGGACTTGGTTTGAACGCGTTTTCCCCAAAGTTTATGTGTTGGAMCTTGACCCCCAATG 

CAGCAGTGTTGGAAGGTGCCTACTAGGTGGTGTCTGGGTCATGGGGGTATGACCCTCATG 

GATGGATAAATGCCATGACTGAGGCGGTGGGCTCCTTATAAAAGAATGAGTTTGGGTGAA 

ACCTCGTCTCTACTAAAAATACAAAAATTAGCTGGGTGTGGTGGCACATACCTGTAATCC 

CAGCTACTCGGGAGGCTGAGGCAGAATGGCTTGAACCAGGGAGTCGGAGGTTGCGGTGAG 

CAGAGATCGCACCACTGCACTCCAGCCTGATGACGGAGCAAGACTCCGTCTCAAAAAAAA 

AAAAAAAAAAAAAA 

Sequence 2043 

GAGAAGCCTGGGGGTCCTGGCTGAACTGGGCTGGGTGAAGGGGGCCCCCTGACCCCCTTG 

GGGTCCGGGCTGGGCTGGGTGAGGGGCGGTTTCCGACCCCCAGCCAGGTTCCCAGGCAGG 

ATGAGCTGGGGTTGGGGTGGCTAGGCCGTGGGCCTTGGGAGCTGGGCAGTCTGGGCTGGG 

CTGGGCTGGGCAGGGCGCCACATGGAAGCTGGAGGAGCAACGGGAGCGCTGGGCGTGGGG 

TGCAMTTGCCCAGTGCCTTCTGTTTCCCAGGCAGCTCTGTGGCCATGGATATGTTCCAG 

AAGGTAGAGAAGATCGGAGAGGGCACCTATGGGGTGGTGTACAAGGCCAAGAACAGGGAG 

ACAGGGCAGCTGGTGGCCCTGAAGAAGATCAGACTGGATTTGGAGATGGAGGGGGGTCCC 

AAGCACTGCCATCAGGGAGATCTCGCTGCTCAAGGAACTGAAGCACCCCAACA 

Sequence 2044 

A CACTCATCA ATTAGGTTTTATTTTTATTT 

AAI I 1 1 I I I ICCTTCTCCTCTGGCGACTGTGTGATGAATCCTTTCTTGCGTGATCAGGTT 

GCGGATAGACTTGTAAGGGTGTTTGCTGCATACAGTGTAAGCATTGTGACCGCCAATAAA 

CTTCAATGGTTTCTACTGAAAAAAAAAAAAAAAAAAAAAGGACGCGTCTACTTCCCACTG 

GGTCCCTCCCACAACACATGGAAATTCAAGATGAGATCTGAGTGGGGACACAGCCAAACC 

AAATCAAAAGGATATACAAAATAACCAGAAAACAATGAACAAAATGACAGGAATAAGTTC 

TCACCTATCMTAATMCTTTGAATATGTGTTAAATTACCTACCTAAAAGATAGAGACAG 

GCTTAATGGATAAAAAATGACTCAACAACCGTCTACAAGAAACTCACTTCACTTGTAAAG 

ACACACACAGACTGAAAGTGAAGGGATTGAA 

Sequence 2045 

GCCNCGCGTCCGTGAGAATACACAAGGGGGCACGCTTCCAGTAGATGTGTTGGGGAAGGA 

GGAGGGCAGAGGGGACAGGGGACAGGATTCAGCTTTGTGGTGGGTCCTGAGGGTTCCTAC 

CAGGGGTAGCCAGGATCTGGGAAACAGATCAGCGACTCTAGTCTGAAGTGGCTGCCTGGT 

TCGGGGGCTGCCTTCAGCAAGATTCAGGCAGGAGAGACGGAAATAGCCACCTTCCAGGCG 

TGAGTCCTGGAGATAAMATGGATTTTAACCTAGGACTGCCGGGAGCTGGCCCTCCGCGG 

CTGCTCAGACTAGGGCTGTGTGTGCTGGCTCTCGCCTGTTTCCGGTGTCTAACTGGCTTG 

TTTCTCTTTATGGCTTGGCTTCATTCCGACCTGGGGTGGGGCCACATNCAACCCACTGCC 

CACTGGCTGTCCGTCTGGCCTGCCCCCGCGGTTTCAACCACANTGGTGAAACAANCCTTG 
CAAGATNTACAACTCGCAACACCGGGTCAAGCAATCAGCTGCATTCCGGACCGGTGTGNA

AGACCGAAGGGG 

Sequence 2046 

CCCCCCGAATATCTTATCCTTAACATTAMTTGAATTTTTTGCAAATGATCAAAAGGTCA 

TTCCGAGTAAATTCTGTTGTATAGTGCAGATGATCAAGCTGAGTATTTGCCATGTTTTTA 

TTTTAGAAAAGAGATGTTGCTATAACATAAGTAAATACGATTCTCGTATGTGGCAGATAA 

ATTTACTTGTAATCTGCTCTAGAGTG AAATTA I I I 1 1 I ACATATAAGCATTGTCATCATT 

CTMGGATTATTGAATMTGAATATAAAATGTTCTTGTGTATTTGTGTATGTGTATATAA 

TTTTTTGMAGTTTCTTTATCCTATTGACCCTTCTCATAAACAGNAGCATATATATATTA 

TATGTAGTAGMTTTATATAGGMCATTGTCTTTTTCCCAGTAATGCTGATTCTAAACTA 

GTTATGTCAATTTCATGTAACATGACATTNAGAATAGTGGGGTGCTAAATATATTTAGAA 

ATGATTTCCAAMTTGNTGTATTTCTMCATAGAANGATATTTGTCATTTTAAAATAATG 

TAAAGAAAAAATGC 

Sequence 2047 

GCACCCCTCCCTGTTGACACAGCCTGGATCCAGAGTTCAGCAGACCTTGAGACAATGAAA 

AC AAACTTAGTAAT AATC A I I I I I CAATCATTGCAGTAATTATTGATTTGGACAAAAATC 

AATTGACGTCAMACCTTAMGTGACGTTTCTCTGCCTATGGAGTGGGTCATTCTTTTAT 

TCCTTTAGTTTCATAATAMTTTTCTTTTACTTAAAMAACTTATAGTTTGATGAAGAGT 

GAGATATATACCTCATCTCAAAGAATCTTCACACACACACTTATTAATTACAAAAGGAAA 

ATCAGTAATTTTGCAGTGGAGACATATGGCCAACTCCACCTTACCCAAGTGGCTGAAAGT 

CACTGCACCAGTAATGG 

Sequence 2048 

AGAAAAGCCNAGCCAACAGCTCTTAAAATCAGAAAAAACAANGGGAGTCCTTCCTTGTCT 

CNTCTGTGNTCNCNGGCCTTGTCTCTGAGACTNTCTGTGCCCNNAANCNNTNTNNTNGCT 

NTNANCTGATTCTANTTTTGNTNCCCATGGMTCTGTCCTAAGACTGGGGNTTTTTGNCA 

NATGACAGNCTTGCCNGNACNCAMTATCATMCAGCATTTNNNANCGANTTTTGCNGAT 

CMGTMNATANTTGCNTGACMTGACAGCTTTTTAACTCTTTCAAAGTCACCTAAAAGC 

TATTATTGCAGGAGGATTTANGAAGTCACATTCATTNAACACCCAAGTGCTATGGGTGAA 

NNATTCATGATAGCTTGGCCCACMGGTCATGMTTGAGGAGGGAATCTTGCTTTTCAAA 

AAANCMTGGMTGNTCCCNCCACTGMAMGGGNNATACGTTTTMTATTTTGGACCCT 

TCANAAAGGNTAANGAAAAAAACCCANGGTTCTTTCNAAAAAGTTAGNGAATAAGGGGGA 

ACTTAANTTTCATGGMNACMGCCCATTNTTTNAAAAAAAAAAAAAA 

Sequence 2049 

CNTACGAACGTCTGAAACGTGGAGGAACCTTCAGTTCTGGGAACTCCCTGCCCCTTTCCC 

GGAAAATTCATGAGTAATCCACCTGTTTAGCATATAATCAAGAAGTAACCATAGGCATAG 

TATATCMGCAGCCCACACTGCTGCTTTGCCTATGGGGTAGCCACTTTTATTCCTTTACT 

TTTTATTAAACTTGCTTTCACTTAAAAAAAAAAAAAAAAAAAAA 

Sequence 2050 

CGCNTCCGAAATCCAATCCTAATGAAAGAGATTGATAAGTGTGACTACAAAAGGTTTAAA 

ACTTTTTTCATAGCAAATTATCTCAGAAACTAAATTAAAAGACAAGGGAGACCAGGTGCA 

GTGGCTCACGCTGTAATCCCAGCACTTTGGGAGGCCGAGGGAGGTGCATTGTTCTAGCCC 

AGGAGTTCGAGACCAGCCTGGGCAACATGGTGAAGCCCTGTCTCTACCCAAAATACAAAA 

ATTAGCCAGGCGTGGTGGCTTATGCCTGCAGTCCCAGCTACTTGGGAGGCTGAGGTAAGA 

GGATGGCTTGAGCCCAGGAAATCAAGGGTGCAGTGAGCTGCNATTATGATTGTGCCACTG 

CACTCTAGCCTGCATGTCCAAGTGAATCCTACATNAAAATAAAAAGTNCAAAAANANAAA 

AAATGTGCCGGCCCGCTAGACTAGTNT 

Sequence 2051 

CCACGCCTCCGGAAATGCCTCTCTCCAGAGTCGGACCCTCACCTCCTTCCTGGAACTGCC 
TTTGGCCCCAGAACCATGAGACAATCCCCACCCTGAGAAGCTNCGATCACTGGGAGGAGA 
GAGAAAGCCTCCAGCTTTGGGATTCAGGCTTCAGAAGTTTTTAGCAGCCTTTGCTCATTG 
GAGAGGTGGGGAGGATAAGTCTATAAGGAATCCTATTTCCCCAGCTCTCCCACAGAGAGG 
ACAAAAGMGTCTTCACACCGTTGTGGAACTTTCCTGCAACTTCTGGATGCAGACAAGCC
TCAGAGCAGACTGTTCTGGCTCCAGNGAATATCGGCTGCCAAGCTGTGAGCATCCAGGGA 
TCCNCGTCTGCCTGGCTTTCCTGAAAGTCAGAAGGCGCCTTGGTCATACTGTGTGGGGTG 
NGTNGGATNTTNAGTTNTGNTCTCTTTTCTT^ 

GCCAACACCCCTGACAGCMTTGTGCNGCACTTGGCTTAATTCACACCCTATGAATAATT 

TTTNATATTTCMCTTGGAMAGGTGGTTAAGAAACTTTT 

Sequence 2052 

GATGGACTGTGTCATNCAGGACGGCCCTGCTGCATTGGCACCAAGGGCAGGTGTGAGATC 

ACCTCCCGGGAGTACTGTGACTTCATGAGGGGCTACTTCCATGAGGAGGCCACGCTCTGC 

TCTCAGGTAGGTCTGCAGAGTGTCCGTCGTTCCCCTCCCCCAGCTACTGTGATGCTGATA 

TGCTGCTCTGCGCAGGTGCACTGCATGGATGATGTGTGTGGGCTCCTGCCTTTTCTCAAC 

CCCGAGGTGCCTGACCAGTTCTACCGCCTGTGGCTATCCCTCTTCCTGCACGCCGGGATC 

TTGCACTGCCTGGTGTCCATCTGCTTCCAGATGACTGTCCTGCGGGACCTGGAGAAGCTG 

GCAGGCTGGCACCGCATAGCCATCATCTACCTGCTGAGTGGTGTCACCGGCAACCTGGCC 

AGTGCCATCTTCCTGCCATACCGAG 

Sequence 2053 

NCGCNTCCGGGCAGAGCCCCCGGAGCCTGGCCAGCCCTTCCGGCAGCTCCAAAGCCACAG 

GCAAGCCCCGAGGCTGGGATGGCCGGCCCAGGAGGGAGGAGGACGACGTACCTCCCGAGG 

AGAAGAGGCTGCGGCTGGGGCTGGAAGGGGGAAGCGCACAGCCCGAGGACTGCNAAGGAC 

GGGGAGGACGCGCCGCGGCCAGGCAGGGAGGAGACCGGCACCCAGACAGGTGGCGACGGC 

AGAGGAACACAGTGGCTCACGCCTGTAATCCCAGCACTTTGGCAGGTCGAGGCTGGCGGA 

TCGCCTGGGGTCAGGAGTTCGAGACCAGGCTGGCCAACATGGCGAAACTGTCTCTGCTAA 

AAATACGGAAGTTGGCTGGGAGTGATGGCACGCACCTGTAATCCCAGCTGCTTGGGAAGC 

TGAGGCAGGAGAATCGTTTGAAGCGGGGAAAGCGAGGTTTGCAGTCAGCTTGAGATCACA 

CCACTGCACTTCANCCACCTGGGGTGACATGAGCGACACTTCTGTTTTCAAAAATAAACC 

GAA 

Sequence 2054 

CTGTGTAGGACAGACTCTCTTTGACTCCCTAGGATTTACCCCAGTGCCTAGCATGTTTCA 
CAGCTTAGAGGAAAACAACATTTGTTGACTGACTTTTGATCTCC AI Mi l I GGTGAGATG 
CAGTGGCTTACACCTGTAATCCCAGCACTTTGGGAGGCTGAAGCGGGCGGATTACTTGAG 
GCTAGGAATTCAAGATCAGCCTGGACAACATGGCAAAAAATACCAAAAATAAAAAAAAAT 
AAATAAATAAAAAATTTAGCCAGACATGGTGGCAGGCACCTGtGGTCCCAGCTACTTGGG 
AAGCCAAATCGCTTAAACCTATGAGGTGGGAGGTTGCAGTGAGCCAAGATTGCACCACTG 
CACTCCAGCCTTGGTGACAGAGTGAGACCCTGCCTCAAAAAAAAAAAAAAAAAAAA 
Sequence 2055 

TCGACCCCGCGTCCGGGAATTTGGGGTGGAATGTGATGAGATTAAAATGTAGCTTTGGTA 

TAACTTCATGTGTATTTCAAAATATACTGAACGTCAACATGATTTGAATAAAGAAAATGT

ATTTTCTACTTGMCCACATAACACTGTTATTTAAAACAGTTTTCTGCAGTCTAAAAAAA 

AAAAAAAAAANNAACAANNAATN 

Sequence 2056 

CGTCCGGCAGAATGGCTCCCGCAAAGAAGGGTGGCGAGAAGAAAANGGGCCGTTCTGCCA 

TCAACGAAGTGGTAACCCGAGANTNCACCATCAACATTCACAAGCGCATCCATGGAGTGG 

GCTTCAAGAAGCCGTGCACCTCGGGCACTCAAAGAGATTCGGAAATTTGCCATGAAGGAG 

ATGGGAACTCCAGATGTGCGCATTGACACCNAGGCTCAACAAAGCTGTCTGGGCCAAAGG 

AATAAAGGAATGTGCCATTCCCGAATTCCGTGTGCGGGTTGTCCAGANAACCGTAAATGA 

GGGATGGAAGATTCACCAAAATTAAGCTATATNCCTTTGGNTNCCCTATGTACCTGGNTA 

CCACTTTNNAAAAANTTTACCAGACCAGGTCCAATGGNGGGATGGAGAACCTAAATCGNT 

TGNTCGGCCGGANTCAAAATTAAGGTTNTTAAAATTTGCCAAAAAAAAAAAAAAAAA 

Sequence 2057 

CGCNTCCGGAGAGAGCCCAGGGATGCCTTATGGTCAGAACAAATTTATAGACAACAAAAG 
GGAAGTGACCGTGCAGAAATCAGAAGTGAGGTACAGAAACAGCTGGACTGATTACAGCTC 
MCATTTGCCTTTTTTGAACAAATCTGAACACTCAGCAGTGTATGAGTGGTTGACCGTAT

NGGCTGCTGTGATTGGCCAGACCTCAGCTATTGTTACAGGCACATACTCTTAAGTCAGGT 

TTTCAATCCTATCTGACTATAMGTTAGGTTACAGTTTGTCCTCATGGACTCAAATTTAG 

MGTATGGCGTCCTTCTCAGGCCATATTTAGTTCAGTTTAACAAGTGCATATGGCTTCTG 

ACAAAGGTGTGGCCCCTTTAGGACTCCAAAGACGCTGTCACTTACATGGTATTCAGGGAA 

GACACAGAGGATCTGTGAGCAGCCTGCAGCCMAGCTTTTGAGATCATATTGAGATTTTT 

TTGTANTATANGAGGAGGGGTTCTGGCTC 

Sequence 2058 

CGGAAGCATCGACCTGCGAGCTCACAGAGCTGGGAGCAGAGCACCCACGCACACCCCGAA 

TGGCTATGGAAGCTGCAGGGCGCCAGGGACACTGGGAGTCCCTGCTCTCATGGCAAAGCA 

GGGACGGGGGACTTAAAAGCCACCAACAGGAAAATCGGGGAAAAAAGGGAAGATGGTGGT 

MCAGTTGGACACTATTTCTTGGCAAAACCGTGGAAAAACACGTTCTACACCAGCAGGTG 

GCAAATTGTGGCCGCCATCTGTGTTTGCAAATAAAGTTTA 

Sequence 2059 

CCCCTACAATGAGCTGTCCCGCCTCAGTGGCCTGCGAACCCTCAACCTCCACAACAACCT 

CATCTCCTCCGAAGGCCTGCCTGACGAGGCCTTCGAGTCCCTCACCCAGCTGCAGCACCT 

CTGCGNGGCTCACAACAAGCTCTCAGTGGCCCCTCAGTTACTGCCCCCGTCCCTCCGNGT 

CGCGGATCTGGCTGNCAACCAAGT 

Sequence 2060 

ACCCACGCGTCCGCCGATTTTCCAGGTGCCGTCTGTCACCCCTTTCTTTGACTNGGAAAG 

GGAACTCCCTGACCCCTTGCACTTNCCGAGTGAGGCAATGCCTCGCCCTGCTTCAGCTCG 

CACATGGTGCGCGCACCCACTGACCTGCGCCCACTGTCTGGCACTCCGTAGTGAGATGAA 

CCCGNTACCTCAGATGGAAATGCAGAAATCACCCATCTTCTGCGTCACTCAAGCTGGGAG 

CTGTAGACCGGAGCTGTTCCTATTCGGCCATCTTGGCTCCTCCGCTCTATTTGCAGTTCT 

TAAAGGGCTATTGTACTCTCTGGGATTGTACGAACTTGGACTGNATTGGAACTGCAACAG 

AAAATCTTCCAGGCAAGTGCCA 

Sequence 2061 

CCCTTTCGAGCGGCCCGCCCGGGCAGGTACTTTCATTCAGTGTACGAGGGAAAAAAGCAT 

GTATTGGGCCACCGGAAGACAAGCTAATAAATAGGCTGGAAGTAATATTCTACCAGCAGG 

MCTCAACAGCTCCAGTTAAATGCTTTGATATAGNGGCTCCTTTGCAGAGCCAAAACAAG 

ATTTATTAAATTTCCTTCAAACTGTTTATCTTTMAACAAATATAAGGTTTTM 

TGCTGMGCAAATGTGMTGCCAAAGACTATGTTTTGCAGTTTTGCTTTCCTCCCAATAA 

ATATTMTGTATGTAATTCTAGAGGGTAAAAATGTMATAGGTTTGGACAATATTTGCAC 

CCTTGTTTGTGTTATGAAAAAMTTTTTCCAAGGCGAGCTAGAGAGAAAGATGTTTGGCA 

TGCCAMTTAACTTGCATGTTTGTTAAAAAAACMACACATGTTTTTGAAAAGAAACCAG 

ATCTGAACGTGTATTTGTTGAAGTTTTGCAAAA 

Sequence 2062 

CCCTTTCGA GCGN CCGCCC GGGCAGGNAC I I M I I I I I U I I I I I I II I I I I I I I I I II I 
MMIMI I M Ml Ml Mil l INN M l MM I I M l I I Ml Mil l I I MM Mil l 
TTTTAAMAAAAAANNAMANNNTNNNNTTTT^ 

TTNTTTNGGGNTTNNTNNAMMANNNANNNAAANCCCNNCCNGNNNTTTTTAGGGNAAA 

MMANTTTTNCNGGG NTNAAM AAAATTNNTTTTTGGNTNCCAAAANNTNNGNNAAAAA 

ANNAAANNANCCNNNNTTTTTTNNNNNGGGGCNCCCCCTTTTTTTNCCCNTGGNGGGGNG 

GGGNAAAAAGGGGNTTTTTTTNGGANCCGAMMAMCGGGAMNTTATCCCTTT1TT 

GGGGCCNTAAAC I I I I II I I NNCCNCNNTTNTTTTTTAAAAANCCCCCNNCCCTTTNTCC 

CTGNTGGNNCCCTTTTTGGCCCGGGGAAACCCNN I I I I I I I I I I I I I I I I I 

Sequence 2063 

AAGGGAAAATGTCACGTANACTAGATCAGGGAACAAAATCCTCTCCTTGTGGAAATATCC 
NATGCAGNNNGNTGATACAACTTANTATCTTATTGCCTAANAAAAAMTTTCTTATCATT 
GTTTCANAAAAGCAAAATCATGGAAAATTTTTGTTGTCCAGGCAAATAAAAGGTCATTNT 
MTTTAGCTGCAATTTCAGTGTTCCTCACTAGGTGGCATTTAAATGTCCCCTGATGTCAT 
TMGCACCATCCAAMAGTCTGCTTCATMTCTATTTTCAAGACTTGGTGATTCTGANAG 
TTTTGGTTTTTGNGACTTTGTNTCTCANGMAAAAMNATTC 

TATAATTCAATTTAAATATGNTGNGGCGTCTCATCCAGGATNGGATAGGTTGTCTTCTAT 

TTTCCATTTTACCTATTTAC 

Sequence 2064 

CCCTTTCGAGCGNNCGCCCNNNCAGGTACAGACTTAGAAATTATCTAAAGATTTCATCTT 

TTTACCTCATATTTCTTAGGAATTTAATGGTTATATGTTGTC I Mill I CCTATGTCTTT 

TGGCTCAAGCMCATGTATATCAGTGTTGACTTTTTCTTrCTTAGATCTAGTTTAAAAAA 

AAAACCACATAACAATTCTTTGAAGAAAGGAAGGGATTAAATAAI Mill ICCCTAACAC 

mCTTGAAGGTCAGGGGCmATCTATGAAAAAGTAGTAMTMGTTCTTTGTAACCTG 

TGTGMGCAGCAGCCAGCCTTAAAGTAGTCCATTCTTGCTAATGGTTAGAACAGTGAATA 

CTMGTGGMTTGTTTGGGCTGCTTTTMGTTTCTCTTAATCAAAATTACTAAGATGATA 

GAATTCMGAACTTGGTACATGTATTACTTGGTGGTATCGATAATCATTTAAAAGTAAAA 

GACTCTGTCATGCATTTTTCCCCATTTC 1 1 I I 1 1 1 I I CCCTGTCTCCGGGGCCAACCCAA 

GTGGGTCTTCATTTT 

Sequence 2065 

CCCTTAGCGTGGTCGCGGCCGAGGTACNCGGNGTCCAAGATGGCGGATGAAGCCACGCGA 

CGTGTTGTGTCTGAGATCCCGGTGCTGAANACTAACGCCGGACCCCGAGATCGTGAGTTG 

TGGGTGCAGCGACTGAAGGAGGAATATCAGTCCCTTATCCGGTATGTGGAGAACAACAAG 

AATGCTGACAACGATTGGTTCCGACTGGAGTCCAACAAGGAAGGAACTCGGTGGTTTGGA 

AAATAACATCTGGGCCTGCTGGAGAAAAAGAAAGATTACAAACTTCGTTGCATATGACTA 

CCGTAAAAAACAAGAATACCTCAAAGCTCTTCGGAAGAAGGCTCTTGAAAAAAATCCAGA 

TGAATTCTACTACAAAATGACTCGNGTTAAACTCCAGGATGGAGTACTTTA A I I I I I I II 

TNTTNATANTTNCCAGGMCATTTTCTAATTATGTTATATAAATGGGTATGTGATATGTG 

NGCTATTTGTGTGCTAATGTCCTAAGTGAAGTTCTGCAGACCATCTGGGTCAAAGTGCAT 

TTCGCATGATCCAAAANATGAAGAACCACTTGTTTGTCACGGGAGACNAGGGAAAAAAAA 

A 

Sequence 2066 

CTTAGCGAGGTCACGNNCNANGAACGCGGGGNGNTCAGGAAGATNTCTGAAGAGTGCAGC 
NGCCTGAACCGAGCCCTGCCNAACAGCTGACAATTGCACTGCAACCATGAGTGA 
Sequence 2067 

CATGCAGAANTCCTCGCTGGAGTTTCATAAGGCCAATGAGTGCCAGGAGCGCCCTGTTGA 

GTGTAAGTTCTGCAAACTGGACATGCAGCTCANCAAGCTGGAGCTCCACGAGTCCTACTG 

TGGCAGCCGGACAGAGCTCTGCCAAGGCTGTGGCCAGTTCATCATGCACCGCATGCTCGC 

CCAGCACAGAGATGTCTGTCGGAGTGAACAGGCCCNGCTCGGGAAAGGGGAAAGAATTTC 

ANGCTCCTGAAAGGGAAATCTACTGTCATTATTGCAACCAAATGATTCCAGAAAATAAGT 

ATTTCCACCATATGGGATTCCCAGACCATGANGCCAAAGTAATTCCTAATCCCCACACAC 

AGGAATGGCATGGGACCTGNGATTTTGAGTTTTCAAGGGGCCGTAAGNTTTTNATTCTTC 

ACACCTNCAAATTACCCGACCCCAAAAAAAAAANAAAAAAAAA 

Sequence 2068 

CCCTTAGCGTGGTCGCGGCCGAGGTACTTNTCCGATTTCAAGAACTGATGAAATTAGAAA 

AAACACCTACAGAACATTGGATAGCCTGGAGCAGACCATTAAACAGCTCGAAAATACAAT 

CAGTGAAATGAGTCCCAAAGCCCTAGNTGATACCTNATGTTCTTCCAACAGAGATTCTGN 

AGCAAGTTCATCCCACATAGCCCAAGAGGCCTCTCCCCGACCCTTGCTAGTTNCGGATGA 

AGGTNCCACTGCCCTAGAGCCCCCTACGTCGATACCTTCAGCTTCACGTAAGGGCTCCAG 

CGGGGCCCCACAGACGAGCAGGATGCCTGTCCCCATGAGTGCCAAGAACAGACCCGGAAC 

CCTGGACAAACCCGGCAAGCAGTCCAAACTGCAAGAACCNCGCCAATATCGNCAGGGCTA 

ATGGAANTNCTAAGAAATCTTGGNNGGGGACTNTTAAAGCCTACTTTCCCCTACTTACCT 

GCTTCTAMGATTCCAAGGCCNTTCTTCCAAACTTTTGGG 

Sequence 2069 

CCCTTTCGAGCGGCCGCCCGGGCAGGTTCAtGGATNNGAGCAGCTTCACCAACCCCTGCA 



AAGTGACTCTGAAGAAGACGACAAGCCCTGCTCCAGTCACACCCGGAAGCTGACTGGTCC 

ACGCACAGCTGMGCATGAGGAAACTCATCGCGGGACTMTTTTCCTTAAAATTTANACT 

TGCACAGTAAGGACTTCAACTGACCTTCCTNAGACTGAGAACTGTTTCCAGTATATACAT 

CAAGTCACTGAGAGAACATCACCACCCTGAAGCCAGAGACTAACACTGCAGGACTCAGCA 

GGACTATTTAAGAAACAACTGAGGCATCAGACCAACTTTCCCCACAAGTCCTCGGATCTT 

TCCTGCCATGCTGATGCCATATATTCCAACGTGATCAACCTGGCTCCCCAGAAGGAGGAC 

GACTTTGCTGTCTACACCAACATGCCCCCTTTTCATCACCCCAAGAGGACATTGCCAGAC 

CAAGTGGGAATATGTCTTCCATTGTATTTCCAACTGATGGGGAAAGCCTANATGAAGATG 

CTCMGMGNGGGGGGTCMGACCCTGACCCCAGCNTGMTCTTTGGCATTACCCTTTGC 

TTTAAATTTAATGTGGTGGNGNTTTAAAAAAAAAAAAAA 

Sequence 2070 

CCCTTTCGAGCGGCCGCCCGGGCAGGTAC I I 1 I I 1 I M i I I i I 1 I I 1 1 I M I CNGCGGGG 
NGNNCTACTTNANMTCTTTGGCNGGTTTNNCNGTTTTNGGTTTTCTNANCNCTTGGNCT 
GGTNCATTTGGTTTNGMNMTCNGTTNCTTCNGATT^ 

AANGGTTTTNGAAANCTTTNNCCTTCTTCNGTNGMGTNGTGGGTTTTTNANAANANAAA 

MTTGGGGTTNATCATTTTTTCT^ 

ANATTAATAC 

Sequence 2071 

CCCTTTCGAGCGGCCGCCCGGGCAGGTNCNGGTTANCAGACCCACAACACGAAGCTCCTG 
CCTTTTAAGACTACAAAGAGGCAGCTCAAAATTAGACTGCACAGGTAAGCGAGGAACTGC 
AGTCTAAGCCTGGACTCTGCCTTCTGCCCTCCCCCCGCGTACTCAAGCAATAAAAT 
Sequence 2072 

CCCTTTCGAGCGGCCGCCCGGGCAGGTNCTN I I I I I I i 1 1 ! I NTTTNGCNGAGTGAGCTA 
CTNTAGGATCTTNTGCTGGTTNTACAGTTTT^ 

TNNGNTTNGMGAATCTGNTTCTTCTGATTTTTTAACANAAGGTTTTTTGACAAA 

TTTGAAMCTTTTACCTTCTTCTGTTGMGTTGTTGGTTTTTTAGAAGA 

GTTTATCATTTTTTCTAGGTCTGMAGTTTTGCGCAT^ 

TAACTAAAAATCTTAAMCTTGNTTTTTGAGAATATTTACTCTTCTGACAAATTCTTCGA 
TTAAACTTTCCTCAGATTTTAAGTTAGCTAAAACAAAANTTCCTTGTTTGNGTTTNTTM 
TNTCGAATGCTNACTCTGTATCTTTCAAGTTTTNCMTTTTTTCGATGTCTTAGCATCAA 
AAACAGATTTTAAAACGTCTTCAAATTACTTTTAAATCTGTTCCTGCAGCTAAAACNGTC 

Sequence 2073 

CCCTTAGCGTGGTCNCGGCCGAGGTACGTGCTTATACAAGATGTCAATTATGTGGTCGTC 

CACATGCTGTATTACGTAMTTTAAAATTTGTAGMTTTGCTTCCGTGAACTAGCTCACA 

AAGGACAAATACCAGGTATTAAGAAAGCGAGTTGATAATATGATAATCACAGATCCAATA 

GCAGATATGATCACAAGAATCAAAAATGCCTTCACACGTAAACACAAAAATGTTATTATT 

CCTCATTCTAAGAAAAAAGAAAGAATCTTACAAATCTTCTTAGATGAAGGATATATAAAA 

GGATTTACTGTATCTGGTGMGTTAAAAAAGAAATTAATGTTGAGCTTAAATACAAAGGA 

AATACAAGTTCAATTGNTGGAATTAAAAGAGTTTCCAAGC 

Sequence 2074 

CCCTTTCGAGCGGNCGCCCGGGCAGGTGGGCAGGTACTTCAGCAAGTCCTCTTTCTCCTC 

AGCAGTAAGCTCAGCCGGCAGGTGCCTGACCAGAAGGGTTCGGTCGCCCCGAGGCGGGGA 

AAGCGAGGAGGAGCTCGTGCATCCCCTTGATATCGCAAGCGGCTGCTCGGGAGCTGCCAT 

TTTCCTTGGAGAAGCAAAAACAGAAATCGTGGGAAGAAGTCTCAGTCAAAATCGCGGCAT 

CMCACMGCTGGGAGAMTATTTTTTCCGCCTCGCGCTAAGGATTCTGGAAACCAGGAA 

ATACCGAGAAAGAAAGTCACCTTCTCGCGAGAACTGCGCCACCGAAAAGCGGCAACCCTT 

CGAAGACTCTTCGGGGAAGGGCGCGGTGCTAATGATTTAAATTCCAAGGGGTCTNCGGAA 

AGACTTACAAAGCCAAAAATTGGCCCAAAAGATGTGCGANGGTTAACACAAGTTGTCAAT 

CAAAGAAAGGAAACAGGAACCCCAACCCCTTTAAGGA 

Sequence 2075 




WO 01/070979 



PCT/US01/09126 



TABLE 1 

347/467 



CCGCCCGGGCAGGTACAAATTGAGCTCTCTATTCATAACCTCAATGTATGTATTCCTGCC 

CATTMTATACTTTGCACCAGCAAAAGCGATTTCCMCATATGTGTTTTGGAGGTAATTA 

AGTMCTCTGTATAAAAATAAATGCACTTTTCCCTCCTTTCCCCAGTGAATGGAAAACTT 

CCATACTTTCAAAATAATMTAAAAAAATMTTTTTAAGAGCMCAGCCCTCMCTCTTT 

GCTGGTGCCTGCCATACTGCCTTTCTTCACTCCATTCTTAGCTCTGCTAGTTTCTTCTTG 

TATGTCATGATAAAAAGGGAATGTGGGTGTGTAACTTTTGTGTATGTCCCGTTTCCAAAT 

TTCCCTCTCCAAAAAGCCAACCAAATAAACAAACAAACAAACGAAAAAAACAGTGCAACA 

AAACACAAATAGCATTCCAACAGTT 

Sequence 2076 

TTTCGAGCGGCCGCCCGGGCAGGCACATAAAACATTATTCCTTCCTTGGCCTAAAAACTC 

ATCGCCACCTACATTAAAGCTAATATGCCTGATTACTGTTTTTAGAGMCTTATTTTATT 

AGGGCAGTTCCAAGCTCAAAAATACGCTAACTGGCACCTTGTTAGCYACATAAAAATGCA 

CCCTAGACCCGAAACTTACTAGACTCATTATAAAATTTTCTTTAAGGTGTCCACGCAGTC 

CCTGGTCACACTTGAAGCAGTCCGGAGAAATATCAGCCCTACCCCAGTAATCCCCAGAAG 

GAACTTACAC 1 1 I 1 1 1 1 I AATCTTTTCCT ACAACTTC AT ATTTTAT AAATAAAAAG ACAA 

AAATGTCAGGCCTGTGAGCTGAAGCTTAGCCATTGTAACCCCTGTGACCTGCACATATCC 

GTCCAGGTGGCCTGCAGGAGCCAAGAAGTCTGGGAGCAGCCCGAAAAACCACAAAGAAGT 

GAMCAAGCCAGTTCCTGCCTTAACTAATTAACCCACCTTACGACATTCCACCATTATGA 

CTTGTCCACCATTATGACTTGTTCCTGCCCTGCCCCAACT 

Sequence 2077 

CCCTTTCGAGCGGCCGCCCGGGCAGGTTTANGTCANAGTCTTCTNTTCTNTTCTNNNTGA 

GATGGAGTCTTGCTCTGTTGCCAGACTGGAGTGCAGTGGTGCGATCTGGGCTCACTGCAA 

TCTCCACCTCCCGGGTTCAAGCGATTCTCCTGCCTCAGCCTCCCGAGTAACTGGGACTAC 

AGGGTGCGCGCCACCAAGCCCAGCTCATTTTNGTATTTATAGTAGAGATGGGGTTTCACG 

ATGTTGGCTAGGGATGGGTCTCGATCTNTNGGTCAGAGTCTNTTNCTGTAAAATATCCTT 

GGGTAAAAGAAGCAATTTTANACTGTAAACTGATGNCAANATGCTTTAAGGGAAGAAGGC 

N 

Sequence 2078 

TCCCTTNCTTTCTCGCACGTTCGGCCGGCTTTTNCCCGTCAAGCTCTAAAATCGGGGGGG 
CTCCCTTTAGGGGTTCCGMTTTAAGTGGCTTTACGGGAACCTTCGAACCCCAAAAAAA 
Sequence 2079 

CCCTTTCGAGCGGCCGCCCGGGCAGGTNCAGGGTCTGTCAGAAACTGTTGGAATCTTACA 

TAAAGTCAAGTCTCAGAAATGTCCGATGCTTCACCATATTCTTATATTCTATGCAATTGT 

TGTCTGTGCACTAATCATCTCGACCTTCTACATGAGATACAGAATTAATACTCTGGAGGA 

GCAGCTGGGGTTACTAACCTCCATTGTGGACACCCATAATACTGAACAGGCAGCACCATC 

TGGCCTGAGGTCACAAGTACCTCGGCCGCGACCACGCTAAGGG 

Sequence 2080 

ACCNTATAACGGCCGCAGTGTGCTGGAATTCGCCCTTTCGAGCGGCCGCCCGGNCAGGTA 

CGCGGGGNTGGTTCCAACTTTTCTGCTNATCTGGGAGGTGNTGGGCGCGGACAGTCNAGA 

TGTCAGAGMAAAGCAGCCGGTANACTTAGGTCTGTTAGAGGAAGACGACGAGTTTGAAG 

AGTTCCCTGCCGAAGACTGGGCTGGCTTAGATGAAGATGAAGGATGCACATGTNCTGGGA 

GGATAATTGGGATGAATGACAATGTAGAGGGATGACTTCTCTAATCAGTTAACTGAGCTG 

AAACTAGAGAAACATGGGTTATAAGATGGGAGACTTCATAGCCATCCAGAAGAAGTGCTG 

AAGTAAACCTAAAACTTGACCCTGCTNAAATACATTGTAGGGGCAAGAAGAACCCAAGGA 

ATGGGGACACT 

Sequence 2081 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACGCGGGGNAAGTGTCGGCGCCGCCACTGTCCG 

GCCACAGCCTAACGCTCTTNGCTGTCGTTTGNGGNCTCGCGCAGGGCGGCCCCGGNTCTG 

GTGTTTGGCNGTCGGAATTAAACAACCACCATGTCCGAGCAAAAAGGCAAAGACCAAGAC 

CACCATAGAAGCGCCCTTAGCGTGCAACATCGAANGNGTTAGNCATGTTNGACCAANNCA 

CAGATTNGAGGAGTTCAAAAGAGGCCTNCAACATGAGTTGATCAAGAANANGAGATGGCT 
TCATTNGANAANGGAAAAATGCCNCAAGGNNNNGCCTNfMGCTNACTCTAGGGGAAATAAA 
CCCCNACCTGANTNNGAAAATCTTGAATGGCCCATGGATGAAATGAGGCCCCCA 
Sequence 2082 

CCCTTAGCGTGGTCGCGGCCGAGGTACTGTGATATNTCACATATTTTTGAGAAAAATTCC 

CAAGCCAGGCGAATGTGGATTGGAATAAAGACATAGGCAGTGTATACCACCATAGCAATA 

ATGGTTAGTAAGATGGTGTTAAACATAGATCGCTCCCAGGGCTCTAAAACAGCACAGCAG 

CTAATGATTTGGTATTGATAGTAGAGCCAGGAGAAATATTCCTTCACACGCCTCAAATCC 

ATGGTTGGCTCCTTCMGCTGCAGTAAGTTTGTCCTAAGAAAGTCCAGGTCTGGTTCTTC 

AGCCTTGCTCCTTCGCGAAATGATCCTGTGTGGGTTAGTTCTCCTCTCTGGGTTGCTGTT 

TCCTCATCTCCCAGTTGGGTGTATCTCCCTGCGGCTTAGGTGAGCGCCGAGGCTTTGGCT 

CCTCCCCAGCTGCTGCGAGCTTCCGCTGCCCCCGCGTACCTGCCCGGGCGGCCGCTCGAA 

AAGGG 

Sequence 2083 

CGGGCAGGTACCAMGCCTT ^GNC CCCATTTTCCATCATACGAATAGTATTCCCTGTTG 

CTAAGCCGATGATACATTANCCTTTTCCCAATAGGNGNGAGNGGATGGACTGAATGGAGA 
A 

Sequence 2084 

CGCTCACTGCCCGCTTTCCAAGTNCGGGAAAACCTGTCGGTGCCAGCTTGCATAAATGAA 

ATCGGCCAACCGCGCCGGGGGAAGAGGGCGGTTNGCAGTAATTGGGCNGCTCATTCCAGC 

TTCCTCCGCTTCACTGGACTTNGGANGGCNCTCGGGTCCGTACNGGCTGGCGGGCNGAGC 

CGGGGNATCAAGCTCAANTTCAAAAGGCCGGGNAAATACCGGGATAACCCACCAAGAANT 

CAAGGGGGGATAAACGCAAGGGAAAGGAACATGGTGAAGCCAAAAAGGG 

Sequence 2085 

NCNGANCNGGGAGNGGGGAGAGCCCGCGGAGGNGCCCCAANCNCCCNAAAGCGAGNCGCA 
NNACGCGCGCNCACGGGCCGCGAAAGACAACGCAAGACGGGGAAAACCCCGCCGCNACCC 
AACANAACCGCCGNGCAAGCACAANCCCCANAG 
Sequence 2086 

TTTNNAGTTTTNGGGANCCACTAGTTCTNGAGCGGCGAGGTACTTATCAANAATGGGGTC 

CNTAMGACATAATCTTTGCCAGTAGATGCAGAGTTGCTGGTAGNTTATGACTAATTCCA 

AGACCGTCCGAAATCCCTGGGCTGTGTTGAAATGTGNNTTCATGCTCCCTCGCTCCCAAG 

CATAGACCGNCAGGAGCTCCAGGGCATACTGAGGNGGCAAGCTTCCAAGCTTCTTCTTAC 

AATTTTGGACCTGCCCGG 

Sequence 2087 

CCGGNCAGGTACACTCGGMATGGAAGGCGANGTTTGAGTCTTTCTTCATCTCGGTATGG 
AAATCCACCTGCAGCTGTGGGTCCATGACGAAAGGGATGATCGGTGTCCCTGTGATTATC 
ACACAAGAACCAGTAGACAAGGACACAGGCCGTGTGTGTGGTACCT 
Sequence 2088 

CNGGACTCACGCGGCGGCGGGAAATGTNATNCGATCACAATTCCACACACATACCNCCGG 

GANTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnAnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnAnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnATCAGGGGGATAANGCAGGAAAGAANATGNGGAGCAAAAAGG 

CCAAGNAAAAAGGCC 

Sequence 2089 

TTTNGGGGACCCACNANTNNGGNGAGCGGGGCCACCGCGGNGGAGCCCCAACCCGCCCNA 

CAGNGAGGCGCANNACGCGCGCCCACTGGCCGACGANNNACAACGACGNGACGGGGAAAA 

CCCNGGCGGNACCCAACAAAANCGCCNGGCAGCACAANCCCCNNTCGCCAGCGGGCGGAA 

NAAGCGAAGAGGCCCGCACCGAACAGCCCCTCCCAACAAGAGGCGCAGCCNGAAAGGCGA 

AAGGGACGCGCCCNGNAAGCGGCGCACAAAACGCGnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnAGCGCCCGCNCCCTNCGCGNANCAACCCCTNNC 
CNTCNCGCCACGGACGCCCGGCTNTCCCCGNCAAGCNCTAAAAACGGGGGCCNCCCACAA 

AGGGGGCCGANANAAGAGCNNNAACGGGNACCCNCGACCCCAAAAAAACNNGGAAANAAG 

GGGGGAAGGGGNCAACGCAAGNGGGGCCAANCGCCCCGGANAAAACNGGAGANNACCGCC 

CCNCNGAACGGANGGAAGACCANCGGCCCNAAAAAAAGGGGNCCCACGGGGCCAAANCAG 

GGAACAACACACAAACCCCAAANCCGGGGCCANNNCCNNTGGAANCAAAAAAGGGANANG 

AGCCGAACACCGNCCCAATGGGGGA 

Sequence 2090 

NATTGCGAANTGGGCCGCTCTTCNCGCTNNCTCGCTCACTGACTCCGCTTGCGCTCGGTC 

CGNNCGGCTGCCGGCGAGCGGGTATCAAGCTCACTCAAAGGCGGGAAANACNGTTATTCC 

ACAAGAANCAAGGGGGAATAAACCGCCAGGGAAAAGAAACAATGGTGAACAAAAAGGCCC 

AGCAAAAAGGGCCAAGGAAACCCGAAAAAAANGCCCNCGGTTGGCTGGCGTTTNTTCCAA 

TAAGGCTCCGGCCC 

Sequence 2091 

CCCTTANTTTNGNCNTTNNCGANGNACCACACACATAGGTAGCCNGCATTCATGGAACAG 

GCACCGTGGGCTGGGCTGCACCACACCATCTTTCCATGTGTTATGTCTTTCTAGAGACTT 

CTTGAAMTTGGTAGGATTATCATATCATATGTTCTTGGAAACATCTGTTGACTATTTCT 

GTACATCATGGCTCGGACTTGGGTCAAGCTCTTGGCACCAATGTCCTGGCATGAGTGTTG 

GATGCCAGCMTCAGGTMGGGACAAATTTGTGGATTGACCCTTTGTCCTGCACAGCACC 

AGACACTCCCTGGGCCACTTTGATTTTGTCAAGCTTCACTGAAATATCTGTTCTGGCTGC 

TGAGGTGCTTGTCCCATGGCATCGMGAGMCCCATACCGCGATATTTCTTTAGCNCGGA 

TCCCATCGGAAAAAGAAAGTANTCACCAGGGGGCCCTAAGTGGGTGGCAGCCAGGAAGAG 

AGCCCCATCATGGACTGNNGGAGGCCCCAAGGGGC 

Sequence 2092 

GGAGCTCCCCGCGGTGGCGGCCGAGGTACATTTTTAMGAGTTGTTTTTTGGCCGGGCGC 

NTTGGCTCATNCCTGTAATCCCAGCACTTTGGGAGGCCGAGGTGGGCGGATCACGAGGTC 

TGGAGTTTGAGACCATCCTGGCTAACACAGTGAAATCCCGTCTCTACTAAAAATACAAAA 

AATTAGCCAGGCGTGGTGGCTGGCACCTGTAGTCCCAGCTACTTGGGAGGCTGAGGCAGG 

AGAATGGCGTGAACCTGGAAGGAAGAGGTTGCAGTGAGCCAAGATTGCNCCCCTGCACTC 

CAGCCTGGGCAACAGAGCAAGACTCCATCTCAAAAAAAAAAAAAAAAAAAGTACCTGCCC 

Gnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnG 

TTAATTGCGCCCCTTGGCGTAATCATGGTCATAAGCTGTTTCCTGTGTGAAAATTGTTAT 

TCCGCTCACAATTCCACACCAACATACGAGCCCGGGAGCATTAAAGTGTAAAAGCCTGGG 

GTGCCTAAATGAGGGGAGCTMCTCMCATTTAATTGCGGTGCGCCTCACTTGCCCGCTT 

TTNCAATTCNGGGAAACCTTGCGTGNCCAGCTTGCANTTAATGAAATCGGCCCAC 

Sequence 2093 

CCCCGCGTCCGCCTCGNAAATTGTTGATGCTCTTTCCCCTCCCCGAGGTCTCGCATNCAA 

ANCCTGGTGGGCTGGCCTTGTGTGGCTGCTTCTCCAGGCCTGGTCAGNACCCAGCAGGCT 

CAGGGTCTGCTCCTGATGCTGNGCTCTGGGACAGGCACGCCACTGTGNGAAACACTAAGC 

NAGGTAATCGAGCATTTNGTGATCACAGACTCCAGCTTCCTGGTCCACCCAGCATGTAGT 

CAGCACTCTGACCTTNACACCAGAGCTCCACAGCGGCTAGGAGTTGACTTCCTGTGTCAT 

GACCTCAGGAAATAAATTTCCTTGACTTTAAAAAAAAAAA 

Sequence 2094 

TTCTGCTGAGACGCGTGTGGCTNCCTCCCCGCAACANCCAAAATGNTGAAGCTGATCGAG 

AGCAAGGAAGCTTTTCAGGAGGCCCTGGCCGCCGNGGGAGACAAGCTTGTCNTGGTGGAC 

TTCTCTGCTACGTGGTGTGGACCTTGCAAAATGATCANGCCCTTCTTCCATTCCCTCTGT 

GACAAGTNTTCCAATGTGGNGTTCCTTGAAGTGGATGTNGATGACTGCCAGGATGTTNCT 

GCANACTGTGAATTCNAATGCNTGCCAGACCTTCCAGNTCTATAAAANGGGNCAAAAGGN 

GGGGGNNNTCTACNGNGCTAACAAGGAAAAGCTTGAAGCCTNTATTACTGAATATGCCTA 

ATCATGCTCTGAAAAGTGGGACCAGCTNCCAAGCTGNTTNAAACCTCGTACCNTTNTTAA 

TTTGCTAAAAACTATGAAAGTGTGGAGAGGCTATCCCAACTGNCATCTGATTATTAGTGA 
CAATAAAAAAANTAATTCTACCCCTTNANAAAAAAAAAAAAAAAAAA 
Sequence 2095 

TGTGTAGCACCTGNGGNGTCCTTGNGTGATTATTCTTGTNCGAGGTACTTAGGGCAAGTC 
ACATGCCCTCCATCCCNTGGCTCANAGATGAAGAGTAAATCCAAAACATGTGCCTCGCTC 
TTGGTCACTAACTGCTGNCCTG 
Sequence 2096 

TCGAGCGGCCGCCCGGGCAGGTACTTTNTTAATGCCTTNGTTGGAGTCCTNATCCTCATC 
TTTAAAAAAAAAMAMCAGNTTANCCTMGCCANATTCACTTTTTTTAGTTNACAAAAA 
GGATTAANTNGCCACANTGTGATTT 
Sequence 2097 

ATTNNCCCTTAATCATCTCACGCCCCATGTATGATTCTCAAAGNGCCTAGCGTGANCANC 

NGTCCCTNAGACCACACCAATTTCTTNATGTCNCNCTCAAGAAAGCCAAAATGACAATNA 

TAANGCCATCTCAANCNCAATANCCTACCANAACCACCCCNCGGNCTTATCTANACTTCA 

ACTCAAACTCTCTGCCTCCTTACTNTCTGGGGAGCTTNAAAACCANNTNACTNATAACTT 

TAAAAACCTNTCTNTAAAATNTCANAAACCACANCCTCACCATTNNCACAACCACCCCCA 

ACACCAAANANNTTCCCAAAACAACC 

Sequence 2098 

CCCTTAGCGTGGTCGCGGCCGAGGTACACCAAGACCAATTGCTAAAATCTTGGATTATGG 

AAAATTTAAGTATGAMGAAAGAAAAAACMAAAGTTGAAAAAGAAAAACAATCTTTCAC 

AAACMTAGAGAAATTCGTTTATCTTTTGGAATCAATTTAAGNGATATAAAAATCAAAGC 

MAAAAAGCCAAAGAATTTTTATTAGATAACGACAGAGTAAAAGTGGCTCTTCGTCTTAG 

AGGGCGTGAAMTACAAGACCTGAACAAGGTAAATTMTTTTAAATTCTTTTTTTGATGA 

AGTAAAATCGATTGCAAMTTAAGTMAGAAATGCMTCAGTTGGTMTTTTTTA 

Sequence 2099 

NGNCCTTNCGAGCGGCCGCCCGGGCAGGTACAAATTGAGCTCTCTATTCATAACCTCAAT 

GTATGTATTCCTGCCCATTAATATACTTTGCACCAGCAAAAGCGATTTCCAACATATGTG 

TTTTGGAGGTMTTMGTAACTCTGTATAAAAATAMTGCACTT7TCCCTCCTTTCCCCA 

GTGMTGGAAAACTTCCATACTTTCAMATAATMTAAAAAMTMTTTTTAAGAGCAAC 

AGCCCTCAACTCTTTGCTGGTGCCTGCCATACTGCCTTTCTTCACTCCATTCTTAGCTCT 

GCTAGTTTCTTCTTGTATGTCATGATAAAAAGGGAATGTGGGGTGTGTAA 

Sequence 2100 

NCCTTAGCGTGGTCGCGGCCGAGGTACACTGGAGGCTGGAGCCTGCAGATGGCATGGCTC 

TGCGGCTCACCTTGCTGCAGTTGGTGGTGGTGACAGAGACTGCAGCTTGACTGTAGTGAA 

TTTGGAMTTATCTGTCTGGMGCTCTGAGTTTATCTTGGGACCTCAAGAGGAGAGGATC 

ACCCAACTCACAGCAATCAAACTCCAAATGGTGCTATAAACTGAACCACACATGGACACG 

TCAGTCTTCCGAGGACCCTTAGATCAACCCCAGGAGGAGCCCTAGCTGCTGTTCCCCATT 

CGACGCCCCTTTCCAGCAGG 

Sequence 2101 

NAGGGGCGGGMTTTTGGGNGGGCCCCCCTTTCTTANAATGCATTGCTTCCGNANGGCCC 
GGGGNCCCCCCCCCCAGTGGTGGATGGGATATTCTTNCCAAAAATTGGGGGGGGCCCTTT 
TTNNGGNCCNNAAAAAACCCCNGGGGCCGNCCCGGGGCCAAGGTTACCTTGGACTTGGAA 
MATTTGGGGCNTTTTNTTTTGGGGGNNCCTTTTNCCCNNNNNNNNNNNNNNNNNNNNNN 
NNNANGGGCCCCCGGGGTTTNTTNCCNTTTTCCCAAAANAGGCCCCNTTTNNNGNGNGGG 
GNGGGTTGGGGGNAAAAAAANNNNNNNN I I I I I 1 1 I I I NNNNNNNNNCCCCCCCCCCCCN 
TT AAAA AA N A AAA A AAA A 
Sequence 2102 

CCCTTTCGAGCGGCCGCC CGGGCAGGCACTTA I I I I I I I I [ 1 1 1 1 1 I I I II I I I I CTTTT 
I I M I I I I I I I M I I I I M I II I M M I I I I I | M | I N | | I M | | | | i n M | i i | | l | 

TTTTNNTNAANAAAMAAAMNTTTTTNTTNMNTNGGGNCNNAAACTNTTTAACNAANN 
AAAAAAAAN NNTNNT AAAANGTTNTCNAAGNNGGGNNNNNCCNNANAGGNANAAAANGAA 
AANGNNTNA l nil I NTTNAAAAAAAAANNTTTNNTAAAANTGTTGNNGGNGGGGGTAGG 
TTAAAAAAAAAA 



Sequence 2103 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACTCTGTCTCTGTAGTCTCTCCATTCTAAAGTT 

TMTTTGGAAAGGTCTGTCCTGAATTATCAAAGTAAGAATACTCAGATTTCCATAAGCTC 

TTCTCACCTCCTCTTAGCCCAACTCAMCTCCATCAGACCTTCTCACCTTGATTTTGAGC 

TGGAATGTTTCAATGAGCAAAATAAATTAGACAAANGTTAAAAAAAAAAAAAAAAAAAAA 

NGTACCTCGGCCGCGACCACGCTAAGGG 

Sequence 2104 

CCCTTAGCGTGGTCGCGGCCGAGGTACCTGACCCCGGTCCTCAAGGAATCAAAGTTTAAG 

GAAACAGGTGTAATTACCCCAGAAGAGTTTGTGGCAGCTGGAGATCACCTAGTCCACCAC 

TGTCCAACATGGNANTGGGCTACAGGGGAAGAATTGAAAGTGAAGGCATACCTACCAACA 

GGCAMCAATTTTTGGTAACCAAAAATGTGCCGTGCTATAAGCGGTGCAAACAGATGGAA 

TATTCAGATGAATTGGAAGCTATCATTGAAGAAGATGATGGTGATGGCGGATGGGTAGAT 

ACATATCACAACACAGGTATTACAGGAATAACGGAAGCCCGTTAAAGAGATCACACTGGA 

AA 

Sequence 2105 

CCCTTAGC GTGGNCGCGGCCGAGGCAC I I I M I I I I I I I I I I I I I I I I I I NGCI MUM 

M M M M M M M M I I NTTTTTTNTTNTNCNTTTCN A I MM I ATNC I lllll l llll 

TTTTNAATGGCCAGGCTCCCAACATTTNAAAAAAACTGCNCCCCCCAATGGGTGAACAAA 

GTAAAGAGTAGTAACCTAAAGTTCACCTGAGTAAGCCACTGNGGAGCCTTAAGNGGNGAG 

GTCTTCCAATTTNANAGNGATGNGNCTTCAACTTGTATNATNATTTTANGCGGAAAAACA 

TAA 

Sequence 2106 

TCGGCGTCGCGACCCCCGAGGACCTCCTCTNCTCGCTCTGTGGCATACACTAGTCCTGGG 

CACTCAACCGCGGAGAGCCCCCGACCCCGGGGTAGCGGCTGAGCCTCAGCCGGGACCGGN 

ACCGGANCCCGCGCGGAGCATGTNATCCGGGCTGGGGGCAGCTGGNACAGTGGGCTGGGT 

TGGCCCTCCT 

Sequence 2107 

MTT TGTGTGTTTGTGTTTGTTTGGGG I M MGl M I I ATTTCTTACATTANAGTNCATA 

TTTTCTGGGATTTAMTTATAGGTGTATTTTCCTATTCTCTTGAGAAGNAGACTAAACAG 

TCTTTGCAATGATGACGGATGCACACAGAANAAAACATTAGAAGACATTACTTTCTATCC 

TCTCATGTGGTTGANCATTCTTACACGCCAAATGACTAAATTGGTGTTTCNTAGGAAGGA 

GCAGCTGTCACTTACAATGTGAAAATATTMTGTTTTAGGCCAGNGGGCCAAACCTTCAA 

GGGGGCCTNGTTGGNCAATTTATNGTCCCCTNATNCTTTTNAAAAATTTGAANGGTTCAN 

NNAMNTTGTNNAACCACCAACMAACNCCCCCTTTNANCATNGNGGGGGCAANGGGGTT 

GGGGACCTTNGNGACCTTAANTGNATTGGCCAATTACCGCCTTGGGGCCTTGNTNAANGC 

CTTCAACNCCNAMANAATTGGCCCACCMTTGGGGGGTCTTTTNGTNANACCTTNTTTG 

NACCAACCAACAACCTTAANCTTNGANCTTCACAAANGGCTTGGGGCNAGGGTANCAACA 

ACNTTCAAGGTTTATNCCGGCTTCNCMANGGGCNCTTNMCCTTNTTTGGNGAATTTGA 

AGGCTTAAAAACCTTGGCCATGGAAATCCCANCTTTAAGGGCCCAATCCCCTT 

Sequence 2108 

ACCACGCGTCCGAGCTCGCTCAGCACTCCCAGGTCCTTAGCACTCCCAGGTCGTAGCTGG 

CGCAGTCAGTAGGAACTGTAACTATGTCTCTGATGCACCACGTGTTTAGACACAGCACAG 

TCCM M I I CTGTTCCTACGGTGGMGTAGTTTCTCTTTGGGCATGCTGACAGCACTTTT 

TCATAGCCTCACCGATGAGCCCTTTCTGCGGGAGTGACTCCATGCCTGTATACAGAGTAT 

TTATACAGATGTTTTAGCATCTTCATATGCGGTGTTAACCCCTAGTTCTGTACAGCATAT 

TCTGTTCAAGTATTTTTTTACAAGCTTGTGCTGTAGGCACATGCCTTCTGCTGCAGAAGT 

GGACACCCGTGGCACACCCACCCCGCCCCAGTGGGGTGCCATGCCTTCCTGGGACATTGC 

CACTTCTGCCCTGGAACTCATGCAGGTACGTAGTAGCTGCTATTGCCAGA 

Sequence 2109 

NCGCCTATCACATAGTCAAACCCAGTCCCTGGCCACTGACAGGAGCTCTGTCAGCTTTCT 
TAATAACGTCCGGCTTAATCATATGATTTCACTTCTACTCTATCACCCTACTAACACTAG 

GCCTACTAACCAACACACTAACCTTATATCAATGATGACAGTGACATCGTACTGAGAAAG 

CACATACCAAGGCCACCACCACAATACTCCCGTCCAAAAAAAGCNCTTCCGGTACCGGAA 

TTMCMCATATTTTATCANTTCTCNAAAAAGTTATTCTTTTTNTTT 

TCTTGAGCANTTTTTANCCACATCTAAAGGCCCTCAGCNCCACAACAACACCTGGCTTTA 

AGGNGGGGCNCAATTTGNACCCANCCCANACAAGGGCGATTCANCCCCCCCCTGNAACCC 

CCCCATTAGTAAAGGTACNCCCCTNNCTTAAMACMCCCTTCTTGGNNATTTAACTTTG 

NCAATCCTTGGGGGGGTNAAACAAANTCAACCCTGGAAGGCTTCACCNACAAGNCCCTNC 

ATTAAAATTAAATNGACCCCGAAAAAANCAAAACCAAANTCCCAAAGGCNCCCTGCCTCA 

TTCACCAATACCTAACTTAGGGTTACCCTTAT 

Sequence 2110 

CGTCCGGGACCTTTATGTCTTGTNAAGATGTCTAGGCCTGGCCGGGCGCGGTGGCTCACA 

CCTGTAATCCCAGCACTTTGGGAGGCCGAGGCGGGTGGATCACGAGGGCAGGAGTTTGAG 

ACCAGCCTGATCAACATGGNGAAACCCCGTCTCTACTAAAAATACAAAAATTAGCCGGGC 

ATCGTGGCACATGCCTGTAATCCCAGCTACTCGGGAGGCCGAGGCAAGAGAATNGNTTGA 

ACCCAGGAGGTGGAAGTTGCAGNGAGCCAAAATCACGCCACTGCACTCCAGCCTGGGCAG 

CANAGTGAGACTCCGNCTNAAAAACAAAACAAAACAAAAAAGCAAAACCAGATGTCTAGG 

CCAATGATAATTATTTTTGATGCATTGTGGATTANGNTCTTTTGTTAACCCCACTGTCTT 

GGGG MTGATGCCTGCTGGGAAATTGAGTTTTTTGACTGAAACATGGAACCTTNCTGCTT 

TTTTTCTGGNTCCTATGAAGGTTTTGGAACATNTGAAAACACAAAAACTCACCTTGAAAT 

TTGAGCAGGTCGATGATGGCAAAAAATTATTT 

Sequence 2111 

GCGTCCGCTGATCTGCTTTGGGACGGCCTTTATATACTTCCTCCTTTCCAGGCCTTCCAC 

CACCAGTGACCACTATTCGACATCTGGCCCACTCTCAGTCATCCTCCTGCTTATGCTTGT 

CTNCTCCTTGAAGGCTTCCCACTGCATGTAGGACAAAGGTCAGATTTTGTAACAGGCCAG 

GCCTGGNCTTNATAGTCTGGNATCCACTAATTTATGGTCTNAGTCTNATCCCTTGGAGGA 

TTACCTCTGNCCTTNGNAAGCTCTGTGCTCCNG 

Sequence 2112 

TTCATAACAATTCTCCTACAAATCACCTTAATTCTGACATTCATGGCCACAGAACTAATT 

ATATNCTACATTCTATTTGAAACCACCCTTATCCCCACCCTAATTATCAGGTACTAAATG 

AGGCAGCCAAGCAMACGCCTTMCGCMGNACTTACTTTCTATTCTACACACTAACCGG 

CTCTCTACCCCTACTCATCATACTAAGCGCACACCTACAAACAACACAGGCTCACNTAAA 

CATCACAGCTANCTAACACTTCACGNGACCAAAAAAACNTAACAACCCACCTGGGNNCCC 

CACAGCANTTACCCTGGGCCTAGCNANTGGCATATATAAGCGCTTTTATGGACCAAAAAA 

TACCCCCTTNTTATNGGGGTCTACACNCTTATTGACTTCCCNCANAAGNCCCATGNTTTG 

AAAGCCCCNCATTTGCAAGGGGNTCAAATANGTCCTTTGCACGNCAGNTACCTTCCTTAA 

AMCTAGGGCNGGCTATGGGGCAATAAMTACCGGGNTTTAACTCCCCATTTCTTCAAAN 

ACCCCTTAMCGGGAMTTANCATAAGCCNTACCCCATTTCCCTCATTATTAATCCCTTA 

TGGGGGCNATAANTCATAACAAAGGCCNTAAACANTGCCTCCCGACAA 

Sequence 2113 

TTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTGCTACTTTTTAAACAATTTCAACTGCA 

GCTCTCTTTCACTMGTNAGATGGGNAMGCATGCCATTTCTGTTTNCTTGNGATTTTTA 

Ci I M i AGAAACACACATGCTTCCACTGCCATCTGACACTTCTCCACACGCTTTCATNTT 

GTAMCCTGAATTCTATTTNGAGTACCTATCAAATACTTTCTGGAGGNGGGGCACGCTCC 

GCTCGGTCATGATGCTGATCCACTTGGGAACATCAGTTCCTTTCCTCTTCACTCCAGCGT 

CATAGAGATCCCGAGCATCTTGGNNAATCAGTTCATAATCAATGACAGAGCCATCCTCTG 

CTCTTCTACCCTTTGCCA 

Sequence 2114 

GTGGGGGGGGGGGGGGNAANCTCGTTGGGACNCGCCCCNAATTNNNANNNAACCNNNGNN 
CNNNCTNGAGGGCGAAGGNATNNATAAGCTTGAGGGGGGGGGGCTGGANACCGANGNATC 
CACTAGTTCTAGAGNGGGCCGCCNACCGGGGTGGGNGCTNCAGCTTTTTGTTCCCTTTAG 
TGGAGGGGTTAATTGCGGCGNTTGGGCGNNAATNATGGGNCATTAGGTGGTTTCCTGGNG 
TGGAAATNGTNATCCGGGGGGGGAGGTNCANACAAAATAGCNGAGCCCGGNNAGNATAAN 
AGTGTAAAAGCC 
Sequence 2115 

GCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCTGTTGTGTCCCTTTCTCTTCAAAGAT 
CCTGAGCAAAACANNGATACGCTTTCCATTACCTNATGGGGGGGNCNNGGACCCAGCTCT 
TGGCATTGCTANGGNNGGCTANNATCNNGCCACNTGAGGNTGTGGANNN 
Sequence 2116 

GTMCTCACATTAATTTGNGTATGCGCTCACTGCCCCGCTTTCCAGTCGGGAAAACNCTG 

GTCGTGCCAGCCTGCATTAATGAAATCGGCNCAACCGCCGCCGGNNGAGAGGGCCGGNTT 

TGCGTATATGGGCAGCTCTTCCGCTTTCCTCGGNTCACTTGACTCGCTTGCGCTCGGGTN 

CGTTTCGGGCTTGCGGGCGAAGGCGNGTATTCAGGCTCACTCAAAGGGCGGGTNAATACG 

GGTTATTCCAACAAGAATTCAGGGGGGATAANCGCANGNAAAGAACCATTGTTGAAGCCA 

AANAGGGCCCAAGCAAAAA 

Sequence 2117 

ACCTGTCGTGCCAGCTGCCATTAAATGAATCGGGCCAAACNCGCCGGGGNAGAGGGCGGT 

TTGCGGTATTGGGGNCGCCTCTTCCCGCTTTNCTCGCTCACTGACTCGCTGCGCCTCGGT 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnCTTCACTTCAAAGGGCGGGTAATTACCGGTT 

TATCCACCAGNAATCAGGGGGNATAACCGCAAGGGAAAGAACATTGTGAAGCCAAAAGGG 

CCCANCAAAAAGGCCANNGAACCCGTAAAAAAGGGCCGCNGTTTGCTTGGCCG 

Sequence 2118 

CCGCGGTGGCGGCCGCCACGCTGGTTTTGCATCTTCAGGAGACGCTCGTAGCCCTCGCGC 

TTNTCCTCGGCCAGTTCGCGGAAGAAGTGGCTCACGCCTTCCAGAGCCACATCATCGCGG 

NCGAAATAGAAGCCCANAGAGAGGTAGGTGTAGGAGGCCTGCAGGTACAACTTGTTGGCC 

TACATAAAACACCTAGATGGTAACAACGAGGCAGCCCTGGAATGCTTACGGCAAGCTGAA 

GAGTTAATCCAGCAAGAACATGCTGACCAAGCAGAAATCAGAAGTCTAGTCACTTGGGGA 

AACTACGCCTGGGTCTACTATCACTTGGGCAGACTCTCAGATGCTCAGATTTATGTAGAT 

AAGGTGAAACAAACCTGCAAGAAATTTTCAAATCCATACAGTATTGAGTATTCTGAACTT 
G 

Sequence 2119 

GGAGCTCCCCGCGGTGGCGGCCGAGGTACCTAACCTACCTTTAAGACTGGGATAACTATT 

GGAAACAATAGCTAATACCGGATATAGTTATTTATCGCATGATGAGTAATAGAAAGGAGC 

TTCACAGCTTCACTTAAAAATGGGGGTGCGGAACATTAGTTAGTTGGTAGGGGTAATGGC 

CTACCAAGACGATGATGTTTAGCCGGGCCGAGAGGCTGTACCTGCCCG 

Sequence 2120 

TGCCCGGGCAGGTAC I I I I I I I I I II I I I I I I I I I I I I l GGGATGGAGTCTTGCTCTGTT 

GCCCAGGCTGGAGTGCAGGGGCGCAATCTTGGCTCACTGCAACCTNTTCCTTCCAGGTTC 

ACGCCATTCTCCTGCCTCANCCTNCCAAGTAGCTGGGACTACAGGTGCCAGCCACCACGC 

CTGGGCTMTTTTTTGTATTTTTAGTAGAGACGGGATTTCACTTGTGTTAGCCAGGGAT^ 

GTCTCAAACTCCAGACTCTCGTGATCCGCCCACCTTTGGCCTCCCAAAGTGCTGGGGAAT 

TACANGGCATGAGCNCACTTGCGCCCNGGCCAAAAMACMACTTTTTTTAAAAAAATGG 

TACCTNGGCTCGCTCTAAGAACTAGGGGGGATCCCCCGGNGCNTGCAGGGAATTCCGATA 

TCAAAGCNTTATCGATTACCCGGTCTGAACCCTTNGAGGGGGGGGGGGCCCGGGTACCCC 
AGCI I I I 1 IGGT 
Sequence 2121 

GGTACCTTGTCTGGAGAATGCAGTGACAGCACCGGCCCATGCTTGAGAACCCANGCGGCT 

GTGCAGAGGGCAGCCGACCACTATAGCCAGCAGATGGCCCAGCAACTGAGGCTCCCCACA 

GACACGCTCCAGGAGCTGCTGGACGTGCATGCAGCCTGTGAGAGGGAAGCCATTGCAGTC 

TTCATGGAGCACTCCTTCAAGGATGAAAACCATGAATTCCAGAAGAAGCTTGTGGACACC 

ATAGAGAAAAAGAAGGGAGACTTTGTGCTGCANAATGAAGAGGCATCTGCCAAATATTGC 

CAGGCTGAGCTTAAGCGGCTTTCANAGCACCTGACAGAAAGCAT 
Sequence 2122 

GGCNAATTGGAGCTCCCCGCGGTGGCGGCCGCCNGGCCAGGTACCCTTGGAAGATGGGAA 
AGGTGAGGGAAATATNNGAAGCAGGGTCAGAACATCCACTAAGAACATAGCACCTNAGTA 
NAGCTTACATTATATGAGCCAGGGTAGAGTTANTACTGAAT 
Sequence 2123 

TACTA TAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACI M I I I I I I I I I I l l 

TTTTTGCAACTCGAGACGTTAGTTTACTCCAGAGTCATGGAGACAGTGAATGGAGTGTGT 

GTGTGTGTGTGCGTGTGCGTGAGTGAGAGAGAGAGAGAGGAAGAAGGGAGGGAGGGATGG 

TTACAGGAGTGTGGGAAAAGTCAACAGGAAAAGGAGGGAAGGTTCGCTGTNAACTGACAG 

AAACTGTGACATCACAGGCATGTGCGGAGTCCCCGCCCCCCCAGGCACTGTGCCAAGGCC 

AGTACCTGCCCGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCGGNCTGCAGGGAATTCT 

GATATTTAAAGCTTATNNATACCGACAACCTCNGAGnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnCGCTTNGCCGTANTCATNGCCAATAGCNTGTTT 

NCTGTGTGAAAANTNGTTTATTCCCGCTCACAATTTNCACACAACATACNAACCCGGGNA 

GCATAAAAGTGTAAAAG 

Sequence 2124 

GCGMTTGGATCTCNCCGCGGTGGCGGCCGAGGTACCTTTTTAAATCTAGCCCAGTATAA 

ACATTAGCCTGCTTAATATTTAGACATTTATAGGTAGAATTCTGAGCACTCAACTCATGT 

TTGGCATTTTAAAGTAAAAACAAGTGTGACTTCGAGGACCAAAGAAATTGTCAGCTATAC 

ATTTATCTTTATGMCTCATTTATATTCCTTTTTMTGACTCGTTGTTCTM 

AGAAGTGTTCTTATAAAGGTCTAATGTATCCACAGGCTGTTGTCTTATTAGTAAATGCAA 

AGTMTGACTTTGTCTGTTTTACTCTAGTCTTTAGTACTGGTTGTCAGGATTCAGCCGAA 

TGGCTTGCCTCAGAGGGTCAATGGCGTTCTGAGATGGTGGCCAGTTGTCCAGACGGTAGC 

TTGCTATTGCCAGTCCAGAGGTGAATTCTGGGTTCTTTGGCT 

Sequence 2125 

GCGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGGTGAAAGTTCTTGATGAGGG 

TCTCAATGGCCCTCTCCACATCACTGAATTCCTGAGCATCCTCTGCGTTGGCTGACCGAC 

ACTGTGCCATGGTGCCCACTGTGTCTGGTCCTTTGGTGAGAGTTCTGTTGTCCTATAGCT 

GGCCCCAGAGGAGCTGATGGCTCATGATCTGTTGGCAGCCGCTGAGACAAGACAGGAGGC 

CCCGCGTACCTGCCCG 

Sequence 2126 

CCGCGGTGGCGGCCGAGGTACCCGGTGCGCATAAGAGGAAGATTTCTGAAGAGTGCAGCT 

GCCTGAACCNANCCCTGCCGAACAGNTGANAATTGCACTGCANCCATGANTGAGAACAAT 

TAGAATNCCTTGGNGGGCAGCCTACGGNANCTNAAATGCCATTTAACCTGGAACTTGATG 

Sequence 2127 

AATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCACGANGTCATATGTTATTTACAATT 

GGGTTTGTGTGGGATGGGAAGTAGGGCGGATGAGCCAGTACTTNTGCAATGAAGATGCAA 

TAGTCNTTGTCCTNTCCCACTGTCNCCTNTTTCCTCACCCNATGGCAGCTTACATGACCT 

ATTCCCAAAGGGTCCACCGAGNCCTGAACTCAGCTTCATCACCAACATTCCTCGCCTTNA 

GNAAGAATTCAACACTGTATAAGGGAGTNGAGGCANAGACTTGGGNCAGGGNGAGGGTGG 

NNAACACNNAAGCACACTNTCTTGTCATCAACCCAAGTTCAGAGACAAGGCCTCCNCANA 

TGNGGAAGATGATGCCCTTAGACACCNCAACCTGNTTGTCNNCNTTCNTNGAAGCNCAAA 

GCAGCCTGNATCTCAACTGAAGAGAAGGGG 

Sequence 2128 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGGGGAATGGAA 

TGGAATGGAATGCAATGGAATGGAATCTTCCGGAATGGAATGGAATGGAATGGAACGGAA 

TGGAACGGAATGGAACGGAATGGAATGGAATGGAATGGAATGCAATGGAATGGAATCTTC 

CGGAATGGAATGGAATGGAATGGAATCAACCCGAGTGCAATGGAATGGAGTGGAATGGAA 

TGGAATGGAATGGAACAACCCGAATGGAATCGAATGTAATGGAGTTGAATAGAATCAATC 

CGAATGTAATGGAATGGAATGGAACCGGAATGGAATTGAATGGAATGGAATGGAATGCAA 
TGGAATGGAATCAACCCGAGTGCAATGGAATGGAGAGGAATGGAATGGAATGGAAGGGAG 
ACTACCCGAATGGAATGGAATGTAATNGAGTGTAAGGGAATTGAATAGAATCAATCCCAA 
TGTAATGGAATGGAATGGAATGGAATGCAATGGAATGGAATCTTTCCGAATGGAATTG 
Sequence 2129 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACNCGGGGNACTGAAANTCCA 

CACGACANAATAGCCAGATCTCAGAGGAGCCTGGCTAAGCAAAACCCTGCAGAACGGCTG 

CCTAATTTACAGCACCCATGAGGAAAGGCCACTTANGGATGCAGCAAGAAGGAGCCATCT 

GCAATCCAGGAAGAAATTCCTTGCCAGGAACCAAATTGGTTGTCACCTTCATCTAGGACT 

TCTAGCCTCGAGAACTTACAAATGGTGATGATCATCAGGTCAAGGATAGTC 

Sequence 2130 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGAACTGAAAATCCAC 

AAGACAGAATAGCCAGATCTCAGAGGAGCCTGGCTAAGCAAAACCCTGCAGAACGGCTGC 

CTAATTTACAGCANCCATGAGGAAAGGCCACTTAAGGATGCAGCAAGAAGGAGCCATCTG 

CAATCCAGGAAGAAATTCCTTGCCAGGAACCAAATTGGTTGTCACCTTCATCTAGGACTT 

CTAGCCTCGAGAACTTACAAATGGTGATGATCATCAGGTCAAGGATAGTCTGGAGCAATT 

GAGATGTCACTTTACATGGGGAGTTATCCATTGATGACGATGAAATGCC 

Sequence 2131 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTATCATTAATGTAT 

TATATACACTGATACTTTAAMCTTGTGTGGAAAAAACTAACTTATAATTTTGTATCACA 

CACCCTGGATATGTGTTCTGTTTCTMGCGACATTTGTGAGAGATTATTGTAAAATGAGA 

GCGAGCAMTAAAACTTMTTTMTCTTTGCAGATACATACTTATGGGAMTTTGAACAA 

ATGAGTGAAACTCTGTGTTTTTAGTAGGCTGTGATAAACATTTCCGGGCACTTTGCAAAA 

GGACTTTC I I I I 1 1 GCCGGGNGCTTTAATNANTTAATAAAAATTTTTTTAAAAGTTAAAA 

A AAAT NGTGGNAAANAAAAC I I I II I I I I I II NTTTTTAAAAANNAGGNTTNNANNAAAC 

NTTTTNTNTTTNGCCGNANNAMNCCCCCCCCCCCGGTTTNCNGGGGAAANAAAAAAAAT 

NNGGGCCCNC CNTTTTN TTTTTTTCNGGGGGGGGGGGGGGGGGGGGN I 1 1 M 1 1 I I M GN 

GMM GNGNTGTTTTT TNNCCCCCCCCCCCCCTCTTTTTAANANMAAMATTMTTGGG 

GGNNNI I I I I II I I! INNATTNNNNACCCCCNCNAATTNGGTNTTTTTTANNTNAANANC 

CCCCCNGGCCNTTNNTNATANATGCCCCCCNCCCCCCC 

Sequence 2132 

GAGGTGGTTACATTCGTCGAAGGACACCAGCTGCGGAATTTGCGGNTTTGGCAGATTGAA 

ATCATGGCNGGTCCAGAAAGTGATGCGCAATACCAGTTCACTGGTATTAAAAAATATTTC 

AACTCTTATACTCTCACAGGTAGAATGAACTGTGTACAGAATCCATTCTCATTCTTTACT 

TGCTACATTATGACCATGAGGAGGGCANAGTAGAGGTGAACTCTCTGTATACTTGCTGAA 

AGTCTTCTTGTACCT 

Sequence 2133 

GCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTTCAGTAAGAGATGGG 
GTTTCACTATGTTGGTCACTATGGTCTTGATCTCTTGATCTCGTGATCTACCCACCTTGG 
GTTCCCAAAATGTTGGGATTACAGGTGTGAGCCACTGCACCAGGCAAACTGCGATCTTTT 
AGTGGTGCCTCTTCTCTCTTTTGACTTAAGGATGTTGTCCCTTAAGGAAACCTGGAGGCT 
ACTACTGTGATACACTACTTGAGAGATGGATTGTTGCTCTTTCTTCTACAGTCTTACANG 
GAGT AGATT ATAMGACGGAAGATGTTACCATTTGCNTTAATTGTTGGAAGCTGANAGCT 
TT TAAl l l l iGGTTNCMCTGTTTTTGNGGGANNTCCCGNMAAATTTTNNNNNAATTTT 
TNl l l l I I l NGGGAAMGGGGGCNCNTTTTTAAAAACAGGGGGGGGNNGAAAAAATTTGN 
GGGNGGGGGGCCCCNNCCMAAAMAAMAAAAAGGGGGTTTmNTTNCCNGNNNCCGG 
GGGGGGGGNNGGGNCNNTTTTTAAANCC I I I I I I I I I GNTTNCCNCCNAAAAAANNAACC 
CCTNNNGGGGNG II 1 1 I I II III 
Sequence 2134 

CCGGGCAGGTACAAGAGATAGAAAGACCAGTCCTTGCTGAAAGACAAGTCTGAATGCTCC 

ACTTTTTCAATTCTCTCTCCATTCTTCAGTAAGTCAACTTCAATGTCGGATGGATGAAAC 

CCAGACACATAGCMTTCAGGAAATTTGACTTTCCATTCTNTGCTGGATGACGTGAGTAA 
ACCTGAATCTTTGGAGTACAGGACAATNAAGACTACTCCTATNTGCGGAACAACTAGCTT 

TCTATTTAGTTCTAGAATGTTGAAACTGACCGATTGGCTGACATAAMGTCACATTTTAC 

AAAAAAGTGTCTCCAMTGCTTTGACTAGGGGAAAAACCCCTTTTCAATTAGAGGGAGCC 

ATTNTGCMCA MTTTCC CACAAATMTTCGCTTATTCCMGGGGCMNGGCACCATTTG 

ATATNGGGAMTTTTTTTGTTTTTNGNGCCAAAATTTAAGGGNA 

Sequence 2135

CGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCCCAAGACTCAGCACTAGTCTGATG 

ACCTGCTMTTCACTGACAGCATAGGGCTGTCTGTTGTTTTTGCGCAAGTTGGTGTGAAC 

MAGTTCACAATATCTGGTCGAATAGGAGCCTTGAATACAGCAGGCAAAGTGACATTTTT 

GCCAGATGACTCCCCCTTTTCGGAGTACCTTGTTCAAAAAACACCGCTGAGTCACTTCCA 

GGTGCTGTTMGTTTTCTTTAGTGAAGATGTCTATACCAGAGGGAGCATAGTTCCAGATG 

ATTTCCTCAGCGGCAATGTAGTAGTGTCTMCATGCTTCCCACGGATATTATCCTTTTGA 

TGAAGACTTTGTTACACTCCTGGACCTGGAAAAAGGCTTGCAAACCGGC 

Sequence 2136

TNGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCTGAATGTAGAAA 

CAATGAGGATGGACCTGGTTTAATAATGGAAGAACAGCACAAGTGTTCTTCGAAGAGCCT 

TGAACATAAAACACAGACACCTCCTGTGGAGGAGAATGTAACTCAAAAAATTAGTGCCTG 

GNAAATTTGGGCTGATGAGCTTTCTGGATCCTMGCCACCTACCGAATTIsriTGGANGGTT 

GGCTGNNGGTGTGGGGAMCACAAGTC7TTTCCAATTTTTACAAAACGAACCAATTGACC 

CCAGGGACTTNTTTGGTTTATTGGGTGGGG 

Sequence 2137 

CCGCGGTGGCGGCCGNNGTNCNCGGNGCCNGAAGAGGAAGATTTCTGAAGAGTGCNGCTG 

TCTGAACCGAGCCCTGCCNAACAGCTGANGAATTGNACTGCAACCATGACTGAGAACANT 

AAGAANTCCTTGGAGAGCACCCTACGGCMCTAAAATGCCATTTCACCTTGNAACTTGAT 

GGAGGGGAGAAAACT 

Sequence 2138 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGNCNGGCCAGGTACCANATGAANNG 
NNAAGACAAGGCCATNCNCCACTTTATAGAGGGNGTNAAANTAAACCANAGNTCCNGGGA 
GAAAGAAANG 
Sequence 2139

CCGGGCAGGTAC I I II I I I 1 1 1! I I I I I I I I 1 1 I GTTTNATCATATTTAGGTTATTTATT 

AATGAAAGGTATm"GANATTTTCAGGAATACAAATTTTGCACCCTGATGACCTCAAATGC 

GTGCA ACAAGATGTTTAATACANGAAAATAACACAAAAACTGTTGTTACAGTGGTTAGAA 

i 1 II I AACTTTAAMAACCATGAATTTGTATTGNTTTAATTGCACAATAAAATAATGTTG 

ATATATACTTMGCTTAMTTAATTNCAACANGGNAAACATTTTCCAACCCAGAGGTGTG 

GCCTGATGTTGGGGTTCMNTCTGGACTTNTATTTTTTGGGTANCACACTCMCTTTTGA 

ATTGTNTTAAGGGNTTATTTCNANCCATTCTAACTCTANGAAAAAATNTTNAAATTNCGT 

TTCCCAAAGNCATTANCCGNGGMTTTTTTGTTGATTTTCTNCMGAAAMATTT^ 

GAGGTTAAMGGAAGGNGTNTMTATCACTTMTTATCCCMCATTTTCTAAAGGGGGGG 

GGAAAAC 

Sequence 2140 

GCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTTATCAGCTAATTGGGCTCCTTT 

GAAATGCTGAGGATCTGCTTACGCAGGTTAACTCTTTGAGGAAGGGGGTNGGGTAAGTAG 

CCCTTMTGTCTTGTAAATCAAGGGGCCAGATGGCGTTCGTCAGGCTTTCCCAGCTAAGG 

GAGAGTCTACTCATATGGGAAACAAGCCTAGGTAATTAAGGAGACAAAAAGGGAAAATTT 

AAAAATAGGGTTAGTAAAAACAAGGTTAAAATAGGGTTAGTAAAAAAACAAGGTCAGGCA 

TTACAAAAGGATTCTCCTGTCTCAGCCTCCACTGGGATTACAGGCTCGCACCACCAAGCC 

CAGTTMCTTTTGTATTTTTAGTACCTGCCCGGGCGGC 

Sequence 2141 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTA Cl H I I I I I I I 1 1 I I II I I I I I I 
NTNAGCAAGCACGTGCACTTTATTGAATGACACTGTAAACAGGTGTGTGGGTATAAACTG 
CTGTATCTAGGGGCAGGACCAAGGGG 
Sequence 2142

TGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTCTTACCCTTGTCCCTCTCCTAA 

AGGGAGCACAAGGAAACTGAAGAGACTGAAAAAGAAGAGAGTTTGTAGCTGCAAAAGAAT 

AGGGATAGCAAGGAAACCCAGAACTGCATTCCCCTAAGTGGGGCCATCCCATGTGATTGA 

ATTGTCCATAGCTTGCCTATGGTGAGAAATGTGCATGCTCCGTGAGCTGGTCTCTTGAAA 

CAGGACTTATGCTTCCTCTATATTCTGGTTAMTTTTCCAAACACATAAGTTCACTGAGC 

ACAGATTTCTTATCCAGAGACMGTAGAATCTAACCGCAGACTGTTGGCAGAGTTTCCAG 

GCACTTAGCCATGTTCCCTTCCTGACTCAAATCCCCAAAGGCCTTCACTCTCACTGAGAA 

TCACACTACTGTCCCATAGATAAGGCAGGCATTGAAGCACCTGTCGTGATCCTCTAGGGG 

GGAGMTGAAAGGTTATTTTCCTGCATTGCATCATCATAGCTTTTAATA 

Sequence 2143 

AGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGNGGNGCNNNGCTCTACAGAATAGAGGCI 
ATNCTTTAGCTTAAGCCTGTCTGCTGACCAGAGAATGGAATTNTGCGTGGNCTCANGGAA 
CAAAAGGAAACTAGGCAGGGAAGGGGAAGAAAAGTGC 
Sequence 2144 

GGCN AATTGGAGCTCCCCGCGGTGGCGGCCGNGGNACTTN l I I I II I I I 1 1 I I I M I 1 1 1 

TTTTATMCTGATMTTCTTTATTACTAMGGTTTATTTACATAGNGTTTANNGCNTAAT 

AAAAAATAAATTACAATACAAAAGTGCTNTTTAGGAAGGAGACACTAAACAACAGGCCCA 

TNTTACCCCTTGCTNTTTNGCCAAGACATAAGCTACAMTTTTGCCGGAAAAACTGNCCA 

TACATTTTTAACTACTTCTTCATTATTCTTATGGACCATCATCCAGGACATNTGTTTGAA 

GAAATATCCAGTTATMTATTTTCAAAGGNTNGAATTGNGAAGAAAAAAATATAAAATGT 

GATTAMGGATNTNTAGCCTTCAGATGTAATTTACNGGGTTTAAAAATTGGCCCCTTTAA 

AA CTTT TGCTTTTTTAGACNAGNNNTAAAAAGGCACMTGGAGACNCCATATTGTTA^^"G 

GATTTTCANAACNGGGGGNTCTTNTGNGNAATNCNTACCANTTTTTTT 

Sequence 2145

AGGCACNAGANGANTTTNC I I I I I I M I I I I I I I I I I I NCTGNNATACAAAGAGCAGATT 

TTTATTGAACTTGGGCNATMCTATATTNCCATACMTNTAAATATTCATGAATAGTTTC 

CCAAGTCTGGAGCGACCACATAGGGAGAAAATGTAAATGTCTCAATTTTTGGTTNCACNA 

AAATGI I I I I I I I ATATCAAAATTTGNTTNTNAAAGCTTGGNGGATTAGCTTAAAAAANA 

AAAAAMGTTTCCTTGAAATCNGGGAAACAAGACATTTTAAATGMTCAANCAAAATTTC 

AAATTAAMAAATTATGAAAATATTATCCTCATTAGTTTCATTTAGTGCCCATGGAAANT 

TAATTATTCTCTCTGCNTTGATCTTTGGGGGGACMGTTTCAATGGAAAGCCTGTCAGNT 

TAANTTCATTAAAAGTTT 

Sequence 2146

GCTCCCCGCGGTGGCGGCCGAGGTACAAAGGAATAAAAATGTTACAACCTAAAAAAACAA 

AACACCGTAAAACATTTAGAGTAAAACATGACAAACGTGCTGCTTTTAGAGGAAAAGAAG 

TTTCATTTGMGMTTTGGACTTCAAGCTACAACATCTCAATGAATAAGTGCAAGACAAA 

TTGAATCATGCAAGGATTGCCGATTACAAGAAGAATGGGTCGTGAAGGACACGTTAACAT 

AAGAATCTTCCCTCATCTATCTCTTACTAAAAAGCCTATTGGAGTTAGAATGGGATCTGG 

TAMGGTTCAGCTGAAAMTGAGTAGCTGTTGTTAMGAAMTACAGTTATGTTTGAAGT 

TGGTGGAGTTAAAGAAGAAATAGCTTCGTGATGCACTTCGTTTAGGTGGACACAAACTAC 

CNTGTTAAATGGMNGMTAATTNNGCANGGGAGGAATAATGGAATTCAAAGATTTAAAG 

AAAAAMCTACTGAGCATTTTAGMCCGTTATTAANTTGMTATTAMTCAGAGCTTTTT 

ACCCCTTCGTTTTAGAAATTAAACCTCAACCAACTNGGATCATNCCTCNAAAATTCAAAC 

MGGTTAGAAAANGGTTTTNCCCGGAATTTTTACNNCAATTTAGCCACCCAAAAAAAAAA 

Sequence 2147 

TCCCCGCGGGGCGGCCGAGGTACATNTGCCTGNCTNCCTNCTGTCCTTCCTTTTTATTAT 

AAGGATACATTTATAGNACCCCATAGAAGGAAAAGATAAATTTCATAGGCTGNTAAAAGA 

GGCTAGGCCTAAGTTATAATGCCTCCTCCTCACAGNCCAATTTNCCCAAGGGGCNTTANC 
ACCAGAGCAGNTTTTCTAGCTTGNGGACAATNCCNNCAGGCTTGAGTGATAATGNCCCNG 

TNGCGGTAGCTCTCCACTTGNTNAAGGACCAAANACACCTTAGCAG 

Sequence 2148 

GGGCGA ATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTA C I 1 1 1 1 M i I II I 1 1 I 

TTTTTT TGAC TT TATA TATAATACTTTATTTTAT^ 

TTTCACTT TTMTTT TATGATATGTGTGACATATTTTTMTTTTAT 

TATCATAATTTTTTTAMGTTTGTATCTTTCATTAATAAGGAGGNTCCTTATTAATGGAT 

TTTTT1TCCTGTAGCTTCTGAGMCACATTTTTAGATACCCGGCTTCTAGTTATACCTGA 

AGCTCCACAGTGTAGACATGTTTTGGCCMCCTTGTTTTATCGGTGTATGAAATTTGTGC 

TMTAGGATGGATCCMTTGTCATTACATTAGAAAACTAAATGGGAAACATTTCTT 

Sequence 2149 

TCCTATAGGGCGA ATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTA Ci I 1 I I M I 
I I I I U I 1 11 I I I I GACTTTATATATAATACTTTATTTTATACAATTCAGTATATCATGT 
TMCATATTTCACT TTTM TTTTATGATATGTGTGACATAT^ 

ATTATACTATCATMTTTTTTTAMGTTTGTATCTTTCATTMTMGGAGGTTCCTTA^ 

AATGGAI I I I I I I ICCTGTAGCTTCTGAGAACACATTTTTAGATACCCGGCTTCTAGTTA 

TACCTGMGCTCCACAGTGTANACATGTTTTGGCCMCCTTGTTTTATCGGNGTATGAAA 

TTTG TGCTAATAGGATGGATCCMTTGTCATTACATTAGAMCTMTNGGGMCATTTTT 

TTTTACATTNGGGGCAACNTACCAAAATTTT 

Sequence 2150

CCGGGCAGGTACACGGGCAAAGGGGCTTGAGAAGGCCCGGNGGCGAAGCCGAAGAGAAGC 

AACTGTGCCCCGGAGAAGAGAAGCTCGCCCATTCCAGACTGGGAACCAGCTTTCAGTGAA 

GATGGCAGGGCCAGAACTGTTGCTCGACTCCAACATCCGCCTCTGGGTGGTCTTACCCAT 

CGTTATCATCACTTTCTTCGTAGGCATGATCCGCCACTACGTGTCCATCCTGCTGCAGAG 

CGACAAGAAGCTCACCCAGGAACAAGTATCTGACAGTCAAGTCCTAATTCGAAGCAGAGT 

CCTCAGGGAMATGGAAAATACATTCCCAAACAGTCTTTCTTGACACGMMTATTATTT 

CMCAACCCAGAGGATGGATTTTTCAAAAAAACTAAACGGAAGGTAGTGCCACCTTCTCC 

TATGACTGATCCTACTATGTTGACAGACATGATGAAAGGGAATGTAACAAATGTCCTCCC 

TATGATTCTTATTGGTGGATGGATCAACATGACATTCTCAGGCTTTGTCACAACCAAGGT 

CCATTTCCACTGACCCTTCGTTTTAAGC 

Sequence 2151 

CCGGGCAGGTACGCGGGGNANTGCNANANACNCAAANCNNGNTANTACANTGCATCAAAC 

ATGTTCAAGATTNNCCAATTGACGGGATTGGATTNAMGATATNCCACCACTTTTAGCAA 

GATGGNGAAGTGCTAAATNACACAATTAATCAACTGGCTGAGTTAGCTAAAGATGCATAT 

GmTTATAGGTCCANACGCMGANGTTTCTTNCTTGGGACACCTACTGCANCTmTTTA 

AAAAAACCTTTTATTATGGTAAGAAAACCTAAA 

Sequence 2152 

AGGTACAGCCTCTCGGCCCGGCTAAACATCATCGTCTTGGNNGGCCATTACCCTACCAAC 

TMCTAATGTTCCGCACCCCCATTTTTMGTGMGCTGTGMGCTCCTTTCTATTACTCA 

TCATGCGATAMTMCTATATCCGGTATTAGCTATTGTTTCCAATAGTTATCCCAGTCTT 

AAAGGTAGGTTAGGTACCTGCCCG 

Sequence 2153 

AGGTACACTTCTAGCACCTAGCAGAGAGAGGCTTCACTACATNATGCTTCCTGACATCTC 

TCCCTTTGAAGAGCAGTCAGACTCCTGCTTTGCTCTTCAGACTTMTTTGGGGGTTTAAC 

AGGTGAGGTTGCTGGGGGAACTCTTTTACAACATCTCTCTGAAAGAATCCGGGCTGCCAG 

TTTCATTTGGTTTGGGTGTCAGTAGCATGATGGAAAGACAAAAAAACACAACTTGACATC 

TGCAGAAATGGGTTCAMTTTTACCTGCAACTCACCMTTCTGTGGCCTTGGTTCAGCAA 

TTAMCTCCCTAAMTTCAGTTTTTTCTTTGTAAAATGGGGTTATGAACAGTACCAAATG 

AAGTGTGAAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCA 

AGGGAGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAA 

MTGGAGCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAA 
ATGCAACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCA 

GCATCAAGCTGGAATGGGG 

Sequence 2154 

GNCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGCCAGGTACTGTGCACGTNGGTTAN 
AGGCTGCGTGGCAGGANNGNGNTCAGATTNTCCCCTGCACNGNAATTGGGCTTTNAGGGG 
GAAATGGTGGGGCCATCT 
Sequence 2155 

GAGCTCCCCGCCGTGGCGGCCGCCCGGGCAAGGTACANGTGTGTGTGGGTGAAATGGAGA 

TTTGGAATTGAACTCTCTGCCTGTAAATGTTCCCCAAATAATTGTTGTGTGTATGATACC 

GTGTATAATAAAAGTATTCTTGTTAGAATCTGAAAAAAAAAAAATGTAAAAAAANNNTAA 

AATAAAGGTTCCTTGGCCCGCTNTANAACTAGTTGGATCCCCCGGG 

Sequence 2156 

CGGCCGCCCGGGCANGGTACGCGGGGACACAAGACATCATCTTGAAGGAAGGATGGCTTT 

GGCCAGACCAAGACCGAGACTTGGAGACCTGATTGAGATTTCTCGCTTTGGCTATGCACA 

CTGGGCCATCTACGTGGGAGATGGCTATGTGGTCCATCTGGCTCCGGCAAGTCACTGGTG 

CAGTCACGACAGTAGGTGTGGCAGCAGGCCTGCTGGCTGCCGCAAGCCTTGTGGGGATCC 

TGCTGGCCAGAAGCAAGCGGGAAAGGCAATAAATCCAAGAAATTGTCCCAACAACCACCA 

ATTNT TACG GAGGAATATTATTTAGCCAGCAAGGAGTGGAGTTTTGGTTTAC7TGATTTT 

ACT GNTT TTNTGGTTCATGGAATCTTTATTTTTAATTGGAGTTAAAAANCNCAGGNAAAT 

GTNTTTTGGAAATTGCACCTTNTTATNGAATTNTTTTTAAAGACACAANTTNGGGCTNTT 

CCNAAAAAAAAAAAAA 

Sequence 2157 

CCGGGCAGGTACCATTTGGAGTTTAATTGCTTCGCTCCGATGAATGAATTCCTGGCCAAT 

GCACCAAAATGATACGGCTCCGATGACTGGAGGAACACCAGGGTCCTTGGTCTCGCACCA 

GTTTAGATAAAATGACACAGACACACATGTAATGGTTTTAAGGAGTGGAGAGTTTATTAG 

GCAAGAAGGAAGGAAGAAGAAAACAGCTCCCCCATACAGAGACAGAGGGAGGGGGGATTA 

GAACAAACAGAAACTTCCCCGCGTACCT 

Sequence 2158 

CCGGGCAGGTACAGCGTCATATAGGCTTTGCCTTTMTGATCTCTTACGGTTAGAAAACA 

CAATAAAAACAAACTGTTCGGCTACTGGACAGGTTGTATATTACCAGATCATCACTAGCC 

AGATGTNACG TTGG CACATTTGAAGTCCTTTATTGAMTTCATAAATAAAAGAATTGTTC 

TTTCTTTGTGGTTTTAATAAGAGTTCAAGGAATTGNTCAGAGTCTTGTAAMTGTTATTT 

T MTM TCCCTTTTAAATTTTTTATCNTGTTGCTGTTTACCCTCTNTGAAATATGNAT^ 

TATTTTAGATTTGCCTMTGNCCANTTCATTTCAAGGNAAAATTGCCCAAAGAGGGGTAT 

TTCC CCTT NGGG GGAAA ANNGGGGGNCCNTCTTTACCAAGTGGTAAAA NI 1 1 I I I I I CCC 

TCCCTTTTMCCCTTTTTGCTTTAATCATTCCMN^ 

AATNCCCCTTTGGTTGAAGNGGCAANGTTTTGTTTGGAACCTTGGAAGTTT 
Sequence 2159 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCCATCCTACCCGG 

CCTGGGCTGACCCATGGGGAAGGCTGGCTMTTTCAGTGCTTCTGCTTGGTTGTTCAGGG 

CCATT TCAG GTTTGGGTGTTTTCTGGGGATGTTAACATGGGATTCAGGCTCAACTCACAA 

GAMCTTTTCCATCTCATGATGGATGCTGTTGGGCATGTCCAATGTATGACTTCATGAGT 

TACACAGATGCTMTTCGTAGGGGCACTTGGAATCACATGGTTGTTTTGTGTCCCATGGT 

CMGCATTCTATCTTATCAGGGCCTACAGTAACATGCCAAAAGTTGCTTCCAACATATTT 

CTCTGCTTTGGATG 

Sequence 2160 

AGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATCCCCAGTCGTGGCCCTCTGGACAA 
GTGGCGGGCCCTGCACTCATGAGGGCTTCCAATGTGCTGCCCCCCTCTTAATACTCATTG 
TCMTTTGAGAAAMGGACATATGAGTTTTTGCATTTATTMTGAAACTTCCTTTGAAAA 
ACTGCTTT GMTTA TGATCTCTGATTCATTGTCCATTTTACTACCAAATATTAACTAAGG 
CCTTATTAA I I I I I ATATAAATTATATCTTGNCCTAAAAAAAAAAAAAAAAAAA 
Sequence 2161

ATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCCACGTAAAGACGTTAGGTCA 

AGGTGTAGCCCATGAGGTGGCAAGAAATGGGCTACATTTTCTACCCCAGAAAACTACGAT 

AGCCCTTATGAAACTTAAGGGTCGTTGAAAANAAANGANTCANAANTNTANAAAAAAATN 

ANGNNCCTN 

Sequence 2162

NCCGGNCAGGTACGCGGGGCACAGCGGCTTNCTTGATCCTTGCCACCCGCGACTGAACAC 

CGACAGCAGCATGNCTCACCATGAAGTTGCTGATGGTCCTAATGCTNGNGGTCCCTTTNC 

CAACNACTGTTTACGCANGGCTTCTGNCATGCTCCCTTTATTTGGAGAATTGNGANTTTT 

T CAAGN ACAATTNCAATTNCACCAAGTTGTTCTAATNACCTGAAATATCCAACAGANACN 

NTTTTTNMNGMGTTTTCANTAANACCNNACMNTGGCCNACTTTACNAAAATTGGCCA 

TTATTATTGAAATNTGTAANNGNANATTGTTT 

Sequence 2163 

GACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGGGACAACACAAA 

TTACTGCAACTCCAACAACTATGGCAATAACAATCACAGCAATGACCCAAACATCCAAAC 

TGTCMGTATTCCTTCCTCAGGTTTAGGATATCCTGGATAATTTGAGACTGGAGGCTTGT 

AAGTAGGCCTAGGACCTGAGGCACTGGACGCTGGAGATTTTGTAGTGGAAGAAGTCGACA 

CTTTAAGACACTTTGGAACTGGGGGATCCCAAGTACTTGTGGAAGGTGGACACCATCACA 

TCCAGGGCCTTCTCCAGAGGGCACGCCATGACAGCAGTCAGGATCAAACCAAGAAAGAAG 

AAGCGCTTAGGAGAGAGGGGTTGTAGAGAGGGGGAAAGAATTCCCGCGTACCTTGCCCCG 

GGCGGCCCGCTCTAGAACTAGGTGGGATTCCCCCNGGCTTGCCGGGMTTCCNATTTTCA 

AAGCTTTNTCGGATNACCCGGCGACCCTTNCAAGGGGGGG 

Sequence 2164 

ACTATA GGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTA CI I I I I 1 1 I I 1 1 I I I M 

TTTTTTGCGAGATGGNGTCTTGCTCTGTTGCCCAAGCTGGAGNGNAGGGGCGCAATCTTG 

GCTCACTGCAACCTCTTCCTTCCAGGTTCACGCCATTCTCCTGCCTNAGCCTCCCAAGTA 

GCTGGGACTACAGGNGCCAGCCACCACGCCTGGCTAAI Mil IGTAI I 1 1 I AGTAGAGAC 

GGGATTTCACTGTGTTAGCCAGGATGGTCTCAAACTCCAGACCTGGNGATCCGCCCACCT 

GGGG CTCCCNAAGNGCTGGGATTACNGGCATGAGCCACTGGGCCCNGGCAAAAAAACAAC 

TTTTTAAAAAATGGACCCTGGCCCNGGGCGGGGCCGTTTTTAAAAACTAANGGGGGATNC 

CCCCCNGGCCTNGCANGGGAATTTCCMNNATCAMGCTTTTNTTTGGATAACCCGCCNN 

CCCCNNTNNGGGGGGGGGGG 

Sequence 2165 

AGGTACMGATAGTCATCTCAGTAAAAGGTCTATTATCTAACTTGCCAAACTTGTTTACT 

GAGAGCCCTAAGGAACTAAAACTGCCATAATGCCGNGCACAGCTTGNAAAGCANTTAGAG 

TAAGCMGATTAGTTTTTCCTCCCTTCCAGTTCCTCAGCAGGCCTGGCTGAAGGCCCANG 

AGGGAAGGAAATATAAGAACCAACAATAAAAATAGCAANTNGCAATAAGGAAGAATGCCA 

TCCCCATGGTAGGCANCACNCAATANATTTNTGGAAACCCACCTNTTCCCGGGATCANGG 

CCTTTCCATTTGCTCACCGGATGCNTNACCGCCTGGGGCCGGGCCCGNCTTCTAAGAACN 

TAGNTGGGATCCCCCCGGGGCTTGGCMGGGAAMTTTCTAATAATTCAAAGCCTTTAAN 

TNGG AATA CCCCGTNCGGAACCCCTTCCGAGGGGGGGGGGGGGCCCCCGGGTTACCCCCA 

AGGCI I 1 1 I I I GGNNTTCC 

Sequence 2166 

CCGGGCAGGTACGCGGNGACGAAGTTCCGGTCCAGGTCTCTGACTTCGGGCTTGTTCGCT 
GGTGGCNGTCGGAGCCGAGCCGGACTGGTCAGGATGANCACGGACGTGCAGNTCGCCATC 
TTCGACAACATG 
Sequence 2167 

AG GTACT CCTGGGCAACAGCGAGACTCCGTNTCAAAAAAAAAAAAAAAAAAAGAAACCAT 
TTATTTTAAAAATGATTAGATTGCTATGCCTCAACTCATAGAAGATGAACCCTTCAAGAA 
AACGTGAAGNAGGAACCGGGNGGGCCANNAAATGAAAACAGGCAAGTANAGNTATTANTT 
NGGAAAAACATTTTNTCAACACCAAATGTTAAAAAGACTTTCCTTTTGNTAAAACCTGGG 







ATTATGAGNMGMCTTmCAATTGGGGTTANNTTCTAGGGATANACTCMGTATNm 
GCANCCACCTTMNAAMCTTGCCATTAGAAAAMCCTGNAAMGGTTTATTGTTTCCCA 
GNATAACTTTTCCGTTGTTTTACCCAAA I 1111 I NTTTAAG AACTTTG 
Sequence 2168 

AGGTACTCCAGGGCTTCATTCATATTTCCTTCCAATTTGTAGACGAACCCAAGGAGGCTC 

MGCTTT CCAGA TCTMTGCCTTTCTCCGMGTTTCCTTAAAACCAA7TTCTTCAAAGAA 

TTTGATACTTTTATCCCTTTGTTMTGATAGCCCTGTTCTATTTTTATAGCTT^ 

MTGGATAATTTGCATTGACTGTCAGATTTCTTTTGAAATTCCTGNAAACNCGACNCATN 

AGTTGG AAAATTGN TATGTCTCTGCATTGTTTTCTTCTACCACNTGGNTTTCATCGCNAT 

TAAACAAI I II I I I IGAAAMATTTCTCTTTCAAGCCTTTNTCTGTGGATTGGCCTGCCT 

TCTMTA TTACCMTTTC CTTGGCCANGGGTTCTTAGGATGTAGCCCACCCTCAAAAATG 

TGGGGCC I rTTTTTTI ICCCACCTGGCAAGAATTCAAANAATCGGAAAAATAATGGGGCC 

NTG 

Sequence 2169 

CGAGGTACATTTTNAAAGAGTTGTTTTTTGGCCGGNNTTCAGTGGCTCANGCCTGAAATN 

CCAGCACTTTGGGAGGCCGAGGTGGGCGGANCACGAGGGCTGGAGATNGAGACCATCCTG 

GCTAACAAAAGAAGAAANCCCGTCTCTACNAAAAATACAANAAAATTAGCCAGGCGNGGG 

NGGCTGNGCACNCTAGTAGGCCCCAGGCCTACTTTNGGCAGGCCTGAGTGCAGGNAGTAA 

TGGNCCGCGCANCCCTGNCAAGNGAAANTAAGGNTNTGCNGGTCGGAGNCCCAAAGNANT 

GCGGNCCCCTTGCCACNCCCAAGGCNCTNGGGCCAAACCAGAAGGCNAAAGGANCTCCCT 

ATTCCTCNAANAACNCAAGATATACCNCAANNGCGAGGGAGGTTTNNGGTTNCTTACCTC 

ANTTGGGCNNCAAATTNATAAGGGNTCAATANNANCNAGGNNTAACCAATAAANNACCCN 

CACAGGGGNNTCCTTTAACCAAACCCNTAAAANNCACCTGGGGTCAAGTCCNAAATAAAA 

A 

Sequence 2170 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGCCAGGTACCTGCTGGGAGAAGTT 

TCTGAACAACNATATCTGGGATACACAGAAAAGTNTGGAAGANGAGAAAGAAANGCCTAA 

ATNGGAATGAGATCCAAGACTAAACGCNAGAGCTAGATTGAGCCGCATTTGAAANCTCCT 

TCCCNTTGGGGCNTTGGCAGAGGGGGAGAAAAGGCTTCAAAGGAACTNGGTGGCATNANC 

ACCCCCCTCCCCCAATGAGGACACCT 

Sequence 2171 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTCACTGAGCTCAAGA 

CACACTTGGTGCTTGCAGTGCTGATACCTGCTCTTTGCCGGCCCCCGCGTACCATCCCAT 

GTGGAATCTGTGAGTGTCCTCTTAAGTAGCGTGGGCTAGCCAATCTGCTCGTTNATGGGT 

GTATTTGTAAAGTCCGAATTCCATATGTAATANGGATGCAAGTCTAAGCGTTTCATGTGG 

ACATAAATGTATCTAMTMAACTTTCCCTAGCACTGTGGCTGACCTCACCCTTACTTTT 

ATACTTTAGTATGAAACTG ATGAGAACTTTGGTAGNGAGT A I I I I I I I I ATATATATACA 

TATATATGTATCTATCTATATATATATCTCMGCATCTTTCAGGTCTTTGTGTNGTGGNT 

TTTNTTAAAGCCCCTGTTGTAAAAAAAATTACTATTGTGGGATGGGCAGTCTCTCACATC 

ACAGATGTNGAAAAGNATMTTTTTATANTNNGTATTTNCAAANAAAATAAAATCTGGGG 

AAAGGTNNCATTCCTNTACTGGNGGNCCAAAAAATCAATNGGTTTTGTNTGNCCAAAAAA 

AAATATTAAAATTAAAAATATANTTNTNTTGAACTTAAAAAAAAA 

Sequence 2172 

AGGGTACATTTTTAMGAGTTGTTTTTTGGCCGGGCGCAGTGGCTCATGCNTGTAATCCC 

AGNACTTTGGGNGGCCGNGGNGGGCGGATCACGAGGTCTGGAGTTTGAGACCATCCTGGC 

TAACACAGTGAAATCCCGTCTNTACTAAAAATACAAAAAAATTTAGCCAGGCGTGGTGGG 

CTGNNAACCAGTAGTNNCAGNTACTTGGGAGGCTGAGGNANNGANAATGGGCCNTGAACC 

TGGAAGGAAGAGGTTGCANTGAGCCAAAAANTGCGCCCCTTGCAACTNCAGCCTTGGGCA 

ACAAGANCAAGACTCCCATCTCAAGAAAAAAAAAACAAATAAGNAAAAGTACCTTGCCCG 

GGCGGCCCGCTCTANAAACTAGNGGATCCGCCCGGGGCTGNAGGGAAANTCTATANCAAA 

GCTTANNGAATTCCGCCNACCTNGGAGGGGGG 
Sequence 2173 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTATCCCAGCCCCCCAGGTTCGC 

CTAATAGTCTTGGCTACTTCCCTACAGCTGCTAATCTTAGCGGTGTCCCTCCACAGCCTG 

GCACGGTGGTCAGAATGCAGGGCCTGGCCTACAATACTGGAGTTAAGGAAATTCTTAACT 

TCTTCCAAGGTTACCAGTATGCAACCGAGGATGGACTTATACACACAAATGACCAGGCCA 

GGACTCTACCCAMGMTGGGTTTGTATTTAAGGGCCCCAGCAGTTAGAACATCCTCAGA 

AMGAAGTGTTTGAAAGATGTATGGTGATCTTGAAACCTCCAGACACAAGAAAACTTCTA 

GCAAATTCAGGGGAAGTTTGTCTACACTCAGGCTGCAGTATTTTCAGCAAACTTGATTGG 

Sequence 2174 

CCGCGGTGGCGGCCGAGGTACGCGGGGACTCGCGTCGGTTGGCGACTCCCGGACGTAGGT 

AGTTTGTTGGGCCGGGTTCTGAGGCC1TGCTTCTCTTTACTTTTCX)ACTCTAGGCCACGA 

TGCCGCAGTACGCGGGGGGGTGAAGAAGGGGCCGGCCTTCAAGCAACAGCGACGCAAGAT 

GGCAGCCACCACGGGCTCGGGAGTAAAAGTCCCTCGCAATTTCCGACTGTTGGAAGAACT 

CGAAGAAGGCCAGAAAGGAGTAGGAGATGGCACAGTTAGCTGGGGTCTAGAAGATGACGA 

AGACATGACACTTACAAGATGGACAGGGATGATAATTGGGCCTCCAAGAACAATTTATGA 

AAACCGAATATACAGCCTTAAAATAGAATGTGGGACCTAAATACGCAGAAGCACCCCCCT 

TTGTAAGATTTGTAACAAAAATTAATATGAATGGAGTAAATAG 

Sequence 2175 

CCGCGGTGGCGGCCGAGGTACCTTAAAGTCCTCCTGGCTCTGAAGCTTCATAAGATGCGT 
GAAGAAGGTGAAGAGG AAGGNGAAGGAGTCGAGCGGCCGCCCGGGCAGGTAC I I I I I II I 
MI MII MI rTTTTTTTGGGGTTTATCATATTTAGGTTATTTATTMTGAAMTATATG 
ACATTTTCAGGAATACAAATTTTGCACCCTGTGACCTCAAATGCGTGCAACMGATGTTT 
MTACAGAAAATMCACAAAAACTGTTGTTACAGGGGTTAGMTTTTTAACTTTA 
Sequence 2176 

CCGCGGTGGCGGCCGAGGTACTTCTTACAGTCTTCAGGAAATTCATTAAATCAGTGCCTC 

CAGTTCCTTTGGCTTCCAGTTTTGMGGGTCTTCAGAGGTCTTATTCTCCTTTGGCTGCT 

GGCTTGCAGGAATCAGGATGTACTGTTCCTGTTGGCCGAGTGGAGACTGGTGTTCTCAAA 

CCCGGTATGGTGGTCACCTTTGCTCCAGTCAACGTTACAACGGAAGTAAAATCTGTCGAA 

ATGCACCATGAAGCTTTGAGTGAAGCTCTTCCTGGGGACAATGTGGGCTTCAATGTC 

Sequence 2177 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTTGTAGAGTGGTGCTGCTTTAATTCATAAATC 

ACAAATAAAAGCCAATTAGCTCTATAACTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAGGAAAAAAAAAAAAAAAAAAAAAAAAAGGTCCGGNNCANTCGNG 

GNGGCNTAACCNCNCTNGGGGNGGNT^AGGGNTGGAAMCNCTNAMATNNNTGTTTTN 

AN NG G G NTA AAG N NTTG AA ATTTTG CC AG NTTTNT A AAGTC ATTAAAG N C AA AAGTTTTT 

TTAAGGGGTN I I I I I 1 1 I I I AAN AAAACCCATTAACNTTCCATTGGGCCTTTANAAANGG 

CCCCCGGGGGTTTTNGGNNAMNTTNTTTTTTTNNCCCCNNTTTTT^ 

GGGGCCCCCAANTTTTTNGNTGNCCCCNCCCNCCCCCANTTGGGGGGGNNGGGNGGGGAN 
ANCCCCCN I I Ml MM I I 
Sequence 2178

GGGCGAATTGG AGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTAC I I II I I I I I M M I I 
M I M I M M M 1 GCCMTTGTTATTTAGTTTTTATTTCATAATCATAAACTTAACTCTG 
CAATCCAGCTAGGCNTGGGAGGGAACAAGGAAAACATGGAACCCAAAGGGAACTGCAGCG 
AGAGCACAAAGATTCTAGGATACTGCGAGCAAATGGGGTGGNGGGGTGCTCTCCTGAGCT 
ACANAAGGA ATGATC TGGTGGCMNAMTTAACCANTGCTTTTATTATCNANGCACTTNN 
CACTATGCACTTTTTCTGAAATATTTTTGTAACAC I MM I GTATTTTTGCCATTTGAA 
Sequence 2179 

CCGGGCAGGTACAMTGATGAMCGGAAAGACAAAGGAAATTTTCCATTTTTGAAGAAAA 
AGTGTTCAGTGTTAATGGAGCCNGGGGAAACCACATGGACTTTGGTCAGCTCTATCAGTT 
CTTAAACACCAAAGGATGTGGGGATGTTTTCCAGATGTTCTTTGGTGTANAAGGACAATG 
ACATCAAGAGTNGTTGAANGTATCTTGCCACTGNTGGCCTTTTGAI I II I INTCCCACTT 

TTTCTTGAMGATTAAGTMTTTTATTTTAGTTCCATTCTAGMTGTTGGGGAGTGNGGC 

ACAAGAAAAAATANTATANCTGAAATGCATCTGTTAAAAATGTNATGATTGNAAGCATAA 

CTGAGTTTCA 

Sequence 2180 

TAGGGCGA ATTGGA GCTCCCCGCGGTGGCGGCCGAGTAGCATGCCAGGAAAGGAAGAGAC 

TGGGATGGTTTTTTATCTGTCGCTTTCTTAAATCAAGGGCCGCCGGGCCGGAGATGGATG 

GAGGGACCGGGGATTTGGGAACTCGAAAACGAGCTGAGGGAAGGGAGCCTGTGGAAATAG 

ACTGGAGTCTGGGTAGTGTCGTTTCCTAGAGAATGGTCTCGAAGTAACTTCTCGGTAAAG 

TCTTCAC GGM TTTCCAGACCACGCTTGCCCACTGGGAGGCTTTTAGGACCCGAGACGTG 

TGCAGGCTTTTCCAGCCAAMTGAAGTTTAATCCCTTTGTGACTTCCGACCGAAGCAAGA 

ATCGCAAAAGGCATTTCAATGCACCTTCCCACATTCGAAGGAAGATTATGTCTTCCCCTC 

TT 

Sequence 2181 

I I I I I I I I I I I M INGTTACATAAATTAACCCATTTATTATAGGCCAGTGATGTCTCAAA 

GAGTAGAGGAGCGTCTACTGGTCTTTCMCTCCTTCAGTCTTCTGATGGCGGACTTTACC 

GNGACAGCGGAAGTGGTATTGNACCTGATTTTATTTCCAGTTTTGATCCGAATCCACTGG 

GGMTGGGACGATTTTGCTTTTGTTTCTTGGCCAGGAATCGCTTAATCCT 

Sequence 2182 

AGGTACTCATCGGCCAGCACGGAGATGCACAGGTTAAATGGTTTACCATCCTGAAAGGGC 
ATATTGNGGCATGTCACCTCATACTGCCAAGCCCCATTCACGCGGCTGTTCATGACCACC 
CAATGACCAAAGTACCTGCCCG 
Sequence 2183 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGGGCCCGAAGCGTT 

TACTTTGAAAAAATTAAGAGTGTTCAAAGNAGGCCCGAGCCGCCTGGATACCGCAGGTAG 

GAATMTGGMTMGGACCGCGGTTCTATTTTGNTGGTTTTCGGAACTGAGGCCATGATT 

AAGAGGGA 

Sequence 2184 

AGGTACGCGGGGACTCGCGTCGGTTGGCGACTCCCGGACGTAGGTAGTTTGTTGGGCCGG 

GTTCTGAGGCCTTGCTTCTCTTTACTTTTCCACTCTAGGCCACGATGCCGCAGTACGCGG 

GGGGGTGAAGAAGGGGCCGGCCTTCAAGCAACAGCGACGCAAGATGGCAGCCACCACGGG 

CTCGGGAGTAAAAGTCCCTCGCAATTTCCGACTGTTGGAAGAACTCGAAGAAGGCCAGAA 

AGGAGTAGGAGATGGCACAGTTAGCTGGGGTCTAGAAGATGACGAAGACATGACACTTAC 

AAGATGGACAGGGATGATAATTGGGCCTCCAAGAACAATTTATGAAAACCGAATATACAG 

CCTTAAAATAGAATGTGGACCTAAATACCCAGAAGCACCCCCCTTTGTAAGATTTGTAAC 

AAAAATTAATATGAATGGAGTAAATAGTTCTAATGGAGTGGTGGACCCAAGAGCCATATC 

AGTGCTAGCAAAAATG 

Sequence 2185

CCGCGGNGGCGGCCGCCCGGGCCGGTACGCGGGGAACTGAAAATCCACAAGACAGANTAN 

CCAGATCTCAGAGGAGCCTGGCTAAGCAAAACCCTGCAGAACGGCTGCCTAATTTACAGC 

AACCATGAGGAAAGGCCACTTAAGGATGCAGCAAGAAGGAGCCATCTGCAATCCAGGAAG 

MATTCCTTGCCAGGAACCAAATTGGTTGTCACCTTCATCTAGGACTTCTAGCCTCGAGA 

ACTTACAAATGGTGATGATCATNAGGTCAAGGATAGTCTGGAGCAATTGAGATGTCACTT 

TAC 

Sequence 2186 

CCGCGGTGGCGGCCGCCCGGGCAGGTAC I 1 1 I I I I II 1 1 I 1 1 I I I II I I 1 1 I 1 1 NGGGNA 

AAGAAACCTTTAATGAGGATTCAAGGTTAATAAGGAAGACNCAGAGGGCCAGCACTCAGC 

CCCAACCTNCTCACGTGTACCAAGAAAAATAAAGAAGAGGCTGCAGAATATGCTAAACTT 

TTGGCCAAGAGAATGAAGGAGGCTAAGGAGAAGCGCCAGGAACAAATTGCGAAGAGACNC 

ANACTTTCCTNTCTGCGAGCTTNTACTTNTAAGTCTGAATCCAGTCAGAAATMGATTTT 

TTGNGTAACAAATAAATAAGATCAGAGTCGGTAAAAAAAA 
Sequence 2187 

CCGCGGTGGCGGCCGCCCGGGCAGGTACAAAGACTATTGTAGAGACTATCCGGGTTAGTT 

TGCAAGGGAAGCCAATGATGAGTAATTTGAAAGAAATTCACCTGGTGAGCAATGAGGACC 

CTACTGTTGCTGCCTTTAAAGCTGCTTCAGAATTCATCCTAGGGAAGAGTGAGCTGGGAC 

AAGAAACCACCCCTTCTTTCAATGCAATGGTCGTGAACAACCTGACCCTCCAGATTGTCC 

AGGGCCACATTGAATGGCAGACGGCAGATGTAATTGTTAATTCTGTAAACCCACATGATA 

TTACAGTTGGACCTGTGGCAAAGTCAATTCTACAACAAGCAGGAGTTGAAATGAAATCGG 

MTTTCTTGCCACAAAGGCTAMCAGTTTCAACGGTCCCAGTTGGTACCTCGGCCGCTCT 

AGAACTAG 

Sequence 2188

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCCAGGACAAA 

TCTCTCCACTTGGAAGGAGATCCCAATCCTTCTGCAGCCCCAACATCCACCTGCGCACCT 

AGGAAAATGCCCAAAAGGATTTCAATATCCAAACAACTGGCTTCAGTGAAAGCTCTGAGG 

MGTGCTCAGATCTGGAAAAAGCTATTGCCACCACTGCTCTGATTTTCAGAAATTCTTCT 

GACTCTGATGGTAAACTTGAAAAAGCTATTGCCAAAGATCTGCTGCAAACCCAATTTAGG 

MTTTCGCAGAGGGACAAGAAACCAAGCCAAAATACAGAGAGATCCTTTCTGAACTTGAT 

GAGCACACAGAAAATMGCTAGATTTTGAAGACTTCATGATCTTGCTCTTAAGCATCACT 

GTCATGTCAGATCTGCTACAAAATATACGGAATGTAAAAATTATGAAATGAACAGTTTTA 

AATATGCTGTATAAAATAATGGCAAAAGACAGTGTTATTAAAATGTTTCCATCTCAAAAA 

AAAAAAAAAAAAAAA 

Sequence 2189 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAAAGTTGAAAGTCACA 

MACCATAGAAAAGTCTCAGTTTCCTACTGTAGAATTCTTGGCATTTCAGTAGTCACAGT 

TAMCTTGTTCTAGGTTTTTCAGATGCTCTTTTCCAGGAC 

TAAAAATTCTTTCTTTCAAC I II I I I GATTATAAGCCTTGCTTGCTTCGATTTGGGATTT 

AGGCATCGTTGTCCTTTATTTTCTTTCAGGGCMTAATCACTTCTATTTTGTCACAGTTG 

TTACTTGGGTACCTAAAACCCGAAGAACCTTCTGTAAGAAGTGTGGCAAGCATCAGCCTC 

ACAAAGTGACACAGTATAAGAAGGGCAAGGATTCTTTGTATGCCCAGGGAAGGAGGCGCT 

ATGATCGGAAGCAGAGTGGCTATGGTGGGCAGACAAAGCCAATTTTCCGGAAGAAGGCTA 

AAGACCACAAAGAAAGA 

Sequence 2190 

CCGCGGTGGCGGCCGAGGNAC I I I I I I I I I I I I I I I It i 1 1 I I I I M C M I I I 1 1 I I M I 

TGCNANCACANAGGTCAGCAACAAGTTTATTTTGCAGCTAGCAAGGNAACAGGGTAGGGC 

ATGGTTNCATGTTTAGGTCAACTTCCTTrGTCGNGGTTGATTGGTTTGTCTTTATGGGGG 

GGGGNGGGNNAGNNGAANTCCAATNTGAATCCCACAGAAACCAGGGGCTGTANAAACANA 

ACCTGANAAGAACGAGCACTCAACCNAGCTNACTNNGGTTCANNANAAAAAATNCCAANA 

AGGAAAGCCAA 

Sequence 2191 

CCGCGGTGGCGGCCGAGGTACAAAGATAAGTCATCTCAGTAAAAGGTCTATTATCTAACT 
TGCCAAACTTGTTTACTGAGAGCCCTAAGGAACTAAAACTGCCATAATGCCGTGCACAGC 
TTGAAAAGCAATTAGAGTMGCMGATTAGTTTTTCCTCCCTTCCAGTTCCTCAGCAGGC 
CTGGCTGAAGGCCCAGGAGGGANGGANTTTTNANACCCCCTTTAAANNTNNCTTTCGCNT 
TAAGAAG AGNCCCCCCGNG GGNGNCNCC Cl I 1 1 I 1 1 I I GGAANCNCCCTTTTCCCCGGGA 
ATNAGGG I 1 1 I I I I I I I I I I I NAAAAANTCCCCCCCNNGGGGGGGNCCNNCNNTNTTTTT 
I I I I I I I! I INGAANCCCCCCCCCCCCCNGGTTNNNNCNNTCCTTCTTTNACAAAAAAAA 
AAAAAATTNCCNCNCCCCCAAAAAAAAANNNN I I I I I I I I NTCCCCNNCCCCCNNNTTTT 
TANAAAAAAAAAAAAAAN 1 1 1 I I I il I I I I I I I I I 1 1 I 1 I N 
Sequence 2192 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGCCGAGA 

GAAGCAGTAGTCMTAMGAGAGTGCCGTATTTCGCAGATTGGAGCTGAGCTGTGGCTGC 

CAGAAGATAGCGAACGAATGGAAACTGAAAGTGGAAATCAGGAAAAGGTAATGGAAGAAG 
AAAGCACTGAAAAGAAAAAAGAAGTTGAAAAAAAGAAACGGTCACGAGTAAACAGGTGCT 

TGCAGATATTGCTAAGCAAGTGGACTTCTGGTTTGGGGATGCAAATCTTCACAAGGATAG 

ATTTCTTCGAGAACAGATAGAAAAATCTAGAGATGGATATGTTGATATATCACTACTTGT 

GTCTTTTAACAAAATGAAAAAATTGACTACTGATGGGAAGTTAATTGCCAGAGCATTGAG 

MGTTCAGCTGTTGTAGAGCTTGATTTGGAAGGCACCAGAATCCGGAGGAAAAAACCTCT 

GGGGGAAAGACCAAAGGA 

Sequence 2193 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGTCCATGGGTCCACAA 

AATCCTCTTCAGCTTCTGTGGCATCTGGGCCATGATTACTGGTAGGTGCTGGGTTCCCTG 

GAGGACAGTCAGCCTTGTAATCCTCCCCCGCGGCAGCTTTGTAGCTCAI 1 1 1 IATGACAC 

CAACATCTTTACTGCAGAATCAATTTCATCCCAGGTTGTGGAAGACACTGCAGAGGTGGC 

CACTGG TCACGCTGGGCAGTTGAGCTGCTGAGAGTCCCCCGCGTACTATCTTCACTCTTT 

TTTTTTCAGMGCCAATGTTCTCTAMTCTGCAGCTTCATTCCACAGCTTTACAGAATCA 

TAATCTCTTGMTATATTTCCMTGTTATTAAAAAATAAAAAATCATACAAGATATATTT 

AGCACATTAAAACTTAAGAGGTTACAGTATAACTGTCCAGACCTCCAGGTACCTGCCCGG 

GC 

Sequence 2194 

GTGGCGGCCGCCCGCCAGGTACATTTTTAAAGAGTTG I I I I I NGGCCGNGCGCAGTGGCT 

CATGCCTGTAATCCCAGCACTTNGGGAGGCCGAGGTGGGCGGATCACGAGGTCTGGANTT 

TGAGACCATNCTGNCTAACACAGTGAATCCCGTCTCTACTAAAAATACAAAAAATTANCC 

AGGCGTGGTGGCTGGCACCTGTAGTCCCAGCTACTTGGGAGGCTGANGCAGGANAATGGC 

GTGAACCTGGAAGGAAGAGGTTGCAGTGAGCCAAGATTGCGCCCCTGCACTCCAGCCTGG 

GCAACANAGCANGACTCCATCTNANANAAAAAAAAAAAAAANCAANGAGNNGTTTTCTNA 

TGTTCATTATAGGGCATTACAGTTACATNGNCCGAAGGTCTTACAACTAATCACTGGGTA 

GCMTANATGCTTCNGGCCCACATGATGCTGATTAGTTCNCANTTTTCATNCAGTTGACA 

ATATTAACCCCCATTNCTCCCTCCCTGCCCAAGGNTCATAAANTNGTGACTGCCTAACAA 

CCAAMTTNGGAAGGCCCANTCTTNATTTTCANTCAGACTTCTTGGAACCTGAAAGATTA 

AACNTTTTGGCTMCCCTGGAMTATCTTTTTATCTCACTTATAGCNTTNAAGGCCATTG 

NATNMMCNTGATTTCTTGGAMGGGCCATACCCATAAANTCMGGGGGGTTTTTCCTG 

AAAAGTAAANGNTTCCAAATNAANNCCAAACANTTTCAACCCCAAAAAAAAAAAAAAA 

Sequence 2195 

AGGTACACTGAACACCAAGGCTCTCACTCTTGAGTCTAGGGCACCATATATATAAAGGGA 

GTTCGGAACTTGTCAAGGCAATCCTATTGTTCGTAGTCAAATCACACTGAAATCCGGAGG 

CCTGGTGTGAGAACCTACNAGCGGACACCGGGCATTTMTATTTTTTGCACACCCACACA 

GCCAGGGCCAAAGTGGTCAAGGCACTCTCCTAACACAGACAGGATCTTCTGAGTTGCAAT 

TCTTTCTGMGGMCATTTTTCACTTGAMTTCCATCAGAAMTTTCCTGAGATCTTGTG 

AGCGCTGAGGTTTCTGGTTTCATCTAGATCCAAAAACATGTCCAAATCACATCCTAACTT 

CCCAAAAGTGTTGACTGAGGAGCCAAAGGGTCTGACTATGCAGTCTGGAAAATACCCCGG 

CGGCCATGTCTTCAATAAGAGAACAGGTGAGATATCGGAGCTTAAGTGTTCTCCTCTGTT 

AGCTGGMCTCCTTCAAGAAGAGTGTTCAAGCTTGATCCGTCTATACTTTCTGCATAACA 

AAGTMTTCAAAAAGCTGCTTGTTTGAACCGTGGCAACTGANTACTTGACCCGTACCTTG 

CCCCGGGCGGGCCCTNTAAGAACTAGTGGATCCCCCGGC 

Sequence 2196 

CGCCCGGGCAGGTACAGTGGTCCTTTTCAGAGTTGGACTTCTAGACTCACCTGTTCTCAC 

TCCCTGTTTTAATTCAACCCAGCCATGCAATGCCAAATAATAGAATTGCTCCCTACCAGC 

TGAACAGGGAGGAGTCTGTGCAGTCTGACACTTGTTGTTGAACATGGCTAAATACAATGG 

GTATCGCTGAGACTAAGTTGTAGAAATTAACAAATGTGCTGCTTGGTTAAAATGGCTACA 

CTCATCTGACTCATTCTTTATTCTATTTTAGTTGGTTTGTATCTTGCCTAAGGTGCGTAG 

TCCAACTCTTGGTATTACCCTCCTAATAGTCATACTAGTAGTCATACTCCCTGGTGTAGT 

GTATTCTCTAAAAGCTTTAAATGTCTGCATGCAGCCAGCCATCAAATAGTGAATGGTCTC 

TCTTTGGCTGGAATTACCAAAACTCAGAGAAAATGTGTCATNAGGAGGAACATCATAACC 







CCATGGANGGMAAAAGCCCCCCAAATGGGTGGGAACTGGATAATAAGCACTNATGCTTT 
AAGAATTGGGCACACTTCTCACCTAAGGTGAGCGCATTGNGNCCAGGGGGTGCTTAAATG 
CTTACATACCTCCAACTGGAAATGGNTAAGGGAAGAAGATTGATNCCAATTTNAAAAAAA 
AATTTAAAAN CCC ANTTTN AAAA AA AA AA 
Sequence 2197 

CGCCCGGCAGGTACGCGGGGGTGGAGAGAGGCCTCTAGACTTCAGTTTCAGTTTCCTGGC 

TCTGGGCAGCAGCAAGAATTCCTCTGCCCCCCATCCTACCATTCACTGTCTTGCCGGCAG 

CCAGCTGAGAGCAATGGGAAATGGGAGTCCCAGCTGTCCTCGGTGCCTGCTCAGAAGCTG 

GGTTGGTTTATCCAGGAATACCTGAAGCCCTACGAAGAATGTCAGACACTGATCGACGAG 

ATGGTGAACACCATCTGTGACGTCCTGCAGGAACCCGAACAGTTCCCCCTGGTGCAGGGA 

GTGGCCATAGGTGGCTCCTATGGACGGAAAACAGTCTTAAGAGGCAACTCCGATGGTACC 
T 

Sequence 2198

CCGGGCAGGTACCAAAATTGTAAGAAGAAGCTTGGGAAGCTGCCACCTCAGTATGCCCTG 

GAGCTCCTGACGGTCTATGCTTGGGAGCGAGGGAGCATGAAAACACATTTCAACACAGCC 

CAGGGATTTCGGACGGCTTGGAATTAGTCATAAACTACCAGCAACTCTGCATCTACTGGA 

CAAAGTATTATGACTTTAAMACCCCATTATTGAAMGTACACAGGAGGCAAAGTGTTTC 

ACATTATAGACTTCACTTCCAACTCCTTGGAATGTTCATTTCTTTGGCTTACAGGAGAGA 

CTAGACAGGAAGGCCAGGCAATGC7TAGGCAACTAAAATGAGGTTGGGGGTAATGCTAAC 

GTCACCCTCACAGGGATGGCCACGGGGACTGTTATTCGCAAGCTGGTTTTCTAGGCCTGT 

TAGCTGGAAGCATGGTGAGCACCAATTTCTGGGACGCTCAAGGCCGTGTCGGGGCTTCAA 

GTCATCTTCACCACACCANGTACCTTNGGNCGNTTCTANNAACTANTGGGATNCCCCCGG 

GCTGTAGGGAMTTTNANTTTNMGCCTTATTCGATTACCCGTTGANCCTCTGGGGGGGG 

GGNC 

Sequence 2199

CCGCGGTGGCGGCCCGAGGTACAAGATNGNCATCTCAGTAAAAGGTCTATTATCTAACTN 

GCCAAACTTGTTNACTGAGAGCCCTANGGAACTAAAACTGCCATAATGCCGTGCACAGCT 

TGAAMGCAATTAGAGTAAGCANGATTAGTTTTTCCTCCCTTNCAGNCTNAGNAGGCCTG 

GCTGAAGGCCCANGANGGAAGGAANTNTANNANCCANCANTAAAAATAGCNATATGCAAT 

NNNAAGAATGCCATCCCATGGAGCACACCA 

Sequence 2200 

AGGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACACMTTTTTAGAAT 

AGTTGATGTCT7TAAGATGGATATTTTACAATTTCAGTAAAAAAAATACAACATGAAGCT 

GCTGCTGTCACAGATCACTGTAGTAAAAAGATATAAATGCAATACCATGTCGTAGAACAA 

TATATATATCCTCTGATATTTTACAAACTTTGTACTGTGTCCAACAGCTGAAGGAATTTG 

AGGGGAAGACTTTAGTGTCAGTCACCAAAGAAGGCCTGGAACTTCCAGAGGATGAAGAAG 

AGAAAAAGAAGCAGGAAGAGAAAAAAACAAAGTTTGAGAACCTCTGCAAAATCATGAAAG 

ACATATTGGAGAAAAAAGTTGA 

Sequence 2201 

GCTCCACCGTTTTGGCNNCCGAGGCCCCTTGTCCTCTTCTTCAGTGACTTAAAACAATTC 

CAGGATCAGAAGAGMGCCATTTTGACATCCTCGATAAAACTGGGGATNNGCTGNANTTC 

TGTCTGTTCACGMGTGGTTGAMAACAATTTCGAGATCCAGAAGTCTTGATGGGTTCAC 

CATCCAGGTGTTCACAAAAAATCAGAGAATCTCTTTCGAGGTGGTGGCCGCCTTCAACGC 

TCTGAGTAAGCATTGCTGGGTGTCAGGAGAGAAAAGCCAAAGAAGCGGGTGCCAGACAGC 

TCTGTGCAACCTCTAGGCCATGAGTGGGATAGATACCACTGCTGCTTTAAAAAATGGGAG 

ACCATAGACCCTCAGGAGAAAAGAATCCCTTCTACCCTGGACTCGCTCTCTTCTCTGGAA 

CTAACTTCTCCCCCATACCCTGATTGTCTTTGGAGAAAATGTTCTGGATTCTAGAATCTA 

AGGCAGAGCCTTTTAAGCCATACTGTACCTGCCCG 

Sequence 2202 

AGGTACAGACAGGGTTTCTTCATGTTGGTCAGGCTGGACTCGAACTCCTGGTCTCAACTG 
CCTCAGCCTCCCAAAGTGCTGGATTATAGGCATGAGATACCGTGCCTGGCCTCCATCACT 
GATTCTATGTATGCACATGACTCCTTTTTTACTTTTACTTAGGAGGCTTAAAACACAATT 

ATTCATTCAATATMTTATAMGCAGAGTCTTTGTTAATTTCTGAACTTGGTAAAAACAT 

TTTTTATATTTTATCTATAGCTCATTAATTTAGGTATCAAAAATGTGTTTTAGT^ 

AGCCCAAGCTGCCMTTGTTTGATATCATCATCCTCTTCTTCTGCCCTCTGGGAAGATGC 

TGCTCGGGATCGAC 

Sequence 2203 

CCGGGCAGGTACTCCAAAGATTCAGGTTTACTCACGTCATCCAGCAGAGAATGGAAAGTC 

AAATTTCCTGAATTGCTATGTGTCTGGGTTTCATCCATCCGACATTGAAGTTGACTTACT 

GAAGAATGGAGAGAGAATTGAAAAAGTGGAGCATTCAGACTTGTCTTTCAGCAAGGACTG 

GTCTTTCTATCTCTTGTACACTGTATGCATCATCAGTCCAAAGTTTTTCAGAGTTCTATG 

TGACTGGTTGAGGAAACACTGCTTGAATTTCCCCATAGCCTCTGCAGCGGCATCACTGTC 

TATGMCTCTTGAAGMGCTCTTTGTATTCAGGTATAGATATGTCGGAATTGATGGTCTT 

TTCAACCATGTCCTCCAGGAGTTTGCAGCCAGAATCTGCATAGCAGTGCAGGAGGAGGGC 

CGCCAGCATGAGGACCATCAGCAGCTTCATGGCGAGGCGGCTGCTGTCTGTGTTCAGTCG 

TGTGTGGCAGGGATCAAGGAAGTTGCTGTCCCCGCGTACCTN 

Sequence 2204 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGGTTGNCAGGATTCAA 

GCCGAATGGCTTGCCTNANAGGGNCAATGGCGTTCTGAGATGGTGGCCA 

Sequence 2205 

ACACTACTATAGGGTGMTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCTTGGTTTAT 

CAATGGGACGTTCCAGCAATCCACACAAGAGCTCTTTATCCCCAACATCACTGTGAATAA 

TAGCGGATCCTATATGTGCCAAGCCCATAACTCAGCCACTGGCCTCAATAGGACCACAGT 

CACGATGATCACAGTCTCTGGAAGTGCTCCTGTCCTCTCAGCTGTGGCCACCGTCGGCAT 

CACGATTGGAGTGCTGGCCAGGGTGGCTCTGATATAGCAGCCCTGGTGTATTTTCGATAT 

TTCAGGAAGACTGGCAGATTGGACCAGACCCTGAATTCTTCTAGCTCCTNCAATCCCATT 

TTATCCCATGGMCNCTTAAAACMNGGGTTGGTTTTGGTTCCTGAAGCCC I MM 1 GCT 

GGGGAATGGGGCACCTTAATGGAAAATTTAAAGGGGAAAANCCCTCAGGCCTGNNGGGGG 

NGGGGCCCCTTCACAGCCTTTTNCTTAACTNAGAAANCGGGGAAAATTGNAAACCATTGG 

GGGAGAAAATGGACNACTTCNCACTNTTGGACNGGTTTTTTTCCAAAANTGTNAAAACAA 

GAATTCTTTATTMTGGAAANGGGGTTTTACCCCCCTTTTAAATT 

Sequence 2206 

CCACTATAGGTTTATGGGAACTCCCCGCGGNGGCCTCACTTTTNACCTTCTGGAANCCGC 

CCACCACTTTCAAGCTCACTATTGAATCCACGCCGTTCAATGTCACAGAGGGGAAGGAGG 

TGCTTCTACTTGTCCACMTCTGCCCCAGCATCTTTTTGGCTACAGCTGGTACAGCTTTC 

TTCGTCCTCCATGCTAAGAGATGTAAAAGCTTAAGGGTCAAACAATACCAATTGTATAGG 

CTTCAAAAACCATCTAAGTTAGGGCATTCTIvfrAGTTTTAGCTAAGATACACCTGGAACAC 

TGACAAGTCATCACTTACATAGMTMTGTGAAGTAAATTTTTTGAMAATMATTTTAG 

TGGAACAATCCTGAAGGATAACACCAGAAGAATAGCAGGTTACCAGTAAGGGGTCAGCCA 

ATTTGTTCCAGTCACTTTTGAATCCATGTTCTATAATCTAAAMTTATTCTCTTTCCCTA 

AGCTGAGAGCTTCCTATCATGTCAGTATCTATGNTATGAAG 

Sequence 2207 

GGCGCCGAAAGCGAAGAAGGAAGCTTCAGCCCCTCCAAAAGCTGAAGCCAAAGCGAAGGC 

TTTAAAGGCCAAGAAGGCAGNGGNGAAAGGGGNCCACAGCCACAAAAAGAAGAAGANCCG 

CACGNCACCCACCNTCCnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnCTNGGGCGGAAAACAANGGCAAAAGCNGGNNCC 

CGGGGGGGAAAANGGGNAACCCGNNNCACAAAANCCACNACAACAAACCGAGCCCGGGAG 

CAAAAAAGNGGGAAAAGCCCCGGGGGGGGGCCCAAAGGAGGGGAGGCCAACCCCANAATA 

AAANGGGGGGGGNGCCCCACNGGCCCCCNTTCCAAACCGGGGNAAAAC 

Sequence 2208 

GGCGAATTGNNGCTCNTTCGCGGTGGCGGCCGAGGTACTGAGAGAGGTGGTGGACATACT 
TACAGAGCAACTGCGTGACATTCAGCTGGAGAATTACACACCCAAGGAACCCCTCACCCT

GCAGGCCAGGATGCCTTGTGAGCAGAAAGCTGAAGGACACAGCAGTGGATCTTGGCAGTT 

CAGTTTCGGTGGGCAGATCTTCCTCCTCTTTGACTCAGAGAAGAGAATGTGGACAACGGT 

TCATCCTGGAGCCAGAAAGATGAAAGAAAAGTGGGAGAATGACAAGGTTGTGGCCATGTC 

CTTCATT 

Sequence 2209 

AMTCCCGCGGCGGCCGNTCTAGMTTAGGGGANCNCNTTTNACCNCAGGNNTTTTGAAC 

Cnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnAGGGNAAAGAGGGNNCCNGGGNGAAAA 

NGGGGAACCGCGCACAAANCCACACAACAAACCGAGCCGGGAGCAAAAAGGGGAAAGCCC 

CGGGGGGCCNAAAGNGGGGNGCNAACCCAAAANAAAANGNGGNGGGGCNNAAAGGGCCCG 

GNTTNCCAGGGCNGGGGAAAAACCCGGNCGGGGCCAACNNGGNAAAAAAAAGGAAAANGG 

GGNCCAACCCCCCCCNAAAAAAAGGGCCCGGNNNNGGGGGAAAAAGGGGGGGNGGCCNNN 

NACCGGGNNANCCCCGGGGNAAAAAAAAANAAANNNNNNGGGGGCCCCGGGGGGGGGNNN 

NGNNNGGGGGGGGGANNGGGGGGGGNNNAANGNNNAAANCCNAAAANGGGGGGGGGGAAA 

AAGGGGGGGNNNCCNCAAAAAAAAANANGGGGGGGAAAACCCCGGGGGNAAAAAAAAAAG 

GGGGGGGAAAAAGGGCCCCCCNAAANGGGGGGGGGAACCCCNAAAAAAGGGGGCNNNNNN 

GGGGGGGGGNNNNAAAAAAGGGGNCNNCCCCCCCGGGGGGGGG 

Sequence 2210 

AANCCOCGNGGGGGCGGCCGNGGNNCCCCCGACGGNNNCAACCCCGNNGTANAAAACCCC 

CAGGAACAGAAGAGAAGGGGNCGGGNCAAGNNCGAAANCGNGGGGGAAAAGCAGAAGACC 

CGACNGNCCACGAAGNGGCGCGAAAAACAAAANCGAGAACCAGAAGGCCCGGGAGGGGAA 

CACCANCCAGGGGNCCACAAAAAAACAGAGAAGCCCNNGCGAGGGGCNGGCCGCCNGCAA 

CGCGCNGAGCNNAAAGGAAAACCCCAGCCCCCGGAACCANCGAGAGCNCAAAAGANCCNG 

GGAAAANACAAAGGCCAGNCCNGGGGAGAANNGCAGNCCGCANCACNGAACCCCAGCAGA 

NAGGGNGNNNGGACAACCGGGCCCGGAAAAANA 

Sequence 2211

TACTATAGGGC GMTTGGAGCTCNCCGCGGNGGCGGCCGCCCGGGCAGGGNACTTrrTTT 
I M i 1 1 i I I i I IGGNGG I 11 I ) M 1 1 1 1 11 I ) J I 1 II 1 1 1 1 1 I I 1 1 1 1 1 I 1 1 I) 1 1 1 I I) 

nTl i M I I M M I I I I I I I I 1 i I I I i I TNAAAAAAANAGCCAATTTTTNNAAAN 

NNAAANNTNCNNGGNTAAANGGNNNNNGNNAAAAATTTGGGGGGNAAMGGGNTNNTTTN 

ANCCCACTGNGNGGGGCANCCCTTANAANNNGGGGGAANNCCCCNGGNAAAAAANGGGGC 

NGGGGNCCCNNNTNNGNANNNTTCCCNGGGGGGGGGCCCCANNAAAANNNNNNNCCCCGG 

GGNGGGGGGNNCCCCANTT7NNCCNAAAAANCTANGTCCCCGGGNCCNNNAAAAAAAAGG 
GGGC

Sequence 2212 

CCGGGCAGGTACCCGGGTGGTTAAACTTCGCAAAATGCCTAGATATTATCCTACTGAAGA 

TGTGCCTCGAAAGCTGTTGAGCCACGGCAAAAAACCCTTCAGTCAGCACCGTGATGAAAA 

CTTGCGGAGCCAGCATTTACCCCCGGGACCATTCTGATCATCCTCACTTGGACGCCACAA 

GGGGCAAGGAGGGTGGTT1TCCTGAAGCAGCTTGGCTAGTTGGNCTTATTTAACTNTGTN 

GACCTGG ACCCT NTTGGTTCCCTCAATTCNGAGGTTCCCTCTTACCGAAGNAACACACCC 

ANGAMAATTTTGGTCCATTTGCNCACCTTTTCAACCCAAANAATTCGGATNTTCAAGGC 

AAATTGTTANAAATAAATCCCCCAAMAACMTCNTTAACNTGGAATGGCCTTTACTTTT 

CA 

Sequence 2213 

CCGC GGTGGCGGC CGCCCGGGCAGGTAC I M I I I I J M I I M I M I M 1 1 I NACCGGAAA 
TTGTCTTTTATTTTTCTTTGCTGGGGTTC 

GGATATAGAGTCATCAMAGAAAAANCATTCTTCTTCCTCTCTGTTTCCCTAAAAANAAA 
ACAGTCACAGGAGACTTTCCTTTTTTCTTAATTGTTCTCATCTCATCTTTMTTATTGCT 
GCTGCTGTTATGTGTTTATGTTTCCC I rTTTTTI I AAGTTTTGAGGGTAGATCGNGTCAC 
TGGGCGTMGANACTGAGAAATCAGGTAGCTTCACTGGCTGGTAGGGGACTTTTGCTTTT 
GAAACGTCACAGGCTGTCTTCCTTTGGATGTAATGGGACGTCCTGATGACCCA
Sequence 2214 

ACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACI I I I I I 1 1 I I I 
I I I I 1 1 1 I I I I I I I n I I NATTAGGGTTGAAAAGGTAAGTCCTATATTANATACATAGGG 
GCTGGTAGTCTTATCCTCTNTATAAATACNCAC I I I I IGNCACTGCCCCTTCCCTGCCAT 
GCAGAGTCCCCATGGCTGAATCTCCCTATGTCTCCAAGNGNGCGTCACTTGCTGTCCTNT 
GGGCTGGGTCCCATGNGCTGNGCCCACAGTCTT 
Sequence 2215 

CCGGGCAGGTACAATACACTAGAAACCAACATAATGTATTTNTTTAAAACCTGTGTGAAA 

AAATAAATGTTCCACCAGTAGGGATAGGGGAAAAGTAACCNAAAGAGAGAANNGGAGAAA 

GGGMTGCTGGTTTATCTTTTGTANGATTGTAATTCGMTGNGAGAMTTTTGCAGTATT 

TTAGCCACTTATTAAGGGAA I 1 1 I I I I I II I GTAAAANTGGAAGACCTGANACNTCTGTT 

CAAAATGGCTTTTCANTGGAACCCTTGNGTNTTGGAGACCGGTTAGNGAAAAGGCANACA 

AAAACCGTGGGTAACCCTGGTGGACCTAAAGGGGCNCTGNGTGCCAAGGGNACCTTGGGG 

NAAATTGGTCCATTNGATTANTAGGAATGGGTGGGGGGTTTTTTTCCCCCCCCTTTTTNA 

GNAAANTGGTNTTGNGGATTATTTTAAAGNTGGANTTTTAAAAAACACCNI I I I I U 1 1 A 

AACTTCCCGGAAA 

Sequence 2216 

C CGGGC AGGTAC I I I I I I I I I I I II I I I I I I I I H I II I I ATACATTAAATATTTATTAA 

TGTTTTGGGAATGTTGTGTATCAGTAATTATACCTAGTAAAAAGAAAGATCCACATCCAA 

ATTTTCCAGAAAACCGGGTGGCCAGGTNNGGGNGGAAGMGAGCATTTTAAAAACCAAAA 

CCAAAACAAGGGAAAAGCTAAAAAGGAGTTTNTTCCAAATANTCCTGAATTCATTAGGAG 

GGTTGGAATTNACATTTTTTCAGGNCTNTATTCCCCCCAAAGTTGAAAI I I I I I AAGGGG 

AC CTTGGA NAAAGNCTTTATTACCCCCATNTGAAAAACAATGCCTGCCAATGGATTTACC 

CTNTTTTTCACGGTTCTTCCAGGGTTCCCMNTNACCGNTTGTCCTGGTNMTCCCTTTT 

TGGGCCCI I I I J I I M I 1 1 

Sequence 2217 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGAGAGCTGTGTGCATGCT 

GAGCAACACCACAGCCATTGCTGAGGCCTGGGCTCGCCTGGACCACAAGTTTGACCTGAT 

GTATGCCAAGGGTGCCTTTGTTCACTGGTACGGGAAGGATTCCTGCCAGGGTGATTCTGG 

GGGTCCGCTGGTATGTGGAGACCACCTCCGAGGCCTTGTGTCATGGGGTAACATCCCCTG 

TGGATCAAAGGAGAAGCCAGGAGTCTACACCAACGTCTGCAGATACACGAACTGGATCCA 

AAAAACCATTCAGGCCAAGTGACCCTGACATGTGACATCTACCTCCCGACCTACCACCCC 

ACTGGCTGGTTCCAGGAACGTCTCTCACCTAGACCTTGCCTCCCCTCCTCTCCTGCCCAG 

CTCTGACCCTGATGCTTAATAAACGCAGCNACGTGGAGGGTCCTGATTCTCCCTTGGGT 

Sequence 2218 

CCGCGGTGGCGGCCGCCCGGGCAAGGTAC I I II I 1 1 I I Ml I M 1 1 I I I I GGGATGGAGT 

CTTGCTCTGTTGCCCAGGCTGGAGTGCAGGGGCGCAATCTTGGCTCACTGCAACCTCTTC 

CTTCCAGGTTCACGCCATTCTCCTGCCTCAGCCTCCCAAGTAGCTGGGACTACAGGTGCC 

AGCCACCACGCCTGGCTAA I I I I I IGTAI I I I I AGTAGAGACGGGATTTCACTGTGTTAG 

CCAGGATGGTCTCAAACTCCAGACCTCNTGATCCGNCCACCTTGGCCTCCAAAAGNGCTG 

GGGATTACAN GGCA TGAGCCANTGGGCCCCGGCCAAAAMNMANTTTTTTAAAAANGGC 

NCCNNGCCCCTTTTAAAAATTGGGGNGNCCCCNNGGGGGNGGGGGAAAATNNAAAAAAAA 

TTTNTNNCCCCCCCCC 

Sequence 2219 

GGCGGCCGAGGTCGGCCCACTGGGACTGAGATACGGCCCAGACTCCTACGGGAGGCACGC 

ATGTAAGGAATTTTCCACAATGAGCGAAAGCTTGATGGAGCGACACAGCGTGCAGGATGA 

AGTTCTTCGGAATGTAAACTGCTGTTATAAGGGAAAAAAAAAAAAAAAAAAAAAAAAAAN 

GTACCTGCCCG 

Sequence 2220 

CGACTNCTATAGGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTTT 
TTNTNNTTTNTTN I I I 1 I I 1 1 I I I I CCCGGCCATATTCCATTNTTTNANAACTTGAATTG
CNCTGCCATCCACAAAGGCTNGCNCGGTTTTATCCACATTAAAATCAAACTGTNGGAGCN 
CCAGGACTATTTCATTATTGCTTTTGTNGGGAACMCTGATCTMCTGCATAMTCTTTT 
TCCTTGACATTCACATGAGTNTTGGNGTTTCANCCCTTTTTGCTTTTAAGNGGAANAGGC 
NMTTTCNGGAAAANANGGGGGANCCTTTTNCCTTTTTACCCCAAAANTCCCNAANGGNA 
AAANCCTTN 1 1 I H I II I I AMNTTTTTTTCCCCCNNNTNCCCMAAAMNNGNTTTTTT 
rTTTTTTTI ITTTNTTGTG NGGGGGGNGGGGNGNGGGNANAAMANANNCNNNTTTTTTN 
NNNNNNTN I Fl I I I I I I I ANAAAAAAAAAAAAAAAAAAAAAAAAAAANNNGGGNGNNNTT 
TTTNNNNNANAAAATNNATNNNNN 1 1 i I H 1 I I 1 i I 11 1 i I I 
Sequence 2221 

CACTACTTAG GGCG ATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTCTTTGT 

TTTGGCACACTTTTCCTGACAAACAGCCAGTGTTCTCAACACATAAATACTAGTCCACGT 

TAACAACAATAGCATATGAGACCGCTCTCCGTAAAGATGCCAGATTGGATGCAAATGGAC 

TGGAMTACCTTGGAGGGTTTCACAAAAATAAGACAAAGGGCAAAGGAACTTTGCCAAAG 

GAGATGGAGAGCAATTCTTTAAAGTTAGTGGGAGGGAGGAAGCAAAGAGCTCATAAATAC 

AAGCCTNTTAAAATGGGACGCATTTGCCTCGCGCCTACTGGGGTGTCTGCAGCTCAGCTT 

GGTGCCCCACACAGGACACCG 

Sequence 2222 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCGCCGGAAGTCTCTACTGAGGAAAGC 

TATGAGGATACTCTGTTCGTMGCTCCCGGTGAATTTTGTTCCACAGACTCGGAAGAAAG 

GTTGGATAAGAGTTCACTGGAGATTGACAAGTACCTGGAGGAGCCCCTCACCCAAGTGCG 

CATCTTAACGGCATTGGTGGAAGCTGGGGTCAGAAAAGAGAAATGACCATTTGGAGGGGC 

GGGGCCTCCTAGMGAACCTTCTTAGACMTGGGGGGAGGATGGGACTTTGTTTTTTCCA 

AGAATAAACTTCAACTCCTGT 

Sequence 2223 

CGAGGTACTTCCCACCACAGTGTTGAAAGGGAGAGCAAAGTCTTATGGATAAACCCTCCT 

TTCTTTTGGGGACACATGGCTCTCACTTGAGAAGCTCACCTGTGCTGAATGTCCACATGG 

TC ACTA AACATGTTATCCTTAAACCCCCGATGCCTGAGTTGAAAGGGCTCTCTCTTATTA 

GGTTTTCATGGGAACATGAGGCAGCAMTCTATTGCTAAGACTTTACCAGGCTCAAATCA 

TCTGAGGCTGATAGATATTTGACTTGGTAAGACTTAAGTAAGGCTCTGGCTCCCAGGGGC 

ATMGCAACAGTTTCTTGMTGTGCCATCTGAGMGGGAGACCCAGGTTATGMGTTTTC 

CTTTGMCACATTGGTCTTTTCTCAAAGTTCCTGCCTTGCTAGACTGTTAGCTCTTTGAG 

GACAGGGACTATGTCTTATCAATCACTATTATTTTCCTGTTACCTAGCATGGGACAAGTA 

CCACCTGCTGGGGATCACTCTCTAGGATCACCCCTCCATCCACAATTTCATCCACAAGCC 

AAGAACAAGCCCCTTCATGTTCTCCAGCAGTGCTCGCTTTTCTAC 

Sequence 2224 

TCACTNTAGGGCG AATTGGAGCTCCCCGNGGTGGCGGCCCGCCCGGGCAGGNACTTTNTT 
> I M I 1 1 I I 1 1 I I I INC I I I I I I M I I I I I I I II I I I I I I GGCC I MM I AATATCTTTA 
TTTGACAAAATAAAAGTCAGCAACCTATCTCG ATTTCCAA \ MM I GTGGTGTTTGAAAT 
TCCAATTGANACCTAAAGCATAGCTCTGGCCTTGGAGAGATTTCCAGGAGAGTCANAGCC 
CANAAGGGAGCAGGATCCAGGAGGCCCTCATNTCCCAGCACTCCAGCTGAGCCAGCCGGG 
TTATGGMCATCACTGAGCAATTAAMTATTATCAACAGACAAMAMGTTTATTGAATA 
CAAAACTCAAAGGCATCAACAGTCCTGGGCCCAAGAGATCCATGGCAGGA 
Sequence 2225 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTGGGCTCTTTCAGGTCCCCCT 

TCCTCTGCAGACTCAGCCTCCAGGCTGTTTAGCTCTCCCAGATAGCTGCCCCTGCCACGC 

AGGCCAGGAGTCTTCACACTCAGGCACCAGGCCTGGTCCAGGAGGAGCTGTGGCACAGTC 

GTGGTTCAAGTGTCCACATGCACCTGTTAGTCCCTGAGAGGTGGTGGGAATGGCTGCTTC 

ATTCCTCGAGGATGCCCGGGCCCCACCTGGGCTTGTCTTTCTGTTTAGAGGGAAGTGTAA 

CATATCTGCCATGAGGAACATAAATTCATGTAAAGCCATTTTCTCTTAAACAAAACAAAA 

CTTTCTAAGTACAATTTGTTACAAATAACGCAGACTTCAAAAACAAAAAAATNACAACCC 
AAACAAACCAAAATTTAAATGATCAGAATTGGCAAGCACAAAGAAAACGCCCTNTCCTGA
CTTNTATTGTGGGCAGTTCTGMCGCCCCCAGAAMTTTGTGCCAMGAGTTTTAGAAAA 
NTAAATATTCCAATAAAAGTAMCACTATACNCCACCAAAACAGGC 
Sequence 2226 

GGCGAAT TNGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACI I 1 1 I I I 1 1 1 1 » I I I 1 
I I I I 1 1 I ICTMCTTTATATATMTACTTTATTTTATACAATTCAGTATATCATGTTAAC 
ATATTTCACT mAATTT TATGATATGTGTGACATATTTTTM 

ta ctatcatm tttttttaaagtttgtatctttcattmtaaggAggttccttattaat 
gai 1 1 i i i i 1cctgtagcttntgagaacacatttttagatncccggcttntagttatacc 
tgaagctccacagtgtanacatgttttggccaaccttgttttatcggngtatgaaatttg 
ngctaataggagggatccaattgtcattacattagaaaactaaatgggaaa 

Sequence 2227 

CCGGGCAGGTACGCGGGGCCCTTCTAGAGGCAGGCAGAGGGAAGAGAAAGGGTCTGTTGT 

TTTTCTCTCCTGTTTCTCGCTCCCTCTCTGCTGATCACAAAGCTGCTGACCGGGTCAGAA 

AGTCCTGATGGAAATCCACCAGCGCTGGGCAGGCCCCTCCTCCTCCAGGGAGCTTGTCCT 

TGCCTAATTTTTCTTCGTCCTGATGAGAACAAAAAAGAGAGAGAGAAGAAAAGAAAAACC 

ACAAACTTCCTTTGAAAACCAGCTTGTAGTCAGGGCCCGGAGCGCATGCCATAGACTCGG 

CGACTCAGGAATCCTGAAGACTCTCTGAGCGACCTGGAGCACCTTGGGCTGTGTCCCTGC 

CTGCCTTCACCCTCCTCCAGTGCCCCCAGTACTMGGMTCTTTCTGTTTTGGGGTT 

Sequence 2228 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGACACAAGACATCATCTTGAAGGAAGG 

ATGGCTTTGGCCAGACCAAGACCAAGACTTGGAGACCTGATTGAGATTTCTCGCTTTGGC 

TATGCACACTGGGCCATCTACGTGGGAGATGGCTATGTGGTCCATCTGGCTCCGGCAAGT 

GAAATTGCTGGAGCTGGTGCGGCCAGTGTCCTGTCTGCCCTGACCAACAAAGCCATAGTG 

AAGAAGGAACTGCTGTCTGTGGTGGCTGGGGGAGACAACTACAGGGTCAATAACAAGCAC 

GATGACAGATACACACCACTGCCTTCCAACAAAATCGTCAAGCGGGCAGAGGAGTTGGTG 

GGGCAGGGAGTTGCCTTATTC 

Sequence 2229 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGCCAGGTACTANAAGATATT 
GATCCTAGTCAATTAGGCATTGTAGACTGTNATGACCACTTAATAAAAAATTNTGGACCT 
GANGCTCACGAGCATCCAG 
Sequence 2230 

NTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGNGGTACCTGCAGNNCTNNTACACCT 

ACCT CTCTNTGGGCTTNTATTTCGACCGNGATGATGTGGCTCTGGAAGGCGTGAGCCACT 

TTTTCCGCGAATTGNNCGAGGAGAAGCCGCAGAGGGCTACGAGCAGTNTCCTGAAGATGC 

AAAACCAGCAGTGGC 

Sequence 2231 

TTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGTTTCAGAGTTTC 

ATAAGACACAGTCTCMCAGGAGTTTTTTTAATAAATGCTCTATCTCTAGTTCCAGAAAC 

TGAATTCCAAGAAGCTACACTGAGGATAATTCAGCTCTGATATTGTGATTACTGTGATGT 

TCTTTCATTCATACAGTAAGTATCTGCCCAATACGTAACTACCGAGATCTATTGCTTCCT 

ACATAATTAGACAAGCTCCTTACACATATGGGCCCATGCCCGGCACATAGCAGGCACTTA 

ACAAGGGGTTGCTTMCTACNGGMGGMTAATATMTTNGCCTTTCCNTTTTTAACTGN 

TTTACCCT I I I I CTATACNTTGNTATATTTTGGAAAACNACATGCTTGCAAAACTAAAAA 

TCTMCATGCATTACTAACTTTATAAMGATCCTTCAGTATTTTTCAAAAAGGGAAAAAA 

TNATTAAAACCAATCCCCCAAAT 

Sequence 2232 

TACTTAGGGCGAATTGGAGCTCACCGCGGTGGCGGCCGCAGAGCTTAAGAGACCCTTCAG 
CTTCTCGGTGTAGGTCACCAGTCATGTCACGCTCACAACCCTTTCGCCTCCCTCTAGGTC 
AGGCACCGCATCCAAGTTAAGCAACCGACGCCGGCTCCTTGGTGTCCCAACAAGGTCTAG 
TAGMAAGCAACTTTATAATTCGTTTCACAGATTTGATTAAGAGCCAACTGAGTAGTGGG 
CCTTGTGCTAGGGCTCCGAGGGCCCAAGGACCCCATGGAACCAGATTTAGTCGCTGACCT

CTAGGAGCTCACAGGTGAGTGACTCCCCCCGCGTACCTCGGCCGCTCTAGAACTAGGTGG 

GATCCCCCGGGGCTGCAGGAATTCCGATATCAAGCTTATCGAATACCCGTCGACCTTCGA 

GGGGGGGGCCCCGGTACCCAAGCTTTT 

Sequence 2233 

CCGGGCAGGTACCGNTGTGTCCGGGTGGGTGGTCAGAATGCCGNGCTCCAGGTGTTCACA 

GCTGCTTCGTGGAAGACCATGTGCTCCGATGACTGGAAGGGTCACTACGCAAATGTTGCC 

TGTGCCCAACTGNGTTTCCCAAGCTATGTGAGTTCANATAACCTCANAGTGAGCTCGCTG 

GAGGGGCAGTTCCGGGAGGAGTTTGTGTCCATCGATCACCTCTTGCCAGATGACAAGGTG 
ACT 

Sequence 2234 

ATAGGGCNAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATTGGCACGTCA 

CGATGTCTTGAGTTTCATTCACTAGGTGGCAGCCTGCATCGTTCCACTGCAAATGACTGA 

MTCCCAAAACACACAATGAGGCTGGCTCAGGTTTGACTCTATCTTGGAAAAAAATAGGA 

AMCTTCATTTATGGMTAGTTTTGAATAACCGTGGATATCACAGGTCCATTGACCTGAG 

CATTTCCATTTTTGGAAACGGGTAG 

Sequence 2235 

TACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGTCATCACACGTGACAGGATGTCA 

ATGACMTCCTCCAGAGTTTACTGCCATGACGTTTTATGGTGAAGTTCCTGAGAACAGGG 

TAGACATCATAGTAGCTAATCTAACTGCGACCGATAAGGATCAACCCCATACACCAGCCT 

GGAACGCAGTGTACCT 

Sequence 2236 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAATATGAACCT 

GTGCTTTGCAGCAGGCCAACTTGTAAAGCTGTTCTCAACCCACTTTGTCAGGTTGATTAT 

CGAGCAAAACTTTGGGCCTGTMTTTTCTGTTTTCAAAGAAATCAGTTTCCTCCAGCTTA 

TGGAGGCATATCTGAGGTGMTCAACCTGCCGMTTGATGCCCCAGTTTTCTACAATTGA 

GTACGCGGNGACAGCGGrJTTCCTTGATCCTTGCCACCCGCG 

Sequence 2237 

GGCNAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCGGGTACNCNGGGCCCTGTGCTG 

TCTGCACCGAGGAGAGCGGCCTGNCGGAAGNGGGCCACCATATCTGGAAACTACAGTCTA 

TG NTTN GAAGCGCANAAGGGAATAAACANTTAANGACTCCCCCNGGGACCTGNAGGATGG 

NCTTTTCCATGGGGGCCGGAGCNGCAGCTTACAATGNAAAATNACANACTGGNGCTIvnTG 

GAGMAACTATANTTGGCATMTTCCCATTMCCACNATGACTTCAAAATTTTAA 

Sequence 2238 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTCCTGA 
AATGGCTTCTGCAGAGATGGACCTATGCCGGGGACAGCACACCTGGACTCAAAAGGCAGG 
G ATTGG ATAGGAAGAGGAATAAAATATAAAAATCAGAGAACTGCTGAAATTCTGTGACCC 
Cl U 1 1 AGTTAAAAAAAAAAAAAAAANGAAATAAAAGTACCT 
Sequence 2239 

GGTACATTTTTAMGAGGTTGTTTTTTGGCCGGGCGCAGTGGCTCATGCCTGTAATCCCA 

GCACTTTGGGAGGCCGAGGTGGGCGGATCACGAGGTCTGGAGTTTGAGACCATCCTGGCT 

AACACAGTGAAATCCCGTCTCTACTAAAAATACAAAAAATTAGCCAGGCGTGGTGGCTGG 

CACCTGTAGTCCCAGCTACTTGGGAGGCTGAGGCAGGAGAATGGCGTGAACCTGGAAGGA 

AGAGGTTGCAGTGAGCCAAGATTGCGCCCCTGCACTCCAGCCTGGGCAACAGAGCAAGAC 

TCCATCCCAAAAAMAAAANTTNAAAAAAAANNNNGGTACCTTGCCCGGGGCGGCCGCTC 

TANMACTMGGNGGGGATCCCCCCGGGCTTGNAAGGGMTTTCNATATTCAAAGCTTTA 

TTCGAATNCCGTTCGACCTTCGAGGGGGGGGGG 

Sequence 2240 

CGGTGGCGGCCGCCCGGGCAGGTACNTGGGGAAGGCGCNNCGTGCGTCAGCTTGACAGCA 

GAAGCAGGAGGAGAGCTGGCGGGAAGACATGCACCCCTTGAAGACCCAGAGAGAGGCCGT 

CTGTCTACCGCGTAGCAGTTACATCAGACTGAGACACTTCCTGTTTACAGGAGACTATAA 
AATTCCTGCCCCGTGCTCATTTGGGGCTGACGCCATTTTAGACCTCAGCCCATCTGCACC

CAGGCGCTCACTGAMCAGTGTGTTGCTCCACACCGCCTTGTTTTGCTTGNTGGCGCGCT 

CTCAGG GTTCC GACCAATCCAAGAGCCTTGCAGAMGCATTAACGTGCTTTTNTCTTTGG 

CAAGAGTTTTTCTTTGCTCTGATCTTGGAGACCATCCCTCTGCCTAGGGGGAAAACATAN 

GGGAATACAGA 

Sequence 2241 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACACCATGTTAAGA 

GATAGCAATGCAACCGCAGCCAAGATGTCTTTAGATGTAATGATTGAACTCTACAGAAGG 

AACATCTGGAATGATGCAAAAACTGTCAATGTTATCACAACTGCATGTTTCTCTAAGGTC 

ACCMGATATTAGTTGCCGCTTTGACATTCTTTCTTGGGAAAGATGAAGATGAAAAACAG 

GACAGTGACTCCGAATCTGAGGATGATGGACCAACAGCAAGAGACCTGCTAGTACCCTCT 

CTTCCAGCACCCAGGCCAGTATTGAGATCGATTCTCTCTATGAAGGAATCGACTTCTATA 

CCTCCATTACCCGTGCCCGATTTGAAGAACTGAATGCTGACCTGTTTCCGTGGCACCCTG 

GACCCAGTAGA 

Sequence 2242 

TAGGGCNAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAGTGGCGTGGGCC 

TGCTTTCCCGCCAGTGCCCACTCAAAGTTTCTTACGGCATAGGCCGATGAGGAGCATGAT 

CAGGAAGGCCGGGTGATTGTGGCTGMTTTGACTCGTTTGTGCTGGTAACAGCATATGTA 

CCCTTGTCCTCTTCTTCAGTGACTTAAAACAATTCCAGGATCAGAAGAGAAGCCAACGTG 

ACATCCTCGATAAAACTGGGGATAAGCTGAAGTTCTGTCTGTTCACCGAAGTGGTTGAAA 

AACAATTTCGAGATCCAGAAGTCCCTTGATGGGTTCACCATCAGGTGTTCACAAAAA 

Sequence 2243 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGNCAGGTACCTTCACCTG 
CTTTGGAGAAATCCTGAATGTGTCCCCAGAAATGAGACGAGAAGTCTTCTCCTGATCAAA 
GGTCTTTTCTTGATTGAAGGGTTCGTGGTTTCACAGGCTTCAAGGAAAGAAGCCATGGAC 
CTCAGTGGTGAGTGTTACAGNTCCATTAGAGAAACATGCAGACCCCCCCGCGTACCTN 
Sequence 2244 

GCCGCCGGGCAGGTACTTTCCCTTCCCTAGAAAAGCNNGACTNGNCGCTAACGGTGAAGG 
ACCANGGCAGGCGTCCCTGAGTGGTCTGACACCTTTGAAACGTGAGTGAATAATCAGAGA 
GGTGTCCCTGNAATGATAAACACCAAGGAAAGGCTGCCTTCCCAGTCTGNGACCAAGCGC 
CAGAGTTTTGGGTCCACGGATAAAACGTGTCTCGTTTGTCTCTACCAGAAAATGAANGGA 

ATTTGAAATTAAGAGAAGGGAGAGATTAAAAAAAAAAAAAGAAAAAAAAAAAGTACCTCG 
GCCG 

Sequence 2245 

AGGTACGCTTCGACCCACGCGTCCGAGMAAGACTATTMTTATTTGTGCCTTTAAAGGT 

TAATTATTTCAACCTTAAAAGAATCTTATAAGAACCCATTAATAAAAAAGAATGCAAATA 

TATTGCCCAAAAMATACACAAATTATTGTCTTTCATATAGGGGAGGAAATAAATATGTT 

TCATCTGAATTGTCTGAAGTAAGTCATGAATGGAGTCAGGGTAAAAAAAAAAAAAAAAAA 

AGTGCGGCCACCTGCCCG 

Sequence 2246 

CCGGGCAGGTCAAGCTTCGACCCACGCGTCCGCGAATACTGCATACTGTTTAAGGTAGTA 

TATMGTTTATGAGAGMGTGGAGAGCTTTCTTCCTTGAAAAGTCGGTATTTGTTGAGAT 

ACCATTTGCCTCACAGAGAGGTGTTCCCCACTCCCATCCCCATTGCCAGATAATAAATAT 

TTTGAGAAAAGTGACCTAAAACAGCTGAAATCTTAGGTGCATCTGTCTGCAGACCTCCTT 

MGCAGGC TGTAT CTTACAATTCCCTTACTGCACTGGGTMGTGTTMCTTAGTTTTTGT 

TGTTTGCTCTTTTGCTTTAAATATTCTCCAAATTACACCT 

Sequence 2247 

AGGTGTACAAGCTTCNACCCACGCGTCCGAAAGATTCTTGGNTGAGCATGGTGGCTCATG 
TCTGTCATCTCANCAATTTGGGAGGCAAGGGCAGGAGGATAGTTTGAGCCCATGAGTTTG 
AGACCAGCCTGGACAACATAGTGAGACCCCATCTCAGCAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAGTGCGGCCGGCCGCAC I I I I I I I I II I I I I I I I I I GCTCAATANAAGTNTGG 
AATMTTCCAGGTAATTTAAAGCATNTTTTTNAATTGGTGTAAGCTGCTCCATGAAGTCA
NCTNGCTCCTCTMTTGGGATGGTTNTTNATNACACGGCATGTTNTAAGAGTTTGATGAC 
AGGANCATNAGTGTNNGGNCCCAGNACCCCCTTCTGNGCGGAC7TNTNGGCATCCTCCCT 
TCAMNCTCMTNCTTTTNANGGGGCCCTGAGGAMGTNNTTTATTAAAAGGGNTNTTTG 
CAAAI MUG 

Sequence 2248 

AGGTCA C 1 1 1 M M 1 1 M ITTTTTAAMTMTTCTTTTMTTATTGATGCTTTGAATAAN 

MGTCCATTTTACTAMTTTAGTATAAATTATCGGACGCGTGGGTCGAAGCTTGACCTGC 

CCG 

Sequence 2249 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCACTGACACGTTTCTCTTGACA 

CTGAGATGGTCGCAAACAAAACACCGTTCTTGCCTGGATGAAGCAAGAGTTCACATAAAA 

GAGCTTTATAAAATGTCTATGAAGGAGAATTGATAATATCAGAAGAGCTCCAGCACTTCA 

ATTGMTATAATCCTCTATTATTCTTTTCTTGATTTMTTTCTGTAGCTCCCGAAAACTT

ACTT CAATCTTGTTGAGCTCAGAATAAACAGATATCTAAAGAAAAGAGTTTCAGAGTAAC 

TTTTACCATAATGTCTACATAGAAACAAATCTTCCAAGAGTAAGCTATTTAAGGACAAAT 

TTTMGAATMTTGTMTTCAGGTGATACTTGGGTTTTTGTACAAGCTTCGACCCACGCG 

TCCGACCTGCCCG 

Sequence 2250 

CCGGGCAGGTCMGCTTCGACCCACGCGTCCGATMTTTATACTAAATTTAGTAAAATGG 
ACTTCTTATTCAMGCATCAATAATTAAAAGAATTATTTTAAAAAAAAAAAAAAAAAAAA 
AAAGTGACCT 
Sequence 2251 

TGAGCTAACTCACATTAATTGCCGTTGCGCTCACNTGCCCGCTTTCCAGACGGGGNAAAC 

CTGTCGTGCCAGCCTGCATTAAATGGAATCGGCTNAACGCGCGGGGGTAGGAGGCGAGTT 

ANGCCGTATTGGGCCGCTCTTCCGNTTTCCTCCGCTCACTGGACTTCGCTNGCCGCTTCG 

GGTCCGTNTCGGGCTGCGGCCGAGGCGGTTATCAGCTTNACTCAAAAGGCCGGGTAATAC 

GGGTTATCCACCAGAAAATCAAGGGGGATAAACGGCAGGGAAAAGAAACATGTGGAGGCA 

AAAAGGCCAGCAAAAAGGCCCAGGGAACCCGTAAAAAAGGCCCCGTTGCCTGGGCGGTTT 

TTTNCATTAGGCTTCCCGCCCCCCCTGGACGAAGCATTCACAAAAAATCCGACNGCTTCA 

AAGTTNAGAAGGTGGGCGAAAACCCCGAACAAGGGACTAATTAAAGGANNCCCAGGGCGT 

TTTCCCC 

Sequence 2252 

AGGTCTTCGACCCACGCGTCCGATTTTACTCTGTGTATTCGTGGTGGTGGGTTATAGTNT 
ACCAAGTTGTCCAAAATGGATAATTCATGTGAGATAGGCAAGANCAAGNGTTGTATCCCA 
ATACCATTTGGTGAAAAAGTTGAACTTCANAAAGGATAAGTANTTTGCCAAGATCACCTG 
NCAACCTGCATAGTCAAATTTGAGATGGAACTCAGCTACTCCAATATCTGTATTTGCAG 
Sequence 2253 

CCGGGCAGG I I I I I 1 1 II I I I I I I I I GCAACATTGCCCTTTGATGTCCCCATGAGGGCCA 

GGCCCAGGCAGMCCCATCCCATTTTATCCTTAAACTCANAAGGAAATTTGTCTAAATAT 

TAAAGGATTAATATGGGAATAAAAAATGAACCTTAAACCCTGCCACTGATACACAAGCTG 

TCTCTCTTAGAGTTCAATGMCACTTCAGGAGAGTATTTCCAACAATATTTAGATATTGG 

AATATCTAAATATTGTTGATTTAGATMCCACCCTAGATTTCTCACCACCCTAGAACATT 

TAGTGGGGAGACATTCTTTTCTCCTTTTTCTGATAACTTGGTCAGAAGTGATTGACTGNG 

CAAATGGTATTTNTCCGGACGCGTGGGTCGAAGCTTGACCTnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnGAATnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnATTGCGCGCTTTGGCCG 

TAAATCATGGTCATAGCnnnnnnnnnnnnnnnnnnnnnnnnTCCGCTCACAAAATTTNCA 

NCACAACATACCGAGCCCGGGAGCCATAAAAGTGTAAAAG 
Sequence 2254 

AGGTACATGTGCCANNTTTGTTATATCTTTANCTATATNCNNNNCTACTTGCCCCTGATC 
ATCTATGNGTAGACMTMTTAGTMGGTCAMTTTTTCACAAACTCCTCCTTCAGCTGC

TAGCMGTMTCMGAGCTAGTCTATTTTGATAGATAGCATTTCTCATCAGAGTCTCTTG 

CCAGGCAAGAACAGTCAAGGCTTGAGTGGTNTNNTTANTGNCTGCATTCTAAAACTGCTT 

GGCNCCATATGACANNGCCGANCATGTAGATGGGGGTTCGATATCCCCATGAGACATCTT 

GTGCCCAAGTGGCAGGTCCATAGTArTGTMGATTTTTTCAGGGGGCCATTCATCGTCTT 

TCCMTCAC CTATG GCTATGCTTCATTTTTCACAG'GAAGCACAGACTGGGMGCCCAGAA 

GTTTACCTGTTTTTATGGNCCCCTAAAAMAAAANANGGTTCNATGGGTGCCAAATTTAC 

ACAGNTTACCTGGCCCACTTGATTNAGGGMNNTTTTNANNCATTAAGNNCCGAANCCCN 

NNGGGNATCAAAAAGNTTTGGCCCCTGGCCCGGNNNGGGCCCGGCCCNCC 

Sequence 2255 

ACCTGCCCGGGTGGCCGCTCGAGGCCGCCCGGGCAGGTTTTATTTAACATTCAAACTTCA 

TTMGACATGTGCMTATGGCAATTTTACTGGGGATTAMCCCTACCTAGGATTGCTTGC 

TGGGGCTTAGCMCAGGGTCCAGTTCACACTTAGCACTMTTAAATACTTTATTGAATAA 

ATACAATACCAAACAAAATGCATTCAAATGCCGGACGCGTGGGTCGACTCAAGCTAGGTC 

GGACGCGTACCT 

Sequence 2256 

ACI I I n I 1 1 I 1 1 I I I I I I I I I I I AAAAATTCAAAAAATTAGTTTATTAGCTTAATATAA 

TTAGGTCAATGG MTCC TGTTTTGATCTCAATACTTCCCATATTGCAATATATAAATGTG 

ACAAATTCAGCTGTTTTGCGGCATAGATAAGTGTCTAAGCTGGGCAGTTAGTCTACCCGT 

TTATAGTTCATGTTCTTCATGGCTTTGCAGCATTTGTCACTTTCTATGATGTGTTCAAAG 

ACCA GAAAAGGCCACACTTGACCTGTCAGCTGGTCCTTGAACAGCTGTAGGI 1 I i I I 1 1 I 

TTTTGAGACAGAGTCTCCCTNTGTTGNCCAGGCTGGAGTGCANTANCGCAACCTCGACTN 

ACTGNAACCTCTGCCTCCAGGTTCAAGNGATTCTNCTCCTCANACTCCTGAGTAGCTGGG 

ATTACAGGCCCATGCCACCATGCCCGGCTGATTTTTATATT^T^AGTAGAGATGGGG^^^T 

CACCATGTTGGCCAGGCTGGTNTTGAAAACCCTCGGCCGNTTCTANAAACTAGTGGGATC 

CCCCCGGGCTTGCMGGGAAATTTCCNATTNTTCAAAAGCNTTAATCGGATACCCCGTCC 

AACCNTNNGANGGGGGGGGGGCCCCGGGANCCCCCAGC I I I I IGGTT 

Sequence 2257 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGCAGGACCTCAGATTAAAAATAT 

TGCTGAGGTCAGACGCCACMTTTTCATGACTTTCTTCAGMGTAGCACATTTTCGTGAC 

TTCCGCTGTCCTCTGAAAAACAAAGTTATTTGGAACATGTTCATGCAAAAGTGATTCTGA 

CCMGTCTAMTCGAGCTTTTCTACTGACATGAAACTGNTGGAAAACTGATCTTATTTTA 

TMGMAGTGTTTTCCCCTNAGGGGGGGNNGTGTGNTTTCNGAACANCCCNCNNATTATT 

TTTTCCCCCGGNNCCNTATAAAATATTGGGGTCCCCCCCCNNNCGGGGGGGNNGGGGNAT 

TTTAAAAATTNTTTTTTTNNTCCCCCCCCCNCTGGGGGG 

Sequence 2258 

GGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTGTACAAAGCTTCGACCCACGCGTCC 

GAGCAGAACTTGGCAGCAACAGAGGAAGGGCCCCTGGAGCCGGCTGTCGTGGATGCCTTT 

MTCAAGCCTGGCATTTGGTTGCTCACGAATGTCCCAACTACTTCCGCTAGGCCCATCAT 

GGCTCAGGCTGCCCMGGCTTTTCTGTCACCTNTTTTGTTCTCTCACACTGACCAAGTCT 

TACCTGCCCG 

Sequence 2259 

CGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTCTTCGACCCACGCGTCCGTA 
GTAATAGGAATTAACTGACCCCTTTNGGATGGGGGAGAGCATCAGGCTGGGGTCAGGTAA 
GTGTAAATGGCCTTCTGAGCATGCTCTTCTAGGCTGACTCC 
Sequence 2260 

CCCCGCGGTGGCGGCCGCCCGGGCAGGTACAATTAGTTATCAATTCATGGGCTATGGCCA 

CTGGTTTGCTGGATGGTCAGGGACTTGGAAGGAACATGACTGGAAAATTGGTGACAAAAC 

GGTCTGTGGAAGAGGTATATACACAGATCTTTCTGAATGGGTGAAAAATGTGAGGATATT 

TGTGTCTCATATGAATGCCCCCAAAGAATGACTTCAGCAGAAGAGGATTTTAATAACCAA 

GTAGATAGGATGACCTGTTCAGCCCCTTTCTCCAACCATGCAGGTAACTGCCCAATGAGC 
TCATGAATAAAGTGGCCATGGTGGCAGGGGTTGTTCGGACGCCGTGGTT
Sequence 2261 

TGGCGAATTGGACTCCCCGCTGTGGCGGGGNGNCGGNCGNCNGAGTAAGACTTCGTCTCA 

AAAAAAACAAAAACAAAAACAAAAACAAAGAAGTTAGCCTGGTTGGGCATGGTGGCTCAT 

ATCTGTAATCCTAGCACTTTGGGAGGCCCACTGGGGATCATTTGAGGCCMGAAATTTTG 

AGACCGAAGCCTGGGCAACGTTAGTTGAACCCTCATCTNTTACCCAAAAAAAAAAAAAAA 

AAAAAAAAAGTTGCCGNGCCCGCTTCTTANTAAACTAAGTTGGATTCCCCCCCGGGGCCT 

GGCAGGGGAAAATTTCGMTATTCAAAGGCTTTATTCTGAATACCCCGTTCNGACCCTTC 

GNAGGGGGGGGG 

Sequence 2262 

AGGTGGACTTGATTCATAACCAACCAACCCAGGCTGTCATGCTAAAAAAAAGGGCAGAAT 
CTTTGCTGCTGMCTACTNGCATAGGGTTGACTACCATTGNTCCTTTAGT^^TAGTTTGG 
GCTAGCAAAAAGGTGNGTCCTTTGCCATGTAAATAAAAGCCNTNTCNNGGTAATNAAAAN 
GGTNTNTNI I I I IAACTT 
Sequence 2263 

CGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTCCCAAAGTGCTGGAATCACAGGAAT 

GAGCCACCACACCCAGCCMATTGGGCACAAATTTAAAATTTGACTTTTATTAATGATAT 

GGTAAAGAGATCTAGCTTGGTCATGACACCCTTGTGTTATACGGTGACAGGCAAATCATT 

TAAAMTATCTAMCTATMTTTNCTGTAGTTCACATGAATTGGATATTCTTGAAGCGGA 

CGCGTG 

Sequence 2264 

GGCGAA TTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATTTTTTGGGGGGAGAGACACAG 
ATTTTTACACTMTATATGGACCTAGCTTGAGGCMTTTTAATCCCCTGCACTAGGCAGG 
TAATMTAMGGTTGAGTTTTCCAAAAAAAAAAAAAAAAAAAAAAGTGCGGCCACCTGCC 
CGGGCGGCCGGCCGCCCGGGCAGGTNTCGGGCCCGAGAAGACCTCCTTTTTC 
Sequence 2265 

AATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCGCGTCCGAGCTAACGAATGCTNGACT 

AACTAGATNCCAAAGCTTGCTCTGTGAAAATTCCCGNATAACCNNTGAAGTGGGCGACAC 

CNTAACCCTGCACACCTTACTCCTGGTNTCAGAGAGCCCAGTNTGAACATAAACTGNGTA 

GAGGTGTTAGACTCANCCTACCCTAGTAANGCCCAACCTCCGAGACCAACCTTAAACATC 

AGTAGACTGCGAGCTGTATGTGGATAGGAGCAGTTTNGNCAACCCCTGCNAAGTGACTCT 

GAAAAAGAC 

Sequence 2266 

CCGGGCAGGTCTTGAGTCGACCCACGCGTCCGCCTAGCAAAGCTGTTTCCACTGAATGCA 
TCTAAGCANNGATGGANCTATGCCAAAACCACCACAGGNGTTTCACTTNAATGATACCNC 
GAAACAAGG 
Sequence 2267 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAATATGCTACAGGGAAGA 

AAACGTTCCAAAAACAAGAAAAAGTTGGAAAAGGCAATGAAAGTGCTCAAGAAACAGAAA 

AAAAAAAAAAAAAAAAAAAGTGCGGCCGGCCGCAC I I I I I I I 1 1 I I I I I I I I I I I CAATN 

TTATTTATCA AAATM ATTTATTAAAAGTTTTCAMGACCNCTTTAAAGTGTANCTGGCT 

TNMNACAGATTTTTGGNACTCNTAACGGACACTGCAGTTTTNAACNCCATAGCACTCAT 

TNTATTTCACACATCATTTTTAAC 

Sequence 2268 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAAGGTATGTAATCCAGG 
AAGTGACCAGCCTGATGCGTGTTATGACTCACTGNAAGCCTCCCATGATTAAGGAC 
Sequence 2269 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCTTGAGTCGACCCAC 
GC GTCCGCCGGAGAnnnnn nnnnnnnnnnnnnnnnnnnTGATCTAGATCACGAGCAGCCG 
CACI I I I I I I I I I I I I I I I I II I I I I I | | I GGACGAATTCAGGAGTCCTTTATTAGCCGG 
CAGCCGANAGACAGCTAGCGCTCAAAATTNTNTTGGCCCCGAAGAAGGAGCTAGATTTTC 
TTTTATACTTTGGTTTAMCAGGGGAGAGGGGGAGTNTAGTTGAAACAATNTTACAGAAG
TAAAGTAGGCAAAAAGTTAAAAGGATAAACGGTTACAGGAAAGTAAACAGTTCCAGGNGC 
AGAGGCTTTAAGTNTATCCTAAGGNGATGGACCCCGGGCTTTGGGC 
Sequence 2270 

MTTG GAGCTCCCCGCGGTGGCGGNCGAGGTCAAGCTTCGACCCACGCGTCCGGTTTGTT 

TTTTTCTTACGGCAACTCAAAGCAAAGAGCTGGAGGAGCCAGNCATTATAATTGCTTACT 

CTCATCGCTTAGCGCCCCAGGTGGGATGTGTTTCCAAMCACATTTITGTNTTTATAAGG 

AMTGTAGTTAGGATTAATTTTATTGTCCTMTTAGMCTCACATTTTGGTTAAATCCTC 

A ATTTC ATT AAAAA A AAAA AAA AAAAAA AANG G N 

Sequence 2271 

CNCTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCTTGAGTC 

GACCCACGCGTCCGGTGATATGTCACAATGCCGTGTAGCCAGAGCCTAGACAAAAGTTAC 

AGCACCTGGGAGATCAGTGCAGAGATATGTCACAATGTCCCCAGTAGGCAGAGACCAGGC 

AACAGTTGGATCACCTCGGGATCAGTGCAGAGACATGTCTCAATCCCCCTGTGGGCACAG 

CCTAGACAAGAGTTAAATCACCTCGGTTAACAGTGCAGAGATATGTCAATATTCCCCTGT 

AGGCCGAGCCTACACAAGTGTTACATCACTAAGGTGATCAACGCATAGATATGTCAAAAT 

ATTGCCGTGAAAGCAGAGTCTAGACAAGAGTTACATCACCTGGGCGATCAGTGCAGAGGT 

ATGTGACAAGGCCCCTTTAAGCAGAGCCTAGACAATAGTTACATCACCTGAGTGATCAGT 

GCAGAGGTCTGTCACAATGCCCCTTTAGGCAAGAGCTTAAACACCTCGGC 

Sequence 2272 

AGGTACTTGACCCCACAGCCATCTGGGATGAGCCGCTTTTCAGCCACCATGTCTTCAAAT 

TCATCAGCATNGAACNNGGTGAAGCCCCACTTNTTTGAGATGNTGGATCTTCTGGCCGGC 

CAAGGAAACTTGAACTTGGCCCTGCNGCAGGGCCTCMTCACATTGCTCCTTTGTTTCTT 

GCAAGCTTTTGGGTTGCCGGGATTGGGGACATNGATAAACTTGGGCCAAATTGTGAAACC 

CTGGGCCACAAGTGCCCTGGGGGNGCCTTTCCAAAAAGGGCCACCCTCCGCCATGCNCTG 

TTNTGGGAAGNCCTTGTTCAAGCCCCCCAANCNACAGGGGAACCAAACCAATTCNTTNGT 

TTGGAATGGCCGGGAATTGAACCGTGGNAAAGGGGGGGTTGGGGAAGCCCCGCCACCCCC 

NGGNATAATGGGGAMGCCMTCTTTTGCCAACAMCTTTTTTTAACCAATGTTACCCTT 

GCCCGGGGCCGGGTCCGCTTCTAAGAAACCTAGGTGGGGGATCCCCCCCGGGGGCCTGGC 

GAGGGGAAATTTTCGMTTATTCNAAAGCTTTAATTCTGAATACCCGTCCGGACCCTANG 

AGGGGG 

Sequence 2273 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTAGCTTGAGTCGACCCACGC 
GTCCGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA AAAAA 
AAAAAAAAAAAAAAAAAAAGGGGNGGCCGGCCGCCCGGGCGGGTCCCNTCANANANAAGG 
GGGNGGGNGCTAATCCAGTACCAAACNTTC 
Sequence 2274 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCATTTCTACTGT 

AGAGGAAAATATATGACAATTATCACTGTCACTGCCTGACATACAAAAATGGAACAGAAC 

AGTGGGGTAATGAAGGGAGGGGAGGGAAAGGGAAGAGCAGGAGAGAGAGGAGTTGGAGGA 

GAGGGGAAACAAAGGGAAAAGGGTCTATTAAACAGAGGCCTAGAGAAGCTAAAATTTGGA 

AATGGCAAATCTGAGAAGAGCCTGAATAAAAAGTGGGGGTGAGGCCATGCACAGTGGCTC 

ACGCCTATAATCTCAGCACTTTGGGAGGCCGAGGCAGGTGGATCACCTGAGATCAGGAGT 

TCGAGACCAGCCTGGCCAACATGGCAAAACACCT 

Sequence 2275 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCG 
TCCGAAAAATGGGAGACAATTTCACATGGACTTTGGAAAATATTTTTTTCCTTTGCATTC 
ATCTCTCAAACTTAGTTTTTATCTTTGACCAACCGAACATGACCAAAAACCAAAAGTGCA 
TTCAACCTTACCAAAAA AAAAA AAAAA AAGACCTGCCCG 
Sequence 2276 

CCGCGGTGGCGGCCGCCCGGGCAGGTCTCCTGCCTCAGCCTCCTGAGTAGCTGGGACTAG 
AGGGATGMCCACCATGCCCAGGTMTTTTTTMTTTTTAGTAAAGGTCGGGTCTCACTA

TCTTTCCCTGCTCAGCTGGTCTGGAACTCCTGGGTTCAAGTGATCTTCCCACCTCAGCCT 

CCCAAAGTGCTGGGATTAAAGAAGTAAACTACCACACTCAGCCACACATAGGTAATTTAA 

MTATTTCCATAGTCACAATTAAAMCACATATAMTAGGTAAMTTMTMCATTTTAT 

TTMCCCAATATATTAAAATATTTCCACTTTAAAAAGAGACCTCGGCCGCTCTAGAACTA 

GTGGGATCCCCCGGGCTGCAGGGAATTCGATATCAAAGCT 

Sequence 2277 

ATAGG GCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTATTCGACCCACGCGTCCGTAG 
TTT7TATCTTTGACCAACCGAACATGACCAAAAACCAAAAGTGCATTCAACCTTACCAAA 
AAAAAAAAAAAAAAAAAANGT 
Sequence 2278 

GNACC CAGTMTCACATAAAGGTNTGCAGGTCATGNTGTTTATTTAGCTTMCTGTTTTT 

TTTTTATTTGTTGAAGGGGGTNGGTGTTATTTCAGNCTTTTTCTTAT^ 

CTTGGTAAAATCTGTAAGAAAGTTCCATAAATTATTGGGGGGGAAGGNATTTCCTCTGAA 

ATTGGGCTAAAATTCCTTGTAGCTGAAAAAAAAAANAACAAAAAAACAATAAAATANGGN 

GGCCGGGCCCGCTTCTAAGAAAACTAGGNGGGGGATCCCCCGGGGGCCTGCAGGGGAATT 

CCGNATATCAAAG CCTTAT CGGATTACCCGNCGNACCTCGGAAGGGGGGGGGGGGGCCCC 

GGGGTACCCAAGCTTTTTGGTTTCCCTTTTMGTGGAAGGGGGTTAAATTTGCCGCCGCT 

TGGGGCCGTAAAATCAATGGGTCATAAGGCTGGTNTTCCCTGGTGGNNGAAAAATTTGGT 

TAATCCCGCTTCAACAAATTTCCCACCACCAACCANTACCGAAGCCCGGGGAAGCCATAA 

AAAGGTGGTNAAAAGCCCTGGG 

Sequence 2279 

NGGNGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAAAAAAAAAAAAAA 

ACAAAA AAAAACCACTTCAACTAATTCATCTGACAATGCTGTTCATATTCATGACGCCAT 

Mill IGTTTGTTGTTGTTGTTGTTTTCCTMTAATAAAGAAGGAGACTTAGGGCTGTTG 

GGCTGATATATGTTTGGGGGTCCCACCTCCCCGCCTCATCCGTACCTCGGCCGACCAGTG 

CAAATATCTACCCAGTTAGAAGAGTAAATACCATCTTAGTGTTATTATCAAAATATTNTG 

AACTCATGAACCTCCTCAGACTGTTGCTGGGGACTCCCAGATATCAATACTCTGAGAACC 

ACTGATCTMTGTTTCTTTAGTCAGTTTCTATTTGTTCTCTAGTATAACCAAGCATAAAA 

GTAAT 

Sequence 2280 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCCGCTCAG 

GGCACAAAGTGATCATTTGGGATCCTAAGTTAAAAAGGAAATGCAAGAGTAGGATACTCC 

AATTCCAGAGTCTTTGCAGGAGGCTAATCCCACAAGAAGGGTAGCATCAGAGAAAGTGGC 

ATTG GTCT TAGTGGTGGATCATCAGGTAGACAAGTGATAGTGTGTGTAACCCATCTGAAA 

TTCATTTTACCGTCACCACTCTTACAAAGGACAGTTTATTCGCAAGGACAGTGCTGACGG 

GGAGGGGGACAGGCAGGGAGTTAGGAGGGTTTTCGAGGATTTCAAACAGGTGGAACCCAT 

CCATCCCTATTCCCAAGGGCCACTTACAACTCTAAGGGGTGGTTACAGGATTAACTACCA 

GTTCATTTTCAMATGCTGCTTTGAACTCAGAGGGTTGAT^ 

TGTAAAACTTTTTACAAAATAGT 

Sequence 2281 

CCGGGCAGGTTACAAGCTTCGACCCACGCGTCCGGTCATTATTACCCTCACTGTCAACCC 

AACACAGGCATGCTCATAAGGAAAGGTTAAAAAAAAAAAAAAAAAAAAAGTGCGGCCTCG 

AGCGGCCGCCCGGGCAGGTACAMAAAMCCTTACATAAATTAAGAATGAATACATTTAC 

AGGCGTAAATGCAAACCGCTTCCAACTCAAAGCAAGTAACAGCCCACGGTGTTCTGGCCA 

AAGACATCAGCTAAGAAAGGAAACTGGGTCCTACGGCTTGGACTTTCCAACCCTGACAGA 

CCCGCAAGACAAAACAACTGGTTCTTGCCAGCCTCTAGAGAAATCCCAGAACACTCAGCC 

CTGACACGTTAATAC 

Sequence 2282 

GCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTCAAGCTTCTACCCACAGCGTCCGTTT 
ATGTTCAGCAATAAAGGTTCTATCCGTAAAAAAAAAAAAAAAAAAAGNGCGGCCGGCCGC 
CCGGGCAGGTACTATGACTAGACACATGATGCATGGCTAAAAAGCTCTTCTGGATAACTC
CTTAGNGAAGGNCTAACNNCCCCACCACCATCAACCTACAGCCCTGCCI I I < 1 1 1 II I I I 
TTMNMGTCTGTCACCACMTNTTGTCTGGNGCTNGATTTCAAATMTACATTTNTAGA 
ACCTGCCCGGGCACACGCCCNATMCGANTGGTTTTNNNTTATATCAATTAACGTTAAA 
Sequence 2283 

CCGCGGTGGCGGCCGAGGTACTCCAGCCTGGGCGACAGAGCAAGGCTCAGTCTCAAAAAA 

AAAAAAAAAAAAAAGGAGAGGAATAGTAAATTTATAGTGGAGAAATCTGCAGTCACTAAC 

TTAACCAAATAACCACATGGATGTATCCCTTATTAAAGGTAATCGAAAGGGCACAGCGTT 

ACTTCTGTGGAATTCTTGCCCAAAATGCATAATCTCAATCAAATCATAAGAAAACATCAA 

AATTGAGAGGCATTCTACAAAACCAATAATTAACCAATATTCATCAAAAGTGTCAAGGTC 

ATAMAGACM GATG TTTATAGAAATACATTAATTTTGGTTTCTACTTAAATT^ 

TTAAMTAATTGTTTTAGAGATGTGGCCTTGTTAGTTGGCCA 

Sequence 2284 

AT AGGGCGAATT GGAGC TCCC CGCGGTGGCGGCCGAGGTGGGA I I I I I I 1 1 1 1 IGTTTTT 

TTCI 1 1 I I I I I I I GGTTTGTTTTAMTCAGTGCATAMTTTTTCTTTTCTCATTTCAGCA 

GATGGACAAACAGATGGACTCTACAGCTAAGTGGAATATCAAAGGTAGAGGGGTGATTCT 

GTGAGACTGATAGGCCTGACTATTCTCAATTCTCCCCACTGCAGTGTTCACGCGGACGCG 

TGGGTCGAANACCTGCCCG 

Sequence 2285 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCGCGTCGTAACAGTG 

TAACATGTATTATGGTAACTTCTAATCTTGTGGCCTTAGACAGTCTAGTCCAAAGGCATA 

AAGAAAGTNTGCTTTAAAAAAAAAAAAAGGAATGGTTATCTTCAAAAAAAAAAAAAAAAA 

AAAAAAGT 

Sequence 2286 

TAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACGACTCATATAGGGATCTAGA 
TCACGAGCGGNCGGCCGCCCGGGCAGGTCTCCCATCTTGCGCAAGTTGGTCACGTGGTCA 
CCCMTTCTTTGATGGCTTTCACCTGCTCATTCAGGTAATGTGTCTCAATGAAGTCACCG 
G ACGCGTGGGTCG AAGACCT 
Sequence 2287 

GCGAATTGGAGCTCCCCGCGGTGGCGGCCGTTAAAGGAATAATCTGCAGAACATCTTGAT 

TTACAAGGGACAAAATGATGCAAATTATATGCTGTCCAACCTACTGGTGAACTGGATCAG 

MTGGTCCMGGACTGTTAAACAGAGGAAGTATTTACATTTTGAAAACTTGCGGACGCGT 

GGGTCGAAGCTTGTACACCT 

Sequence 2288 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGNCTGGTACATAAAANAATTTN 

TTNGTCTTTAMTNGATACNAATGTCTATCANCTTTMTCMGTTGTMCTTATATTGAA 

GACANTTNGATACATAATAAAAAATTATGACAATGTCCTGGAAAAAAAAAAAAAAANAGT 

GCGGCCGACCT 

Sequence 2289 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCATACTTGTCATTG 

CGTAGACTTCTTATCAAAATTTCACATTNATCTGTAGGAAAATGTAAAGTTGGTAAAAAT 

TGTTTACACAAATCA CACA TTTTCCATCCTTGACAATTGCAGNG I I 1 1 I I 1 1 I AAATATT 

GCTGTATTAAGACAATTTTAACTGAAGTAGGTTGTAGAGGCTANAAACCTGATTAATAGA 

GCAGTATTAGACAATTCTAACTGAAGTAGGTTGCAGAGGCTAGAAGAAACCTGATTAATA 

CGGACGCGTGACCT 

Sequence 2290 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAAAGCTTCGACCCACGCGTGCG 

AGTCAGGGMGGTTTGTTTATGTTACATTTATTTCACCAGAACTATTTTAATATATCAAA 

GGGGTTTACTATGCCAAACAAAATTCTAGGGAAAAATACTGCTAAAAATGGATGCCTCAT 

CAGAACATGCTGTTGAGTCCMTGTGCCATMGACATnTAGCATGTTAAATAGCACTTT 

TAATAGCAAAAAAAGGCACATCAACTGCGAAGTTATCCTTAGTTTGCAAATGCTTTTTCT 
AGATTAATGATTTTTCAATCATTAGGGTACCTGCCCG
Sequence 2291 

CGCTACCTGCCCGGGCGGCCGGCCGCACI I I I I I I I I I I I 1 1 I I I I I I CAAGTTTTATGA 

TTTATTTAACTTGTGGAACAAAAATCGGACGCGTGGGTCGGACGCGTGGGTCGAAGCTAC 

CT 

Sequence 2292 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCTACCTCCAAGCAGGATGATGGG 
CTAGACATGGAGCATACATAAACGGGCAAGATTCAGTCCCTGACCGCAAGGCACTTACAG 
TCTAGTTGGGAAGGGAGACACAAATGTACCT 
Sequence 2293 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCAGAGACTCCAGGAMAATCAAAMTTTGTTT 

TTGCAATTAGCCGAGCACGTAGCCCAGTCTCTAAATGTCACTTCATATTATGTTTGTAAA 

ACAACTGTAATAAGAGATCAATGGCCATAAGAAGCCTGAGAATTAGTGCCTACAGACCCA 

GTTCCTGATGAATTCTCAGCCCAAAAGAATCACCCTGATCATTTCTAGGTTCTAAAAGTT 

TCAATTATTGGACAATATTGCATAGCTAGAAAAAAAAAAAAAAAAAAAGTGCGGCCTCGA 

GCGGCCGCCCGGGCAGGTACTTTGAGCAGGATMTAACATAAATTTCATTTAAAAAGTTG 

TATTTATAGCCCCAGTAACCGGAAAGAATTATAAGTAATTATGGAAGTATTATATTCTGA 

CCATACCAAGAGTTAAAAACAAAGAGTTCCTACTAAAGAGGAATATTTTCAAGATGATCT 

GGTCCATCATGTGCATAGTTAAAGAATGGTTGGTTTAATAMGATTCTTTTGCAAATAAA 

GAAT 

Sequence 2294 

AGGTACAAGCTTCGACCCACGCGTCCGATGAACAACTGGCTGCTCTGTCCCAGGGTCCAA 

TATCCAAGCCCAAGAGGAAAAGAGAGAAAAAAAAAAAAAAAAAAAAAAAAGTGCGGCCGG 

CCGCCCGGGCAGGTACAGATAAATATAGGAAACTACTTCAGATTATTGGACAAATAAGAA 

TTTCAGTGTGTCACTACCTATAATTAAGTAGCAGCACACCACAACCACAGCAAGTAAACA 

CTAATCTCCTCTACTGCCTTTTTGGGGTCTTTTCCAGTCACAGGAGCTAATTTACAGGGA 

TACCACTGGGTTTCAACCAATCTCTGAMGTTCCTTTATTATGAGTTTTGAMTTTAACT 

AGTGCATCACCTACAATTCTGTTGGCTAG I Mil l I CTTACTCTTCACTAATTAAGTTTT 

AATACAATATGTAATGGTTTCAAAA I MM 

Sequence 2295

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCCGAAA 

CTTTAATTTTTAGGAATAACATATTGACTTACTGMCTGMGCATTCTGAGTTGAAAGGA 

GCCCCAGAGGAAAGGAGTTCTGTGTTGCTCACATGTTAAAAGCTTGCTCACCTTCAGAGC 

AGAGGGAATACCTATCTTCAGATATCCGCCCATTTTCATCTCTTCATTATAGTCAAACAG 

TGTGACTTGAGAGTGTTGCTCTGGTGTCTGTATTCTGGCTTATGMGATTATTTGAAAAA 

GAACTCTTACTACATTGAMTGCAGACTTTTAAAMTTTAAATATTGGATTAGGCAGTCA 

AAAAACCAAACAAGCATAAAAGGTCAGTAAGTTGTAATCTTAAAAGTAAAGGTGGAAAAC 

TCATTATAAATGGAAGAAAAGTTTTGATTTCC I MM I GTTTGATGGGCAGTATGCCATA 

TTATATCCAAAGTTGGTTTAAAAAATACTTCCATCAACTATTTTTATTTAAAATAAACAT 

TT 

Sequence 2296 

CCGCGGTGGCGGCCGAGGTATACTTTGCACCTTGAAAATATAAAATAAAATAAATTTAAA 

MTAMGGTAATTTTGTTCTTCCATGTCAGCTGAAAATAAGTGAAGACTGGGTCAGTAAT 

MCATTGCTTTGCTGAATTCAGAGAATTCTAATAAMTATTTTTAGTTGGGAAGCTATCT 

GTATTAAAAAAAATGATCTAAGGCTGGTAACAGTGGCCATACTTATAATCCCAGTGCTTT 

GGGAGGCCAAGGCAGAAAAATCACTTGAGGCCAGGAGTTTGAGATCAGCCTGGGCAACAT 

AGTAAGACCTTATCTCTACCCGGACGCGTGGGTCGACTCAAGACCTGCCCGGGCGGCCGG 

CCGCACl I M M M M M M M M I I AAGCTGCTCCTTGAGGATAAGGGCTAACTCACAG 

GCAGTGCACCAAGAGCCACTATAAAAAGATCCTTAATGAGCAAAATATATCC 

Sequence 2297 

CCGCGGTGGCGGCCGCCCGGGCAGGTCAAGTCGACCCACGCGTCCGCTTGTTTTGCTCTA 
TCCCATAGGAGTTGGTATGTTGTGTTTCCAATATCATTTACTAAAAGAAATTTTCCTTTT

TATTT CTTCATTGACCAACTGGTCATTTGCATGTTCTTTMTTACTATGTGTTTGTATAG 

TTTTCAAAATTTCTCTTATTMTTTCTAGGTTTTCCTGTGGTCAGAGMGATGCTTGATA 

CTACTTT AA I I I I I I GMTGTTTTMTACTTGTTTTGTGACCTAACATATGATCTACCCT 

TGAGAATTATCTATATGCTGAGGAAAATAATGTGTATTACACAGCCATTGGATGAAATAT 

TCTGTAACTATCTCTTAGGTACCT 

Sequence 2298 

GMTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACGCTTTATTTATTTCTTTTAGGAAT 

TGCAGGTTCCTAACAAAGTAGGGGTGAGGGGGGTGTTACAAACCAGTCACTAGGCAGGAA 

CATTAGACTCCAAAAGCAGAGAAATGCTTAATTTTTCTTCTACCTGTTTCACCACATTCA 

TGTAGAACTGTAGTAAAAAAGATGGTGAATCAGGCTGAATCAATCTAAATAACAACTTAA 

GGCTCCCAAATCACATGAACCTAGGACCACTAAATCCAATGTCAGACGTGTTTAAATGGT 

GCACTGCTCTACATTTTTCTATTATGCAAAGAGCTAGAAAATAATGGTAGTGTCATTATG 

ACATTCCATGAAAATGMNGMAATCTTTCAGGAAAAATTAAGAAAATAAAAATGTTTAC 

TAAAGAAAGAATGGTCCGGCTAAGTGCTANAGTTTNTTTCNN I rTTTTTTT A 

Sequence 2299 

AGGTACTATCAAGCTTTATTTTACCTGCAAAAATATTTTAGCTACACTTGGAAAAAAATA 
AACTTGAGAATATAACTTCACATTTCTAAGGCCAGCGGACGCCGTGGTCGAAGCTCGACC 
TGCCCGGGCG 
Sequence 2300 

AGGTACGATTTTCCCTTCGCTTGAATATTATCCCTGTATATTGCATGAATGAGAGATTTC 

CCATATTTCCATCAGAGTAATAMTATACTTGCTTTAATTCTTAAGCATAAGTAAACATG 

ATAT AAAAATATATGCTGAATTACTTGTGAAGAATGCATTTAMGCTATTTTAAATGTGT 

TTTTATTTGTAAGACATTACTTATTMGMATTGGTTATTATGCTTACTGTTCTAATCTG 

GNGGTAAAGGTATTCTTAAGAATTTGCAGGTACTACAGATTTTCAAAACTGAATGAGAGA 

AAATTTGTATAACCCATCCTGCTGTTCCACCTGCCCCGGGCCGGCCCGCTCTAGAAACTA 

GGTGGGATCCCCCGGGCCTGCAGGGAATTCGATATCAAGCTTAATCGATACCCGTCGACC 

CTCGANGGGGGGGCCCGGGTACCCAGCTTTTT 

Sequence 2301 

GGAGCTCCCCGCGGTGGCGGCCGAGGTGTACAAAGCTTCGACCCACGCGTCCGGTGGCGA 

GTGCCTGTAATCCCAGCTACTGGGAGGCTGCGACAGAAGAATCACTTGAACCTGGGAGGC 

AGAGGTTGCAGTTAGCCAAGATCATGCCACTGCACTCCAGCCTANGCAACAGAATGAGAC 

TCCATCTCAAANNAGAAAAAAAAAAAAAAAAAAAAAGTGCGGCCACCTGCCCGGGCGGCC 

GGCCGCACI I I I I 1 I I I I I I I I I I I I GCCATACAGGTTCATTTTATTGAGTGGAAAGCTT 

ACAAAAGGGCCACTGGCCCCTTCC 

Sequence 2302 

CCGC GGTGGCGGCCGCAC I I I I I 1 1 1 I I I 1) U M U 1 I I 11 I 1 I I I I I I H I 1 1 I 11 11 
TTTTAGGGTTGAGGGGGAATGCTGGANATTGNAATGGGTNTGGANACATATNATATAAGT 
MTGCTAGGGNGAGTGGTAGGMGTTTTTTCATAGGAGGNGTATGAGTTGGTCGTAGCGG 
AATN GGGGGTATNCTGTTCGAANACCTGCCCGGGCGGNCGGCCNCACI 1 1 I 11 11 I I I II 
TTTTACTTTGGCNGGGGN I I I 1 1 ICTTTTCI I I 1 I I I 1 I I CAGCTACNGGAATTTANCCN 
ATTCANAGGAAATCTTCCCCATMTTANGGAACTTTNTTACANANTTTACCAAGTNTGGG 
CNNCCCNATAAGAAAAAGACTGAAATAACAACAACNACTTTAANAAAT 
Sequence 2303 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCNNTNTN I I I I I I I 11 NTTNGCCAG 

NTANMTCTNAGCTTTTTATTTGTAGGAAAAAATAAACAGATTNCCCTCCNNAACANGGC 

GTNACAANAAANGAGGCAATNAAGGGAAAAANGCANATNCTAAACGGACNCNTGGGTTNA 

ANCTTGACCT 

Sequence 2304 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCGT 
CCGAGAAGAGTTTGCAAATGCMCAAMTATTTAATTACCGGTTGTTAAAACTGGTTTAG 
CACM TTTATAT TTTCC CTCTCTTGCCTTTCTTATTTGCMTAAMGGTATTGAGCCATT

TTTTAMTGACATTTTTGATAAATTATGTTTGTACCTGCCCG 

Sequence 2305 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCCGCCGCA 

CI I I I 1 1 I I I II I I I I I I I 1 1 GNTTTCTGGGTGMGTTTATTCTGTTTTCACATCTAGGT 

TGTTGGGGAGAGTGATAGACAAAGTTCTGGATTCTGGGCATCGTCGGCGCATGCTTGTAA 

TCCTACTTGGGAGGTTGAGACAGGAGGATCACTTGAGGCTAGGAGTTGGAGGCTGCAGTG 

TACCTGCCCG 

Sequence 2306 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCGGCCGCAC I ITTTTTTTT 



I M 1 I I I 1 I rTTTTCTTCTTTCTCTTGCCTGGATTCAGTCCCAGAAATGTTAGGACTACC 

TCAGTTTTGCTCCAAACCAAACTCAAACAACAGCAGCCACTGGAAATCAAGGAAACTTCA 

CTMGMTTTMCAGATCATCAAMCACCGCCTCCTTCCCATTTTAGCCGGACGCGTGGG 

TCGAAGCTTGTACACCTGCCCG 

Sequence 2307 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCATTAAATA 
TTCTGAGAGGTGAATGTAAAATATAAAAGGTATAGG I I I I I I I I I I I I AAAGAAAACAAT 
NAACTTTCAAAGAGAAAACCAAAAAAAAAAAAAAAAAAAAAAAAAAAAGTGCGGCCGGCC 
GCACTTN I (TTTTTTTTTTTTTI I AAAANANANATGAGGTTTTGNTATGTTGCCCAGGCT 
G 

Sequence 2308 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCAAGCTTCGACC 
CACGCGTCCGTTCTAAATGATCGACAACTCTCAAGCAATAACTTGACTGTTGAATAGAAG 
ATT AAG AAAAGTTG GTTAAAAAA A A AAAAA AAA AA AAGT 
Sequence 2309 

CTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAGCTCG 

ACCCACGCGTCCGMATMTAAAGCTAGAAGTMTATTTTTCCTTTTGTCTATTTTCCAA 

ATTGACTCGATATTGATGGCTACTTTTGTAAGTriTTATTTAAGTTTAAAGGGMTA 

ATTGATCACCTCTATGTGCTCAGTACCT 

Sequence 2310 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAAGCTTCGACCCACGCGTCCGCA 

MGTTCACCMATTCGCTACAGCCTMGACTAGCCTCACCAGTCCTTTTTCCCATTAATC 

MAACTTTGCAGMGAGACAATGATTTTTACCATTCATTCAACCAGTTTGCACAGAGAGA 

GGCTGMGCCTGACTTGTAAGAAACTCTTGTCCTTTTGCCAGTGTGCCAGGTTTCTGGGT 

TCCCCTTCTCTGAGTGGCTTTGATGACCCTGCTTGCTGTGCCATAGCTATGGGGGGGCCA 

AGCCATGTTACCCAAAAAAAAAAAACNNTNNNNNNNGGTGCGGNCNGNCCCTCGNGATCT 

MAN CCCCATAGGGGGNGGGATTAACMTTNCCATTCCCNNGGGGAMTTTTTTGGCNCC 

TTTTNTTGGGGAANAAAA I I I I I I I I I I GN AAATTAAAANGGTTTAAAAANCCNCCCCTT 

TTTGNAAAAATTTANTGGNTTTTAACCCCCC 

Sequence 2311 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCACI It I M I I I I I I I I I I I AANGCTG 

CTCCTTGAGGATAAGGGCTAACTCACAGGCAGNGCACCAAGAGCCACTATGAAAAGATCC 

TTAATGAGCAMATATATNCCCTATTATTTTCCTACMGTTGCTTTTTACTTGAGTAGGA 

ACCCTTGATTG 

Sequence 2312 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACCTGGTATGCCAGA 

TGAGAATGACAGGAGCCATCCGCAAGCAGTTGGCGGCTTTCTTAGAAGGCTTCTATGAGA 

TCATTCCAAAGCGCCTCATTTCCATCTTCACTGAGCAGGAGTTAGAGCTGCTTATATCAG 

GACTGCCCACCATTGACATCGATGATCTGAAATCCAACACTGAATACCACAAGTACCTGC 

CCGGGCGGCCGGCCGCACI I I I I I I I I I I I I I I I ACCTGAAAATGCTTATTCTAGCTTCA 

CATTTGATTGTTTGGCTAAGAAGAAAATTATTTATTAGACTTMTTTTCCTCACGAGTTT 
AAAGATTGCTTCAGATCTTAAACTTCTAATGAGGAAAGCTGAGA
Sequence 2313 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGCAAGCTTCGACCCACGCGTC 

CGCAAAGTTCACCAMTTCGCTACAGCCTAAGACTAGCCTCACCAGTCCTTTTTCCCATT 

AATCAAAACTTTGCAGMGAGACMTGATT7TTACCATTCATTCMCCAGTTTGCACAGA 

GAGAGGCTGMGCCTGACTTGTAAGAMCTCTTGTCCTTTTGCCAGTGTGCCAGGTTTCT 

GGGTTCCCCTTCTCTGAGTGGCTTTGATGACCCTGCTTGCTGTGCCATAGCTATGGGGGG 

GGCCAAGCCATGTTACCCAAAAAAAAAAAAAAAAAAAAAGTGCGGCCGGCCGCCCGGGCA 

GGTGTCACTTTCM CTTGGTT ATGCCTAAAACAAAGTCTCCCTCATCTCCAAACAATTTC 

TCCCGACTTTCTTTCTTTTTTGAGATGGAGTCTTGCTCTGTCGCCCACGCTGGAGTGCAG 

TGGCATGATCTTGGCTCCCTGC 

Sequence 2314 

CCG CGGTG GCGGCCGCAC I I I 1 U I 1 I I M U 1 1 I 1 U I M M 1 1 NAAATGCTCAAAAG A 

ACM TTTTTATGTMGTTTGTGATAGAGGCCTCAGGTAATTCTACAAAAATTAAAACCCA 

TTTT CAATG CAAMTTCCCGAACATAAACAMTGCTTTTAAAAATATGGATGGNGTGGTT 

ACTC TTTTAGTMTAC TTGGATTATCATCAMGATTTAACTTTATTTTTTTGNGTGTG 

TGTGI i i I I I I I I I I IGNGNGNGI I I I I I I I I I I I I I I I I I ATTATACNCTAAGTTTTAG 

GGTACCTGCC 

Sequence 2315 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTGTACAAGCTTCGACC 
CACGCGTCCGACTTTTTGTCTTAGACCCAGTTAGGGTCACCTTACAGTGCAGGTGGAAAG 
AAAGCAGGACTGCTGAGAGGAGCTCAGGA 
Sequence 2316 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTGTACAAGCTTCGACCCACGCGT 
CCGGATTAAAGTCCTACGTGATCTGAGTTCAGACCGGAGTAATCCAGGNNGGTTTCTATC 
TACTTCAAATTCCTCCCTGTACCTGCCCG 
Sequence 2317 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGNCGAGGTGTACAANGCTTCGACCCACGCG 
tCCGCCACACGTMCTGAGATGCTCCTTTAAATAAAGCGTTTGTGTTNGANGTTAAAAAA 
AAAAAAAANAAAAAAAAAAGT 
Sequence 2318 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCACTrTTNN l ITTTTTTTTTTT1 

ATTATCTAMTCAGNTTTATTTMGAATTTCCAACANTGACAACTNTTATAAAGGGGCAT 

CCAAGCACAGGACACANAACTGCNACAAACAGCATTCTTACGGACGCGTGGGTCGAAGAC 

CTCGGCCGCT 

Sequence 2319 

CCGCGGTGGCGGCCGAGGTACAGAATGGTAAAAATTCCAATCAGTCAAAAGAGGTCAATG 
AATTAAAAGGCTTGCAACTTTTTCAAAAAAAAAAAAAAAAAAAAGTGCGGCCGGCCGCCC 
GGGCA GGTCGAGCGGCAGCAC I I II I I II I M I I I I I I I I I I I ATGGNTTTTATTTTTCA 
ATTTTTATTTTGGTTTTCTTACAAAGGTTC 

AAAAAAAMTTCAAATTTTTGGGGGAGCGGGGGAAGGAGTTAATGAAACTGTATTGCACA 
ATGCTCTGATCMTCCTTCTTTTTCCGGACGCGTGGGTCGAANACCTCGGCCGCTCTAGA 
ACTAGTGGGATCCCC 
Sequence 2320 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCTCTTGTCTAGT 

ATACTCAAGGCAGCCTAGTAAATTATTATTTATCTATACAATACTGGAAAAACTNGNAGA 

CAAAAACATGACTTGAATTGCTAAAAAAAAAAAAAAAAAAAANGAGGGAGAATGAAAACT 

TCCGGACGCGTGGGTCGAAGCTTGACCT 

Sequence 2321 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTNCATTTACATACAACTGA 
TCCAAACAGGAAGTAAAAGCNTTATGAAAAAAGAACATGATGCAAATCATTTCCCNNNGA 
fcAAAAGGAGGGATCTGCGTGAATTACAGCAAATCAAATGATTTCCACATTATAAAAGCAA
GCCATGGCTTCTACAGATATTTTAATTCCTTGGGAGGAAGTTTCACCTNATCGGACGCGT 
GGGTCGAAGACCTGCCCGGGCGGC 
Sequence 2322 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCAAGCTTCGACC 

CACGCGTCCGATTTTTGTCTCCTGAGGTCAMCTGTGCCCATCTTTACTGCCCCTGTGTT 

GGTGTCATTCCTTTCAGCGCACTCCATGTTCTGGCAAAACACCCACAGCTCACGTTCTCG 

TTGTAGCCACATTACGCCTTTGCATTACTCTTTTGCCACTGACCTCACTGTCAGTCTTCT 

TGTCACGCTGTTTCCTCCAGGTGACTCTGACATCATTAGAAGCATGCCAGAACAGACTGG 

TGAAMGTAAACCTTTTCACCTACAAAATTTCACCTGCAAACCTTAMCCTGCAAAATTT 

TCCTTTAATAAAATTTGCTTGTTTTAAAAAAAAMAAAAAAAANAAAGTGCGGCCGCTCT 

AGAACTAGnnnnnnnnnnnnnnnnnnnnnnnnnnCGATATCAAGCTTATCGATACCCGTC 

CGACCTCGAGGGGGG

Sequence 2323 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCGTCCG 

CTGATCTGTGATTAAAGCAGTAATATTTTAAGATGGACTGGGAAAAACATNAACTCCTGA 

AGTTAGAAATAAGAATGGTTTGTAAAATCCACAGCTATATCCTGATGCTGGATGGTATTA 

ATCTTGTGTAGTCTTCAACTGGTTAGTGTGAAATAGTTCTGCCACCTCTGACGCACCACT 

GCCAATGCTGTACGTACCTGCCCG 

Sequence 2324 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTGTCGACCCACGCGTCCGCAT 

TTTCAGGCTTCTGCAGCTGTAGATTCTCACTGTGAATCCCTTGCTTGCTCATGCATAAGT 

GTATTTG CAAT ACCAAATATACAGGTTTAGTATTTTGCCTGTTAGTGATTGTTTCACATG 

TGTAACGTTTTGGTTGAGATGTTAAATGGTGGACGAGTACCTGCCCG 

Sequence 2325 

CTTAGGGCGAATTG GAGCTCCCCGCGGTGGCGGCCGCCCGGGCANGGTGGCCGCACTTTT 
I I I I I I II I I I I I I I I ACTATTCGTTAGGCTTTTATTTTTCTCTATGTTCTGCAGTAACT 
AAGGAAAATCATGGTAAATGTCAATCTTCACACAACAGCAGACACAAAGGGTTTCAGAAA 
CCGGACGCGTGGGTCGAAGCTTGTACACCT 
Sequence 2326 

GGTGTACAAGCTTCGACCCACGCGTCCGGTTTCTGAAACCCTTTGTGTCTGCTGTTGTGT 

GAAGATTGACATTTACCATGATTTTCCTTAGTTACTGCAGAACATANAGAAAAATAAAAG 

CCTAA CGAATAGTAAAAAAAAAAA AAAAAAAAAGTGCGGCCACCTGCCCGGGCGGCCGGC 

CGCACI I I I I I I I I I I I I I I I I I I I CACTGTTTAAGGAA I I I I I ATTAAAGCAAGAATTT 

TATMTCCAMTTACGTTTCCTTGCTCAGTTATCMTTCTGTTACTTAAAACAGAACTGA 

CATTCTGAGCTATTCCACAGTAAAGAATTACAAAATTAMGAAAGGAATGCTTTAMTTT 

TTGTACCTGCCCG 

Sequence 2327 

CCGCGGTGGCGGCCGCAC I I I I I I I I I I I I I I I II I I CTTTCTACTTTTAGGGTTTATTT 
ACTGGTAGTTTCAGGTCTGGAAATACAAAAAGGAACATGACATGGGTTCTGCCCTTGAGG 
TGCTCCCAGTCTGGTGAGGGACACAACACCTTTCAGGGGACAACCGTGGTACCT 
Sequence 2328 

TAGGGCGAATTGGAGCTC CCCGCGGTGGCGGCCGCCCGGGCAGGTGCCGCACI I I I I 1 1 I 

I I M I I I I I I I I I I 1 I I I I II I ANAATTTAGCCACTTNTCAGGCCTTNTTCCNNATAATT 

TGGMCTTTCCTTTGGATTTGATCAAGTTGGATAGAGTTGATCAAACCCTGATCAAGTTG 

GATAGAGTTGATCAAACCCAATGGGAAAAAGACCAAAACAACANTAAAAAACAGACAACA 

ACAACAAAAAACAGTTAGGCAAAACAAACAATGGCACAATTTATNTGATAACTGGGGTNG 

CTTTMTNGGAAN GGAN AAATTAAAACNNGGTTGGGCGGTTATTTTNACTTTAACCAAAA 

AANAAACCCCCA ATTTTA AGTTNCTTTNCTTANGGGGCGANGTTNTTGGGGTTTTAAA^ 

NGGGi I I l I TCCCTTT TTTAGNAAGGGNGGGGGGNAAAACCCCANCCCNNCCCTNANAAA 

ANCNCCNCNTTTTTTTCCNTTNNNGGGAGGGGGNGGCCAAAACNCCCNCCNNTTTNGAAA 




AAAAAG AAAA I 1 1 I I I ( I I I I I I I I I AAGCCCNTTTTAACCNCCAAAAAAAANN AAGTGT 
TTTCCN NNCGNNNCTTTTNNAAMNNGNGGNACCCCCCNGGNGGGGGGGANNTTTTTTAA 
TTTTTTTNNCCCCCCCCTNGGGGGGGGGGCC 
Sequence 2329 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCACGGTACTGAGGAAATATT 

TTGTAMGTGAGC TTTGG GTATMCTTAGCCCCATCATTATTTAGAGAATAGAGGAGGAA 

GAAAGAGGAAGGATTTTAAAGGCAGACAATGACAGACCATTCAGGATAGGTAGGGTTTTA 

AAGGGAGCGGACGCGTGGGTCGAAGACCT 

Sequence 2330 

CCCCGCGGTGGCGGCCGAGGTACCCTAAAATTTAAAGTATAATAATAATAAA I I II I 1 I 1 

TAAMA AAGAGTGT TGTCTTTGTCTTGTATTTrCTGCAGTTTGCATGTGATATTCTTAGG 

TATAGAI I I I I I I I AGTATTTGTCCTGTATATTGTTATTCGAGCTTCTGGGGATCTGTGT 

TTTGGT GTCTA TCATTAACTTTGGMTATTCTCAGTCATTACTGCTTCAAACATTCATTC 

TGTTGCTTTTTCTCTTCTGGTATTATCATTACACATATATCACACCTTTTGTAATTCTCC 

CACAGTTCATAGATATTCTGTTGTATnTATTTA I fill I CTCTTTGCCTTTTAGTTTTAG 

AGATTTCTATTGACATCACTTAAAGATGATTGATGAGTTGATGAGAATTGAGAGAATTGA 

TGAGAATTGTTGATGAGAATTATTCATTTGTGTTAGTGTTTTTCATTTCTGCCATTGNCT 

TTTGATTTTTAGAGCTTCCATCTCTCTGCTTACA 

Sequence 2331 

AGGGCGAA TTGGA GCTCCCCGCGGTGGCGGCCGAGGTACAACCTTGACCGTGACCGTTTG 
CTATATTCCTTTTTCTATGAAATAATGTGAATGATAATAAAACAGCTTTGACTTGAAAAA 
AAAAAAAAAAAAAAAAGT 
Sequence 2332 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTATTTTATAGAAAAGAA 

MCAGAGGCTCAGAGAGGTTAATTTTTCCTGGATGTCCTAGATGTTAAATGTTACAACCT 

TMTTGACCGATTCCAGAATCAGAGCTATTMACACMMCTATTTTAATTCTCTCTAAA 

TTCTTAAAGACCCAAGAAAAACAAACTTTATTGAGATAATTAGGAAI I I I I I I IAAAATA 

TCGGACGCGTGGGTCGAAGCTTGACCT 

Sequence 2333 

AGGTCAAGCTTCGACCCACGCGTCCGTGGTGAACACAGAGAAGACAGTCTTGTATATATT 

CCTCTGTATTCTGGGGAGCTTTGACCTTGGAGCTTTGTACCTGCCCG 

Sequence 2334 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCCGCAACAC 

CTATGTTTATAAAAATTTGMMCATTACATATTGTATTTAAAACTAATTAGNGAAGAGT 

AAGMAAAAACTAGCCMCAGAATTGTAGGTGATGCATTAGTTAAATTTCAAAACTCATA 

ATAMGGAACTTTCAGAGATTGGTTGAAACCCAGTGGTATCCCTGTAAATTAGCTCCTGT 

GACTGGAAAAGACCCCAAAAAGGCAGTAGAGGAGATTAGTGTTTACTTGCTGTGGTTGTG 

GTGTGCTGCTACTTAATTATAGGTAGTGACACACTGAAATTCTTATTTGTCCAATAATCT 

GAAGTAGTTTCCTATATTTATCTGTACCTGCCCGGGCGGCCGGCCGCACI I I I I I I I I I I 

TTTTTTGCNGGGGN I Mil ICTTTT CI I I I II I I ICA 

Sequence 2335 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCACTNNN H I II I N I I I I I I I I 1 1 ICTT 
TAAACACCANTTAGTTTATTTCAGGACMGAATTTACCATNTMCANTCTTTTNACATAA 
ATTCTGNCTCCCCCCAC I rTTTTTTTTl I GAAN AT AACCATTC C I iTTTTTTI 
Sequence 2336 

CNAANNGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATCTNTATCAGAGCTTTT 

GGGTGACCA GGCACACTGT CAATGAGCAGTMTACGGGGAMGGAATCTTTGGGGTTTTT 

TTTGGTTTGG 1 1 I I 1 GTTTTGTCGTTA I I I Ml I GTTTGTTTGTTTTTTG I I I I I GTTTT 

TTTAGCAGTAGGCCTCAACAGTGGACTTAAAATACTCAGTAAACCATGCTGTTAACAGAT 

AAGCTGTCATCCAG ACTTT GTTGTTCCATTTCTAGAGCACAGAGCAGATTTAGCAGAATT 

CTTMGGCTTTAGGATTTTCAGAATGGTAAATGAGCACTGGCTTCAACTTAGTCACCAGC 
TGCATTGGCCCCTAACAAGACAGTCAGCCTGGCCTTTGAAGCGTTGCAGCC
Sequence 2337 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTAAACTGCACATATTTAA 

MCATATAATTTGATACATTTTGACTCACAAAACAATCACCACAATCAAGANGATGAGNN 

TATAGATCACTCCCAAAAGTTTCCCTGTAGTCTTTTGCAGTCCTTTCTTCATGGCCTTCT 

TCATCCATCCACCCCATCTCGGTAACCAATGATCTGCTTTCTGTCACCACAAATTAGTGA 

GCACTGTCTAGAATTTTATGTAMCTGMTMTMAGATTTTACTCTTTCG 

Sequence 2338 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTC 
CGCACI I I I I I I I I I I I I I I I I I I I I I I I I 1 1 I I I I I I INGGAATCATCATTAACTTTAT 
TTGNCACTNTTGATAGACATTGGTCCACTCCAACATAAAAAGNAGAATTCACCCACTTCC 
ACTTAATATTCTATAGAATGAAGTTGTACCTGCCCGGGCGGCCGGCCGCCCGGGCAGGT 
Sequence 2339 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAAGAI I I I I I IACACAA 

AGAMCTTAATGCTGTATTAATAAATTCAGTGTGTAGCTTCAATTGGGATAGTTCCAAAA 

GTGAAGATTTTGTGAGGAATAAGTGCAAA I I I I I I I I I I ATTTTAAAAAATTCTTTG AAA 

CTCTTAAGTCTTTGTGTCTGCAATAAMTTGTACCTGCCCGGGCGGCCGGCCGCACTTTT 

I I I I 1 1 I I I I I I I I I I I I I I 1 1 I I NGAMNGTTTGAAGTTAACTCATTTTATTTNTAGGA 

TTNGGATTTCMCATTTTMTTTNTTTGG 

AGAATCAAA 

Sequence 2340 

TTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTGGACTTGATTCATAACCAACCA 
ACCCAGGCTGTCATGCTAAAAAAAAGGGCAGAATCTTTGCTGCTGAACTACTGCATAGGG 
TGACTACCATTGCTCCTTTAGTTTAGTTTGGCTAGCAGAAGGTGGTCTTGCCATGTAAAT 
AAAGCCTCTCAGGTAATCAAAATGTTTC I I I I I I I ACTTTTGCTGGTG I I I I II ICTTTT 
CI II I I I I I ICAGCTACAGGAATTTAGCCAATTGAGAGGAAATCTTCCCCATAATTATGG 
MCTTTCTTACAGATTTTACCAAGTCTGGTCAACCCAATAAGAAAAAGACTGAAATAACA 
ACAACAACTTCAACAAATAAAAAAAAACAGTTAAGCTAAATAAACAGATG 
Sequence 2341 

AGGTACTTCTTACATAGTGATTGATGTCTCATGTCTCCCTAAAATGTATAAAACCAAGCT 

GTGGCCGGACCACCTTGGGCACATGTCATCAGGACTTCTTGAGGCTATGTCACTGGGCAT 

GTCTTCAACCTTGGCMAATAAACTTTCTAMTTAATTGAGACCTGTCTCAMTTTTGGG 

GGTTCACAGGTGAGTGGGCTCAGGCATGTGCACTAGTATGACTAAAGGTCATAGACTATT 

AGACTATTAGTCTATGACCTTCCTCTAGAAACACTCGACTGGTAAGGGAAGAATGCCTCA 

ACTGAGCATGTGCACAACTCCCATAAACACACTTGTGCTTGCGGAGCCTNTCAAGTGCTG 

GCAGGCCACTGCTCAGGTGGATTCTTCCCTCCTACCCGGAGGGAAGAATCAGGGGAGAAG 

GGACACAAGCCCCTGAATGCATGCCAACACGTAAA 

Sequence 2342 

CCGCGGTGGCGGCCGAGGTACCTTCAGGTATTGCCTATTTAATGATCATATATACTTGCA 
TMTATCATCCCTTCCCTTGATTTCTTTCAATCTAAAAATAAATATGAGAAAAACAAAAA 
AAAAAAAAAAAAAGT 
Sequence 2343 

ACTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGAGGACTCAAGACC 

AGATGGCCAACTAGAAGCAGCCAGGAAGAACATCTCTCATGGAGAGACCAGGAATTTGGG 

AAGACTGGCACACTCTGAGCAGATCTTTTGAAGGAAAACATTGAGGGTGGATGAAGGGAG 

GATGCAGAGCATGGGCTGAGGGGGCAGGAATCTCAGAAGCCTGCACAGGGCTTCCAAGCA 

CTAGCATTCCTTACTAGCCCCCATTAACTCCTGGGGAAGGGGTGAGTTGAATAGGTGGNG 

GAGTGGCCCGCTCTTACCATGAAACTCCAGAATCCTAGCAGCAAGAGACCCCATGACCCC 

TGTGGACACGAGCTGTCCGGACGCGTGGGTCGAAGACCTGCCCG 

Sequence 2344 

CCGCGGTGGCGGCCGCTNNCTGCCCGGGCGGCCGGCCGCAC 1 M I NIM IT C H I I IGG 
AAAAATNTNNNGACTCTGGGGATAAAANTNCNAATTAAATNNATNCNANNTTTTAAAGGC

TATTAANNANAANAATATTNGCTAAATTNNCCTTNTGCATAACAAACTGTGGNTNTACCA 

TGTMNGTTTAAAAAAATGTNTAACCNCAATTTTACGCTCCTCTGTNACANGACAAGGAC 

TCCATTCANTGNCATTTAAGAACTNAATGGGTTGAN 

Sequence 2345 

AGGTCAAGCTTCGACCCACGCGTCCGATAAGCCAAAAAATGGGAGACAATTTCACATGGA 

CTTTGG AAAATA 1 1 I I I I ICCTTTGCATTCATCTCTCAMCTTAGTTTTTATCTTTGACC 

AACCGAACATGACCAAAAACCAAAAGTGCATTCAACCTTACCAAAAAAAAAAAAAAAGTG 

CGGCCGGCCGCACTTTGTTTCTACTGGGTTTAGACCGTCGTGAGACAGGTTAGTTTTACC 

CTACTGATGATGTGTTGTTGCCATGGTAATCCTGCTCAGTACCT 

Sequence 2346 

CCGGGCAGGTACCTGCTCCATTTCTCCTGCAACATGTGGATACAGTAATATGATCATACC 

CTCCCTTGTTCCCCTCTAGGCCACTTTCCCCTTTAAATATTAACACCATCATAATCATCT 

TTGGAGAAAGACACCTGGATCTGTCCTGAACCTTGGCAAAAAATAAATAAATAAATAAAA 

ATAAAAMCTCTTCTCAATTGATTAATACCTGTCACAGATACATTTTGGTTTACAAATCA 

ATGAACAATGGAGGGAACTCTGTCCTTAATCTTGGTACCT 

Sequence 2347 

CCGCGGTGGCGGCCGAGGTCTCTTGTTTGNTATTACACTTCTACGTAGATTATATAATAT 

TGCTTGTGGACATAATTTGATCAATAATATATAATGTCAGTCACTACAGTGATCCAGAAT 

CTTATTCTGGCTATGGAGGAAGCTTMTTATTAAGCMCATCTTCCTAAAAAGCTTNTGA 

ATTTCTGATTCATAAGAAAAACAAAACAAATGAAAAGAGTATCTNTAACTGAAATAACAC 

TGAAGTTCGAGCTTGGGCCCCCTCCTTTGTGTTCAACATAATTAACTNTTCAAGATGAAA 

CCGGACGCGTGGGTCGAAGCTTGACCTGCCCGGGCGGCCGGCCGCCCGGGCAGGTGTTTN 

TTCT GGGATATCTTTTTCTTCTGGGCAACCTCCTCTTCTGGTTTAGGAACAATCTGTTCC 

TTTTCCGNAAGGATCATCTCAATGTGGCAGGGAGAGCTCAT 

Sequence 2348 

ACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCTTCGACCCACGC 

GTCCGGGCAATTATCAAAMCACTTGGAAAMGATTTTTATTCCTACTTTTAAACATACA 

TCAAAATCTAAAATAAAACTAGGCACCTTCAGCTGGGCCTGGTGGCTCATGCCTGTAATC 

CCAGCACTTTGGGAGGCTGAAGTGGGCAGATCACTGGATGTCAGAAGTTCGAGACCAGCC 

TGGCCTACATGGCGAAACCCCCTNTCTACTAAAAATACAAAAATTAGCCGGGCTTGGTGG 

TGGGGACCTGTAATCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATCACTTGAACCTGGG 

AGGNGGAGGTTGCAGTTAGCTGAGATCACACCACTGGACTNCAGCCTGGGCCACAGAGCA 

GGACTCCATCTTAGAACAAAAAACAAACAAACAAACAAACCTCATGCACCTTCAAGAAAA 

TCAMCAAGTTTTTATCTMTTAAAGAAAGAATTTT 

Sequence 2349 

AACACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTCGACCCACGC 

GTNCGCTTTAACACACACTAGGCTCTTTGTGTATTATGATNCAGTGCTATTTGTAACTGT 

GTCCCAGNGACCAAATTGCACTCGACTCGATCAGCTGTTCATCCATTTCGTGI I I i I I CC 

TGTCAMCATTMTCCAGCMATATATGAGGTATTTACCMTTTATTTTCTTAGTATTAC 

AAAATAATTCATTAGCATAAAGTACCTGCCCG 

Sequence 2350 

CTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCGGGCAGGTAAATCATAGAGC 

TGCCCCAACATCTAGACAGTCTCTCCTACTGATTATAAATGAGTGAAAACTATCAGTTAG 

MAAATCTAATTTMGTTGTTAATACATGTTTCTTTGGTGAGCACCTGGATATATTTATC 

ACAAATTCTTTTATACAAATGTCGAAAATGCTTTCAACAAACCTAAGTGTCCTAATTACA 

TGCCACTTTTAAGCATCACTTTAAGGTMACAAAMTGMAACCATAATTTTAAATTAAA 

ATTTGCGGACGCGGGGGGTCGACTCAAGACCTCGGCCGCTCTAGAACTAG 

Sequence 2351 

CCGCGGTGGCGGCCGCCCGGGCAGGTGCTTCGACCCACGCGTCCGGATGGCTTGGGTCAT 
CAGGACGTCCATTACATCCAAAGGAAGACAGCCTGTGACGTTTCAAAAGCAAAAGTCCCC 
TACCAGC.CAGTGAAGCTACCTGATTTCTCAGTATCTTACGCCCAGNGACACGATCTACCC
TCAAAACTTAAAAAAAAAAAAAAGGGAAACATAAACACATAACAGCAGACCTN 
Sequence 2352 

GGGCCGNGGNCACAACATTCCCCCTTCCCCAAACAGTAATATGGACACTGATTTAACANG 

ACTTATAAAAAMTMGGCNCATTTATTTTGATNTGGTMTTTTAAAATAGAAACCCCTT 

C 

Sequence 2353 

GCGNGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCGTNC 

GAGCTTGAGTCGACCCACGCGTCCGTGAAAATGTTGTCTCCTTTCTAAATTCTCTGCCGA 

TTTGGGAAAMGCAAACTTGACTTTACCCCGAGGMTTGGTGAAAAATTACTTTTACGCC 

TTGCAGCTGTGGAACTTGGTCTTACAGCCTCTGCTCTTCTGCCCAAACGGGCCATGCAGT 

TTGGATCAAGAATTGCAAAAATGGAAAAAATTAATGAAAAGGCATCTGATAMTGTGGAC 

GGCTCCAMTCATGTCCTTAGAAAATCTTTCTATTGAAAAGGAGACTAAATTGTAATGTG 

ATTCACAATGTMCAATATAAAAATMGTTTTTATATAATTATTAAAAGGNAGATACTCT 

GGTGCTTTACTATTGGATAAMTAAGTAMCCTGCCCGGGCGGGCCGGCCGACTTTTTTT 

TTTTT 

Sequence 2354 

TCCCCGCGGTGGCGGCCGAGGTTTCAAAGACCNGCCTGGCCAACATGGTGAAACCCCATC 

TCTACTAAAAATATAAAAATCAGCCGGGCATGGTGGCATGTGCCTGTAATCCCAGCTACT 

CAGGAGTCTGAGGAGGAGAATCACTTGAACCTGGAGGCAGAGGTTGCAGTGAGTCGAGGT 

TGCGCTACTGCACTCCAGCCTGGACAACAGAGGGAGACTCTGTCTCAAAAAAAAAAAACC 

TACAGCTGTTCMGGACCAGCTGCAGGGNCAAGNGGGGGCCTTTTTTGGTCTTTGAACAC 

ATCATAGAAAGNGGNCAAATGCTGCAAAGCCATGAAGAACATGAACCTTTAACCGGGTAG 

ACTAACTGCC CCACTTANACNC I I I I 1 I I 1 GCCCCCAAAAACAAGNTTGATTTTGCCCCT 

n \ rT TIvrrCCATTGGGGGGAAGTTTTGGAAAAANAAAANGGGGCCNCCCCCCCCT 

minim INNNNCNAAAAANCCI I I I I I I I GGTTTTTTTAAAAAAAAAAA 
Sequence 2355 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCGGGCAGGGCCAGAGACTTC 

AAGTC TATC TGAAAAGTCTCCAGAGGTCTAACCCCAGATAAATAGCCAACAGGGTGTAGA 

GTACATTTTACACCCCAAAGAGTGTGCCCCATGGTGATGAAAATAAAGTGAACATGTTGC 

AAAATGAAAAAAAAAACCT 

Sequence 2356 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCCAACACAAACTATTCAATAAAGTAATCTGCT 

TTAAAAATAAAACACACTGAAAGGCCGAGGCAGGTGGATCACCTGACATCATTAGTTCAA 

GACCAGTGTGGCCAAACTGGTGAAAATTAGTCTCGACTAAAAATACAAACATTAGCTGGG 

CGTGGTGGCAGGCGCCTCTAATTCCAGCTACTCAGGAGGATGAGGCAGGAGAATCACTTG 

AAGCAAGGAGGTGGAAGTTGCAGTGAGCTGAGATCGTGCCATTGCACTGCAGCCTGGGCA 

ACAGAGTGAGACTCCGTCTCAAAAACACCACCACCAACAAAATAAACACAACAGAATTAT 

TCTGCAAATACAGATATTGGAGTAGCTGAGTTNCATCTCAAATTTGACTATGCAGGTTGC 

AGGGTGATCTTGGCCMCTACTTATTCTTTTNTGAAGTTCAACTTTTTT 

Sequence 2357 

CCGCGGTGGCGGCCCGAGCAAGTGGGCCTGTAGCCCGACTCTTAATCCAGGTTGGTGCTA 

TTCMAGAGATCATCTTTCACCCGAGGGATTTCTGGGCATCTATTTTGCGGATCAGAAAG 

TAGAGAAAGAAGGTAACTTTGCTGAAAGCTAGTCTGGGGAGTTAGTAGCTGATACAGATC 

AGCATTTCCTMCTATGAGATTTCATAATATTCTCTCTTGTCTCGATTCTGAGTCACTGG 

TGCCTGCTGTGGTGGCATTGTTCATGAACATGTACCTGCCCG 

Sequence 2358 

AGGTCATGTGCACATTGTGCAGGTTAGTTACATATGTATACATGTGCCATGCTGGTGCGC 
TGCACCCACTAACTCGTCATCTAGCATTAGGTATATCTCCCAATGCTATCCCTCCCGCCT 
CCTCCCACCCCACAACAGTCCCCAGAGGTGATATTCCCCTTCCTGTGTCCATGTGTTCTC 
ATTGTTCAATTCCCACCTATGAGTGAGAATATGCAGTGTTTGG I I I I I 1 GTTCTTGCGAT 
AGTTTACTGAGAATGATGATTrCCMTTTCATCCACGTCCCTACAAAGGACATGAACTTG

AGAATTCTTAATGCAGTGCTTTMTACAGTAAAAATTTTTAGTCTTTGTTTTCTACAA^ 

TGCAmGAAAAGTGCACTCTTGATCTTGNATTTTCTTTCCTTTCCTTTAGMTC 

AGCAGTTGGACGTATATATAAAATATTAAGTGTACCTGCCCGGGCCGGCCGGTCTAGAAC 

TAGGTGGGATC 

Sequence 2359 

AGGTGTACAAGCTTCGACCCACGCGTCCGAGCAGAACTTGGCAGCAACAGAGGAAGGGCC 

CCTGGAGCCGGCTGTCGTGGATGCCTTTAATCAAGCCTGGCATTTGGTTGCTCACGAATG 

TCCCAACTCTTCCGCTAGGCCCATCATGCTCAGGCTGCCCAAGGCTTTTCTGTCACCTCT 

TTTGTTCTCTCACACTGACCAGTCTTACCTGCCCGGGCGGCCGGCCGCCCGGGCAGGTAC 

CMGTGAATTTAMT MTTGGT GTGGATTGGCCAGTAGCTAAGTGGGCTTTTAAAGAGTA 

TTGMGATTGAAAGGGTTTTTCTTTCTTTTTTAAAAM 

GATMTGAAAAGCTAGGGTTrGCCCTCTTCATGTCTACTCTCCTTCCAAATAGTTATATC 
CAAAACTG I I I I I I CCTNTCCCCTACCTTGTCC 
Sequence 2360 

CGAGGTACACAGCTATGCACTTTCCGTTTCTGACTTTTGCCACCCTGTCAGCCATGGGGA 

GCCCACTGTGGGACTGAAACCCTGAGCTGAATGCGGCCTCATGTCTCAGAGAAACACTGG 

CAAGTTGGTCAGAGCCGCGTCTGCATCGAGGCGTAGCTGANCGGCAGGATGGGGGGCTGC 

CTGCCCAGGGTCTCTCACCGTGGTGTAAGCAGAGCCATGGCTNGCCTAGGACCCTATAGA 

TACCATCACTCTTTCTCAGCTCGACTGGAGTTTGCACCTTTGCAGGGGCAAAGTAACTCC 

CTGCACCCTGAACCACCCCCCATTCCTGTTCATTTCAGCAGATAATGATGGAGGGGGGGG 

Sequence 2361 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTCAAGCTTCGACCCACGCG 

TCCGCTATCATATTTCAAATCCTAATGGGAGACAAAACAAAAGTGATGGTCAGTATTTCT 

TTGAA ATTCTA CAAGGAACTTCAGGCACATGGTGCTGATGAGTTATTAAAGAGGGTGTAC 

GGGAGTTTTTGTAAATCCAGAATCAGGATACAATGTCTCTTTGCTATATGACCTTGAAAA 

TCTTCCGGCATCCMGGATTCCATTGTGCATCAAGCTGGCATGTTGAAGCGAAATTGTTT 

TGCCTCTGTCTTTGAAAAATACTTCCAATTCCAAGAAGAGGGCAAGGAAGGAGAGAACAG 

GGCAGTTATCCATTATAGGGATGATGAGACCATGTATGTTGAGTCTAAAAAGGACAGAGT 

CACAGTAGTCTTCAGCACAGTGTTTAAGGATGACGACGATGTGGTCATTGGAAAGGTGTT 

CATGCAGGAGTTCAAAGAAGGACGCAGAGCCAGCCACACAGCCCCACANGTCCTNTTTAG 

CCACACCTGCCCCGGGGGGCCGNTTGAGGGCCNCCCGGCAGGGNNNGTNNTAACANNGTA 

ACATGTATTATGGNAACCTTTAATCTTGGGGGNCTTANACAGGCTNNTNCCAANGGNTTA 

ANNAMGNTTGNTTTNAAAAAAAAAAAMGGGATGGGTNTTTTTNAAAAAAAAAAAAAAA 

Sequence 2362 

CCGCGGTGGCGGCCGCCGGGCAGGTACAGCCTCACATACACAGATGCAGGTGAAGTCACC 

AAAGCTGATCTCT CATTC GTTCTGGGGACAGTTAGCAGCGTAGTGGTCCCACTGCAGCAA 

AAGTTTGAMTTCATTTTCTTCAGGAAAATACCCAGCCAGTCTCTCAGTGGAAACCCTGG 

TTATGTCGTGGGGCTCCCATTAGCTGCTGGATTCCAGCCTCATAAGGGTGGAGCTCTCCC 

GTGTCAGC TCGT AGCACAGAAGGTGAAGAGCCTGCTGTGGGGCCAGTGCTTCCAGATTAC 

GTGGCCCCTTTTGGAAATTCCCAGGCCCAGGACATGCTGGACTGGGTGCCCATCCACTTC 

ATCACCCAGTCATTCAACAGGAAGGATTCCTGCCAGCTCCCAGGGGCTTTGGTTATAGAA 

GTGAAGTGGACTAAATACGGATCCCTGCTGAATCCACAGGCCAAAATAGTCAATGTAACT 

GCMATCTMTTTCATCCTTCTTTNCTGAGGCCAACTCAGGAAATGAAAGGGCGATTCTT 

ATTTNCACTGGNGGTACNTTTGNGGATGTGTCTGCACCTGCAGANGCAGGCTTTAAAGCT 

TCANCAGNCATTAATGGCANGCTGGCCNTTAACTTT 

Sequence 2363 

TAGGGCGAATTGGAGCTCC CCGCGGTGGCGGCCCGCACTTTNl M I I I I ll I I M I I I I I 
IIMMIII || I I I Mill M IMH I H I IIMl l l l lllMl I MM I IH I IN 11 1 
I I I I I I I I I I I 1 1 I I I I I I I II I I I I I I I I I I I I I I I NNAAAAAAAANTTTNAANGGGGG

AAAAAAANNGGGNAAAAANTTNNGGGGNNGGGNTTNAAAAAAAAAAAANCCCCNNNNCNN 

AAAAAAAAANNGGNCCCCCCGGGGGGGGNNAANNTNNAAAAAANTTTNTNCCCCCCCCCC 

CCCCGGGGGGGGGNCCCCCCCCCNNNTTTTTTNNNNNTTNANNGGGGGGAAAANNNNCCC 

NGGGGGAAAAAAAAANNAAAAAATTTTTTCCGGGGGAAAAMNNTTTCCCCCCCAAAAAA 

TCCCCAAAAAAAANNGGNNGGGGGNAAAAAAAAANAAAACNGGGGGGNCCCAAAGGGGGG 

CCA NCCN AMNTTTGGGGGGGGNNNNNACGCCCCTTTTTAAANNNANAAAACCNTNNGCC 

CCNTTTTAAAMMMCCCCCCCCCCCCGGGGGNGGGGGGTTTCCNAAAGGGCCCCCTTC 

CCCTTrCAMMAAAAAMAMGGGGGAGNGNNNNGGGGGGGGGGGGGGGGTTTTTTTCC 

MNANAAGGGGGGGNTNTANTNTTACCAAAAMMNGGGGGAACCCNCAAAAAAAATTTT 

TNNAAAAAGGCCCNAANGGCCCAANCCNAAAAAAGGGCNGGGGGNGGGGGTTTAANAAAA 

GNCCCCCCCCCCGNGANAAAAAAAA 

Sequence 2364 

CCGCGGTGGCGGCCCGAGGTCTTCGACCCACGCGTCCGCACTGAGTGTTCACCATAATTT 

GAGATTCTTGGCATGTTAACTTTTCATTATGGAATATTGAATMTTTCAATATTATTCAT 

ACATTTCTTTATGTTCAAACATACACAAAAATAGAATAATGAACTCTACCCATCACCCAG 

CTG CAACA AATATCMTACTTTACCGTTCTTMTACATCTMCCCCTTACTTTTTGTTTG 

TTTCTTTTGGTG AAGTATTTAATTGTAA I Illllll I AAG AGACAGGATCTCACTCTGTC 

ACCCAGGCCAGAGTGCAGTGGTACCTGCCCG 

Sequence 2365 

CGCACI I I I I I I I I I I I I I I I I I I AATAACAAACACTTATCCAACACTTAGTATGTGGCA 
GGCACTGTTTCAAGCACTTTACACATACAAACTCATCCCGGACGCNTGGGTNNAAGCTNG 
TNCACCNA 
Sequence 2366 

CCGCGGTGGCGGCCCGAGGTCAAGCTTCGACCCACGCGTCCGGATTATTTCAGCTCTTGA 

CCTGTCCCCTCTGGCTGCCTCTGAGTCTGAATCTCCCAAAGAGAGAAACCAATTTGTAAG 

AGGACTGGATTGCAGMGACTCGGGGACAACATTTGTCCAAGATCTTAAATGTTATA7TG 

ATAACCATGCTCAGCAATGAGCTATTAGATTCATTTTGGGAAATCTCCATAATTTCAATT 

TGTAMCTTTGTTAAGACCTGTCTACATTGTTATATGTGTGTGACTTGAGTAATGTTATC 

MCGTTTTTGTAAATATTTACTATGTT^ 

TGCCCGGGCGGCCGGCCGCCCGGGCAGGTACCCTAATAAAGGCAGCAAAATGCATTAATC 

CACTATGMTGGAGTTTTACATTTTAATTTATGCCTMTATTTATAMAGMTTTCATTC 

ATAGGCTACTCACAGTTGTTATCTGACGCTTACAGAAGTGGTAAACAACCAATTGCTAGT 

TCAAGTAGTTTCCTCATGACATCTAATGGTAAGCAAAAATTAGTATGCATATTTCAACAT 

CCCAGTNACCAATCTTTTTAAAATGGA 

Sequence 2367 

CGAGGTGTCMGCTTCGACCCACGCGTCCCGACTTTTTGTCTTAGACCCAGTTAGGGTCA 
CCTTACAGTGCAGGTGGAAAGAAAGCAGGACTGCTGAGAGGAGCTCAGGACCCATTTTCC 
AGGACTATTGCTTCTCAAAACTTTGGAGAGCAGGAAAATAGATTCCCAAGTGAAAGAGGT 
G GCAG AANTAAAAA AAAAAAAA AAAAA A AAGT 
Sequence 2368 

CGCCCGGCAGGTACACAGTTCTGACTGCMTACCTTTTTCAGACTGCAAAGGGAGCTCAG 

GATCCAGAAGTCATTAAAAGAACGGACGCGTGGGTCGAAGCTTGTACCT 

Sequence 2369 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTCAAGCTTCGACCCACGCGT 

CCGCAGAAATACTGAMGACTTTTGCCTAAAGTGGCATTATTGACTGCTGGTGTGATGCT 

ACTGTAATGTGATAAATTATTAAATTGTTGCAAAGTGCAAAAAAAAAAAAAAAAAAAAGA 

CCTGCCGCGGC 

Sequence 2370 

CGCACI I I I I I I I I I II I I I 1 I I I I I I I I I I I I I I I I I 1 AAAAAAAGCCTCATTATCCTG 
TAGTCCATTTTGGAAAGTAAAGCCCAAGAAAGCAAAAGATGAAGGTTCTAAAGCTAGTTT 
GACTGACCTCAGAGTCCTCGGCCGCTCTAG
Sequence 2371 

TTCGTGATAACTTCTCCTAAGTGCCAGGCATTGTATTACATGCTGNGAGCACANAGATGA 

ATAATANCAATAGGTTCACANAAAAGATGAATTGATTGAGAGAAAAAGA 

Sequence 2372 

CGAGGTACCCAAAAGATATCACTGTGAAGGTTTGGATACACTGACTGAGGAAAAAAGAAG 

GTCCTGAAAGCGTCTAGACAAAAAAAGACTACTTGTAAGTTGCAAGAATCAGAATGGCAT 

TGGACTTCTCAGCTTTCCTCATTAGAAGTTAGATCTGAAGCAATCTTTAAACTCGTGAGG 

AAMTTMGTCTAATAAATMTTTTCTTCTTAGCCAAACAATCAAATGTGAAGCTAGAAT 

MGCATTTTCAGGTAAAAAAAAAAAAAAAAAAAGTGCGGCCGGCCGCCCGGGCAGGTACA 

TAATATACAGAGGTATAATCTGTAACATCAATAATGTAAAGTGGGGAAGGGCAAGGTGGA 

AAAGGAGTAGMTGCTTGTATGTGACTAAAATTATGTTGGTATCAGTTTAAAATATATTA 

TTATMCTTTAGAATGCTATACCCATTCCCACAGTAATTCCCATAGTAACCAAAAAGAAA 

ATATCTGTAGGATCACACAAAAGAAAATCAGAAGTAGATGCAAACTTGTCACTACAGGAA 

AAAAAAA 

Sequence 2373 

CGAGGTGTACAAGCTTCGACCCACGCGTCCGAAGAAGGCTCTCCTCTGTTCCAGGAGAAG 

GAAGGGACAGATGAGAAGTCACTTCAAGTTCCCAGAATACTCAGGAGCTGAACTTGTCAA 

GGTTTAGATGTGGCAAAGCAGGCCAGGCATGGTGACTCATGCATGTAATCCCAGCATTTT 

GGGAGGCCAAGGCAGGAGGATCACTTGAGCCCAGGAGTTtGAAAGCAGCCCGGGCAACAT 

AGTGAAACCTCATCTCTAGAAAAAATACAAAAAAATTAGCCAGGCGTGTTGGTGTACCTG 

CCCG 

Sequence 2374

CGCGGTGGCGGCCCGAGGTACATCTGCAAGCTTTAAAGCAGTAGGTTCAGACTTCCTGGA 

AGAACTGACACTTGAAGCTGACTAGGGNCTACTTGANCNCATACTCACTTTGGCTAAGCC 

ACAGTATGAGGGAAGGGTGTCAGGAATMCNCTTCCATTTTTATNTTGTTCATTCCCGAA 

AATCCAACAGGAGATTCTTTTCACTCCCTAAAATNAACTGNTCTGTGTATAAAGCATATC 

TGGATATCTTGATCTTAMTGGAAATGGTATNTGAAAANNGCNCCNACTTTTCTAAAACT 

TTAAAATTGGCCCTTTTTTATTTTTAGCCCTGGGGGAGGAGGGAGGGMTGATTCCCAAA 

AAACTGACTGGTTTCCTTGNTGCAGTGTATTMTTACTGGTGATTATTTTTTGGGGGGGG 

NAATTTAGCNGGAAAAAA I I I I I I I I I CAGGGTTAAAACGCGCCTNNTAAAAANTTGGGA 

AATGGNCGNAAAAGGGGGCCCTTTTTTTCCCCCGCCGNTTTGGGGGNAACCCCCGCCCGG 

TTTGGMNGGTNAACNCCCTTTNAAAAAANAAAAAACTTGNTTTCAAGG I I I I I I I U CC 

MAACCGAAAMTTTGGGMGGTCTTTTTATATAMNCTTGGAAACCCGCCCCCGGGGTC 

AAAACCCCCCGNNNGGCCGTTTTAAAAAAATGGTACCCCCGCCNTCCGGNTTTCANAATT 

AAACTTTTTTNCCCCCNNCNCTGGGGGGGGCC 

Sequence 2375 

CTACTATAGGGCGAATTGGGAGCTCCCCGCGGTGGGCGGCCCGAGGTCTTCGACCCACGC 

GTCCCGAGAATAGCTACTGAAGTCCTAAAGAGCAAGCCTAACTCAAGCCATTGGCACACA 

GGCATTAGACAGAAAGCTGGAAGTTGAAATGGTGGAGTCCAACTTGCCTGGCCAGCTTAA 

TGGTTCTGTCTGGTAACGTTTTTATCCATGGATGACTTGCTTGGGTAAGGACATGAAGAC 

AGTTCCTGTCATACCTTTTAAAGGTATGGAGAGTCGGCTTGACTACACTGTGTGGAGCAA

GTTTTAAAGAAGCAAAGGACTCAGAATTCATGATTGAAGAAATGCAGGCAGACCTGTTAT 

CCTAMCTAGGGTTTTTACCTGCCCGGGCGGCCGGCCGCCCGGGCAGGTACCCAACACAA 

ACTATTCAATAAAGTAATCTGCTTTAAAAATAAAACACACTGAAAGGCCCAGGCAGGTGG 

ATCACCTGACATCATTAGTTCAAGACCAGTGTGGCCAAACTGGTGAAAATTAGTCTCGAC 

TAAAAATACAAACATTAGCTGGGCGTGGTGGCAGGCGCCTNTAATTCACTACTCAGGAGG 

ATGAGGCAGGAGAATCACTTGAAGCCAGGAGGTGGAAATTGCAGTGAGCTNAAATCGTCC 

ATTGCACTTGAACCTGGCAACAAAATGGGGACTCCGTTNAAAAACCCCCCCCCA 

Sequence 2376 

CCGCGGTGGCGGCCGAGGTATCACGAGCGGCCGGCCGCCCGGGCAGGTCAATCATAGAGC 
TGCCCCAACATCTAGACAGTCTCTCCTACTGATTATAAATGAGTGAAAACTATCAGTTAG
AAAMTCT MTT TMGTTGTTAATACATGTTTCTTTGGTGAGCACCTGGATATATTTATC 
ACAMTT C7TTT ATACAMTGTCGAAAATGGCTTTCAACAAACCTAAGTGTCCTAATTAC 
ATGCCACTTTTMGCATCNNTTTMGGGTMCCAAAAATGGAMCCCTTATTTTNAATTA 
MATTTNGGGNCCCGGGGTTTTANTTAAANNACTNTGGCCNTTTTTNAAATNTGGGGNCC 
CCCCGNGNGGGGGNAMTTTTTTTATATAAA I I I I II I I I NCCCCCCCCCCCCCTGGGGG 
GGGNGCCCCCCCCCCCCNI I I I I I I I I I I I I I I I 1 1 I NNGGGGGGANNAAACTCNNCNCG 
GGGGATAMNANANGGATMTNTTTTTTTNNTTNGGAAATN I II I I I 1 1 I I N 
Sequence 2377 

GCGAATTGGAGCTCNCCGCGGTGGCGGCCGCACI I I 1 I I I 1 1 I I I I M I I 1 1 I II I AAAG 
CACTGGTGCTCTTCTTAAGTGAAATGTTAACACAGGGCTCAATACATAAAAGAGAAAGTG 
MGTTGTTCCATTTGGGGGGTCCCATAAGGGGCCTCATGTTTCCCTAGGTGTTACCCCTT 
CAGACACAGCACATGCCTACAAAGNGGACNCGTGGGGTCGAAGCACCTGCCCG 
Sequence 2378 

CTATAGGGCGAATTGGAGCTCCCCGCGGCGGCGGCCGAGGTCAAGCTTCGACCCACGCGT 

CCGGTCTATTTTGATTNTGGGGGGTNATCAGCATTATTCTTCAGMGGGGACCTGTTTTC 

TTCAAGGGAAGAAACACTCTTATTCCCAAACTACAGAATAATGTGTNAAACATGCTAAAT 

AGTTCTATCAGGAAAACAAANCACTGTNTTTATCTCCGNAGGCTATTTGNTCAGAGAGGC

CTTTTGNTTAAATATAAATGTTTAMTATAMTGTTTGTCTGGATTGGCTATAACATGTC 

TTTCAGCATTAGGCTTTTMGAAACACAGGGTNTTTGTATTCTTTACTAAAGATATCAGA 

GCTNTTAATGTTGNTTANATGAGGGNGANTGTNAAGTACCTGCCCGGGCGGCCGGCCGCC 

CGGNCAGGTCTTNGACCCACGCGTNCGGGCNATTATCAAAAACACTTGGAAAAAGATTTT 

TATTCCTACTTTTAAACATACATCAAAATCTAAAATAAAACTA 

Sequence 2379 

CCGGGCAGGTCTTCGACCCACGCGTCCGATTGAAGCCTCTCTGAAGTTAAACCCAACTAT 

GTTTATTAAAATGTGTGAAACTGAAAGTGGGCTAGGTTCTACCAAGGCTGTGGAACTCTC 

CTACGAGTTCTGCTGATCAGGAMTTTAAGMTTTATCTTAAAAATGCAAGGAAAAAAGA 

CTGCCTTGGCAATTGTGAATGCTGCTTTCAATCTCCTAGCACCGAGCCTGGCACTTAGGC 

AGCTTTCAGTAAGTGGGTGAATGAATGACTGAATGAATGAATGAATGGCTCAGCTGAGGA 

ATGTAACTTTGGTCAAGACCT 

Sequence 2380 

CCCCGGCGGGTTGGGCCGGGCCCCGCCCCCGGGGCCCGGGTTACCATTAATTATTACCAG 
GAGGGTTNTTAATTCTGGTTAACCATTCAATTAATTGNTNAAAAGGTGGGGGGNAANGGG 
CCAAAGGGTGGGGAA 
Sequence 2381 

TGGGA GCTCCA CCGCGGGTGGGGNCCCGGGGGGNCCCCNCCCGGGNNNNAAANAAANGGG 
GGGNNTTTTTTNNNNNMAMCCCCCTTTTTTNNNNNNCCCNN I I I I 1 I I I NGGGAGTTG 
GGGC CC CCTTG GAAAAGTTTGGGMCCCCCCCCCMNTTTTTCCCTTTAATTGGGAAAAT 
TT TTGGG I I I i 1MA ATTTTTAAAAAGGGGGTTNGGGCCCCCCAAAAAAAAAAAAAAAAA 
AAlin l AAMTTTTAAAATTTTAAAAAAAGGGGCCTTTTTGGGGGN I I II II 1 1 1 11 I I 
TTTGGGGAAAAAAAA 
Sequence 2382 

A AACi i \ I I ATTAMATGCTTANGANACAGATTGACTTTCTTTCGCAAATGACTGTTTTA 

CTTTTCCTGAAGNAGGACATATATGCACTCTGATAAAACTGCATTACAGCCTGCAGGACA 

CCTTGGGCCAGCTTGGTTTTACTCTAGATTTCACTGGCGTCCCACCCCACTTCTTCCACC

CCACi I I I l CCTTCACCAACATGCAAGTTCTTTCCTTCCCTGCCAGCCAGATAGATAGAC 

ACGGACGCGTGGGTCNAAGCTTGTACCTGCCCGGGCGGNCGCTCTAGAACTAGAGGATCC 

C 

Sequence 2383 

AGGTGCGGCCGGCCGCCCGGGCAGGTACAAGCTTCGACCCACGCGTCCGCACAAACATTT 
TTTCAATGTAGCAAAATCAAACTTAAAAAAAAAAAAAGAAGAAAAGAAGAAGATGCCGAC 
AGTGCGGAGTCTAGCCTTTTGTAAACTTCATATTGCACACTAGGACTATAAGCCATTGCT
AGCTCATTTTGMTTTTMCGTGTMTTTTTGTTTTATTT^ 

GCTTGATCCACCAATGCTCTTTTTMTGTTTTATAACTATGTATGTGTATATATATAAAT 
ATCAAATAATATGTATGCACATATGTGTGTGTATATATCTATATGTATATACATA 
Sequence 2384 

TGGAGCTCCACCGCGGTGGCCCCGGGCAGGTCGAGCGACCGCAC 1 I 1 I 11 I I I I 1 I 1 1 I I 

TTTACCfGAAAATGCTTATTCTAGCTTCACATTTGATTGTTTGGCTMGMGAAAATTAT 

TTATTAGACTTMTTTTCCTCACGAGTTTAAAGATTGCTTCAGATCTTAAACTTCTAATG 

AGGAAAGCTGAGAAGTCCAATGCCATTCTGATTCTTGCAACTTACAAGTAGTC I MUM 

GTCTAGACGCTTTCAGGACCTTCTTTTTTCCTCAGTCAGTGTATCCAAACCTTCACAGTG 

ATATCTTTTGGGTACCTCGGCCGCTCTAGAACTAGTGGGATCCCCCGGGCTGCAGG 

Sequence 2385 

AGGTACTCATAGCATTTITCCCCACATAGTCTTTCAAMTCTGCATTTATTTCAAAATCT 

GACCTTCATMCTCMCTATACATGMTTGCTGGTATTGTCTTTTAACTTGGCCAAAGAA

CAGTTTTCTGAGTTAGCTATTATTTCCACCATAAAATTGGGGGTAAGATTTGGCAAAAAA 

AAAAAAAAAAAAAGTGCGGCCGCTCTAGAACTAGTG 

Sequence 2386 

AGGTACTACCCATTTTAATTACACAGTAAACAGAAGCACGGGTAAGTGACATACTCATAC 
TTTAAGCAATAAGAATTAGAAGAAACCATAGAAGCTTGGGGCCTTCTCTCTAGCTCTAAC 
CCAAAGAAA ATGAATTTTA I M M M 1 1 M i AAAAG AAAACAGCATCAATCACTTAAGAT 
TTTCTTCCTC I M I M I M M M M I I ACACTTGCTTATTAGTATAGNATCTCGTTCCAA 
AGCCCGGACGCGTGGGTCGAAGCTTGTACACCTGCCCG 
Sequence 2387 

AATTGGAGCCTCCACCCGCGGTGGCGGCCGCCCGGGCAGGTCAAGCTTCGACCCACGCGT 

CCGTCTTATTTTTTACTCTTAGGCAATGCGGAATATCAATCCCAGCACAGCAAATTCTCC 

MAATGTCAGGTAGGCTCTTATCTGATGTTTTAGCACTGGAAAAAAAAAAAAAAAAAAAA 

AAGTGCGGGCCGACCT 

Sequence 2388 

CGGCCGAGGTCTTCGACCCACGCGTCCGAAATGACTGGTTATTTAGAAAAGAAGGATGTT 

TAGAATAAAACAGGAAGTCCAAACATGTCATAAGTGGTTTGTGTATGTCATAATAAGGGA 

TTATAAAAAGAGGATTTATGTGAAAAAMTTTTATGTGATCAAGTTGTCTACAATTACAA 

GGAMTTATTTATAATAGACGTTCTAGAGATCTATATTAAAAAAAAAAAAAAAAAAAAAA 

CCTGCCCGGG 

Sequence 2389 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCAAGCTTCGACCC 

ACGCGTCCGTCTGMCCTACACCCCGTCTCTTCACGGTTTAGACTTACTAAAATAAATAC

AAGGTGATTTTCATCTTCAGGTAGAGTGAAGCCTTTTAATTAAGGCGTCACAGGTGCAGC 

TATTCTACCTTAATGAMTGGGTAGTGATTTTCCCACCATTATTTATTTCGGTGATAATA 

TGCTGCATATTCMGTCTCTTGTAGTTATmCACCCAMGTAGTTGACMTTTGATGCT 

TCTGGTGATGTTTATGGCTTCATTTTATGTAATTTTTTAAGTAAGTTCCACTAGAAACAG 

TTCATCTTATACCTTCAAAA 

Sequence 2390 

AGGTACTATAAGAACACATTMTTCAATGGAMTACACTTTGCTAATATTTTAATGGTAT 

AGATCTGCTMTGAATTCTCTTAAMACATACTGTATTCTGTTGCTGTGTGTTTCATTTT 

AAATTGAGCATTMGGGMTGCAGCATTTAMTCAGAACTCTGCCMTGCTTTTATCTAG 

AGGCGTGTTTGCCATTTTTGTCTTATATGAAATTTCTGNCCCAAGAAAGGCAGGATTACA

TCM M n I M I M I AGCAGTTTGAGTTGGNGTAGGGGTATTCTTGGGTTATCAGAATAC 

TCATATAGCTTTGGGATTTTTGA 

Sequence 2391 

GTTCGTATTTCTTATTCTACAACAAGGGTCAGCCTACAGGCAAAACACATCCCATTGTCA
I I I i i i i GTAMTAMGTTGTATTGGAACATGGCCACTCTCATTTGNTTTCTATTATTTA 
TGGCTGCTTTCACTTACAACCTGAGTGGNTGCCACAGAAACTGTATGGGCCTGCAAAGTC
TAAMTATTTACTATGTAGCTTTTTCTTTTCTTTTT 

CCAGGCTGGGAGTGCGGTGGTGTGATCATGGGCTCATTGCAGCCTCAAAACTCCTGGGCT 

NAAGCAATCCTCCCGCCTCGGTCTCCCAAGTAGTTGGGACTACAGGCATGA 

Sequence 2392 

CCGGGCAGGTACCCAGTAATCACATAAATTCTGCMTCATCTGTTTATTTAGCTTAACTG
I I I I I I I I I ATTTGNTGAAGNTGTTCGTTGTTATTTCAGNCTTTTTCTTATTGGG 
Sequence 2393 

ACCGCGGTGGCGGNCGCCCGGNCAGGTCTAGCTTAGTCGACCCACGCGTCCGGGCTTAAC 
TAATATTTGNNTGNGTGCTACTAACAGGATTATAATAAATTTGTCATCAGTGAAAAAAAA 
AAAAAAAAAAAAAAGTGCGGNCCGNTCTANNAACTAGTGGGAT 
Sequence 2394 

CCGGGCAGGTCAAGCTTCGACCCACGCGTCCGATTTCAGGTTGACTTTTCTCACCTTTAA 
CCTCTTTATATAGCACAGTGCAATCTGGCCCTACTGCCACTTCATCTGGGTTATCTGTAG 
CTTGAGTTGTAAAAAAAAAAAAAAAAAGT 
Sequence 2395 

AGGTACTGTTGTCTCATGTAATGCTAAAACTGAAATGGTCCGTGTTTGCATTGTTAAAAA 

TGATGTGTGAAATAGAATGAGTGCTATGGTGTTGAAAACTGCAGTGTCCGTTATGAGTGC 

CAAAMTCTGTCTTGMGGCAGCTACACTTTGMGTGGTCTTTGMTACTTTTAATAAAT

TTATTTTGATAAATAATATTGAAAAAAAAAAAAAAAAAAGTGCGGCCTCGAGCGGCCGCC 

CGGGCAGGTACAGGCACCTATAGAATTTAMGGGGAGATTTCTTTATTTTGTATTCAATG 

TATTMTAAGATTTTTAAAACATATTTTGGAGAAATTGCTAAT 

Sequence 2396 

AGGTCCTAGCTTGAGTCGACCCACGCGTCCGATTTTTGCCTCCAGACTACAGATCAGAAA 
ACTGAGACTCAGAATGTTTCMTTCCTTGTTTAAGATCACAAAACTAGTTTGAGGTATAA 
TGGAAACTGAAAAAAAAAANAAAAAAANGT 
Sequence 2397 

AGGTCAAGCTTCGACCCACGCGTNCGGAAGTNTTCATTCTCCCT CI HI 11 1 111 I i U 1 

TTAGCAATTCAAGNCATGTTTTTGNCTACMGTTTTTCCAGTATTGTATAGATAAATAAT 

MTTTACNAGGCTGCCTTTGAGTATACTTAGACAAGAGACCTGCCCGGGCGGCCGNTCTA 

GNACTNGGTGGANCCCCCGGGCTGCCAGGNATTTCAATATNAAGNCTTATTNGTTACCGN 

GCGACCTACGAGGGGGGGG 

Sequence 2398 

CGGCCGCCCGGGCAGGTATCAAGTGCTTGGATTCTGAACTGNCAAAAGAAAACTGCACTT 
GCCCTCTGAAGTAAAAACCGAAATGAGNTTTCTTAGGCAAATGTATTCATCAGCCCAGAT 
AAAAAAAAAAACCANNTAATGNGAGCCNTTAGTCACTGCT 
Sequence 2399 

AGGTACAAGCTTCGACCCACGCGTCCGATACGACTCACTATAGGGATCTACCTGCTTGAG 

TCGACCCACGCGTCCGAACACATACAAAAGAATTAAACCCACAAGCTGCCTCTGACAGCA 

GCCTGTGAGGGAGTGCAGAACACCTGGCCGGGTCACCCTGTGACCCTCTCACTTTGGTTG 

GMCTTTAGGGGGTGGGAGGGGGCGTTGGATTTAAAAATGCCAAAACTTACCTATAAATT 

AAGAAGAG I I I I I ATTACA 

Sequence 2400 

AGG TGGCCG CAC I I I I It I I I I II II I 1 I I II I AAAGTTTGGGGTCTGTCAGGAGACAGA 
GGCI I I 1 I IGAATTCACTGTGAAGAGAAGAACCCGAACCTTAAGACGGCAGATCCCTGAG 
AGTCTTTCTGGCTGGTTTGAGCGGACGCGTGGGTCGACACCTGCCCG 
Sequence 2401 

AGGTACTTCAAAGTTATTTGCACATACACTTGTTTACTTTGNATGTTTTGCAGGATTAAA
CTTTGTATAATCTTTTTGCAAAAI I I I I I I I I CAGTATGCAANGCTTGCAAGATGAAAAT 
TAAAACC 
Sequence 2402 
ACCGCGGTGGCGGCCGAGGTACAAGCTTCNNNCNACTCGTCCGAGCTTGAGTCGACCCAC
GCGTCCGCGTTNATGATTTTTAAMCACACTTGAMTAAAMTGATTGAACTAAATTTTG 
GTCCGGNGACATCATTNTGCACTGCATAGCCCA 
Sequence 2403 

CCGGGCAGGTACATCTCTATCAGAGCTTTTGGGTGACCAGGCACACTGTCAATGAGCAGT 
AATA CTTTGAAAG GAATCT TTGGGG I I I < I II I GGTTTGGTTTTTGTTTTGTTGTTATTT 
TTTTGTTNGTTTGATTTrrGTTTTTGNTTTTTTAGC^ 

AATACTCAGTAMCCATGCTGTTAACAGATAAGCTGNCATCCAGACTTTGNTGTTCCATT 
TCTAGAGCACAGAGCAGATTTAGCACAATTGTTAANGCTTTATGATTTTTCANAATGGTA 
AANTGAGCACTGGCTTCAA 
Sequence 2404 

Nun i i GGGGGGGGGAAAACCCCNNACCCCCNCCANANAGNCANGNAANAAGGG^^TTT 

TNACANNNNAGGGGGGGGCANCCCCCCAAAAAAACNNNCNANGCANGAAGNANANNNAAA 

CAAAACANGNANGNAAGNNNNCACGNGCTTTTTTTAAMNATTTTTTTNNNNGGGGGGGG 

GGCCCCCCCAAAAAAAANGAGGACGNNGGAGCCCCCCC 

Sequence 2405 

CTNCTATAG GGCCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGG I I II I M I I 
I I I I 1 1 I I ICTTTCTGTTCCTTGGACTAGATAATCTGAMTCMCTGTCTTCAGTTTTGC 
AGACTCTTGTGCCAGCTAAAATGTTCTGTTGAGCCCCAGMGCTAATTTTCTTTTCAGTT
ATTATGATTTTCAGCTTTAGAATTTATTTTTTTMTATM 

TCTCCATTGGGNGAGACATTCACATACTTTCTTCCAG I I Mill I AGACGTAGTTTCCTT 
GAGTNCTTNGAGCATATTTAAAATAGTAGATTTAAAGGATTTGGCTAGTTACTCCACCNT 
CTGAGNTTCCTCAGGGAAA 
Sequence 2406 

CCGGGCAGGTCAAGTCGACCCACGCGTCCGCTTGTTTTGCTCTATCCCATAGGAGTTGGT 
ATAGTTGTGTTTCCAATATCATTTACTAAMGMNANTNTCCTTTTTATTTCTTCATTGA 
CCAACT 
Sequence 2407 

AC I I I II I I I I I I I n I I I I I I AGTCTTTCTACTTTNAGGGTTTATTNACTGGCAGTTTT 

CAGGTCTGNAAATACAAAAAGGAACATAGACATNGGGTNTCTGCCNTTGAGGNGCTCCNA

GTTTTGGNGAGGGACACAACACNTTTCAGGGGACAACCGTGGTACCTNGGCCGGTNTAAA 

ACTAGTnnnnnnnnnnnnnnnnnnnnnnnnTCNATATCAAGCTTATCCGATACCGTTCTA 

CCTNTGAGGGGGGGCCCGGCACCCA 

Sequence 2408 

AGGTCAAGCTTCGACCCACGCGTCCGGCCTGGGACAGATCCTTAGTCTTTCCTTGACTTT 

TATAGACCCAGAGGTGAMGGCCAAATGTTTTGTAGMTGTCCTTAAACTTGGGTTTATC 

TGAGGTTTCCTTGTGATTGAATTCAGGTTAGACATCTTTGATGGGACTGTCATAGAACTG 

ATGCTGTGTTCTMTTGCATCTTATCAGGTGACTTATGATTTCTGTTTGNCCCATTATTG 

ATGCTGNTACTTTAGATCACTTGATTAAGGNGGTGTCTCGCCTGGCTTCTCCAGTGTGAA 

ATTTCCI I I I rCTTCT 

Sequence 2409 

AGGTAATTTTTCTGCTTGCCCAAGAAACAAAGCTTCTGTGGAACCATGGAAGAAGATGAA 

AATGAGACTGGCAAAGAACAAATGCTGAATCTGAAGAAGATTTGGGCAAATAATCTGCAT 

ACTTTTAATTGGGAATAAGATGGAAAATATGAATGCTAMTCAAATTTTTTAAAAAAATA 

CACCCGGACGCGTGACCTGCCCGGGCGGCCGGCCGCAC I i ll I I M I I M I I I I I ICAAT 

ATTATTTATCAAMTAAATTTATTAAAAGTATTCAAAGACCACTTTCAAAGTGTAGCCTG 

CCTTCAAGACAGATTTTTGGCACTC 

Sequence 2410 

AC i i i i i i i i I I I I I I I I I I I I I I I I I I I I I I NGGNGGGGAGGAAGGCAAGGCACTGGGA 

AAGACTNACAACANATATTGACCCTAGTCGTAAAGAAATCCATAATTGCCAGTAACACGA 

CNTATTTMGMCAGGAAAAGACNGACAAGGAAMGAGGGACTTTTTTTAAAAAACATTA 







Ctaaagaattnggacataanaagagtgaaattgacaaggaaaggaggaggggga 

Sequence 2411 

GAGCTCCACCGCGGTGGCTGGCCGCCCGGGCAGGTACATAATATACAGAGGTATAATCTG 

TAACATCAATAATGTAAAGTGGGGAAGGGCAAGGTGGAAAAGGAGTAGAATGCTTGTATG 

TGACTAAAATTATGTTGGTATCAGTTTAAMTATATTATTATAACTTTAGAATGCTATAC 

CCATTCCCACAGTAATTCCCATAGNAACCAAAAAGAAAATATCTTGTNGNATACACACAA 

AAGAAAATCAGAAGTAGATGCAAACTTGTCACTACAGGAAAAAAAAAGCTATCAAAATAG 

AAAACAATNATGGNGAAAAATAAAGACA 

Sequence 2412 

CCGGGCAGGTGCCGCAC I I I I I I I I I I I II I I I 1 1 I I I I 1 1 I I I I I ATGGAAAAAATATT 
GTGATTATTTTTMTAGATTTACGGTATAAMGAAAAACTTTTATGATNCMTTTCGACA 
GACTACTTTCATAAAAMTTTTACTNTACATACMTGTATTGCAMTTTTTNGGCAAC 
Sequence 2413 

NGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCCGATTTT 

GTGAGAATGATTGTTCTTTCACTTGGGCTGTTTGAGAGCATAATTATGGTAATCATGAGA 

TTAATGTTTCATGATTTCTACCTCCAAAGTGTGMGACAAGTNAMCAATGNTTCTAAAT

TGTCTTATTTTGTTGGCGGAGMGATTACAATGGGCTATTAGTGCTACATTTGGTCAAAT 

GTAATCACTTAAATAGCTTCTTGTCACCTTAACTAAAGCAGAATAAAAAAACCCTGCCCC 

GGGGCGGCCGGNCCCGCCCCGGGCAGGTACCATTCCCGACGTTTGCAATGGTGGGAGTTG 

GCAGGTGTG 

Sequence 2414 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAGCTTCGACCCACGCG 

TCCGCTTCTCTTAGMTTTTGGGGAMTTGATAGTCCAGTGACTCCTACCCACTTTTGGG 

TGMGGACGATTTGGAATTTTGAAGTGTGGGGAGACAGGCCTGTGAAGTCCGAANGACTC 

ACTTGGGGT 

Sequence 2415 

CCGGGCAGGTCAAGCTTCGACCCACGCGTCCGAAAAAAAAAAAAAAAAGTTTGTTGGAGA 

CCGACTGGGGGTGAGGGGCTGGCAGCAGGAGACAGATAACANGTTCNCTCAGAATGCAGA 

GT 

Sequence 2416 

ANGI 1 1 I I II I I I I I I I I ICANTGCTTCCCAAAGCTGCGGACATAAGGGTAGCTGANCTG 

GACTCTGNCCTTGCTGAAGACTTGGAGATGTCTGAAGTCATAACTGGGGNGACCTCCTTG 

GTCCAAAATGTGCTGCCCCCTATGAATTCCACATCAGGAGTTGTAGGAGATGAGGTTAGA 

GGGATGCTTGGGCTCATCTGGCTTCCTTCAGCATAGCCCGGACGCGTGGGTCGAAGCTTG 

ACCTGCCCGGGCGGCCGGCCGCACI I I I I I I I I I I I I I I I I I I I I I 

Sequence 2417 

CCGGGCAGGTTTTCTTATGAGTGGGAGGTGACTGATCGTGGAGGTGGATTTCTTATGAGT 

GGGAGGTGACTGATCGTGGAGGTGGATTTCTTATGAGTGGGAGGTGACTGATCGTGGAGG 

TGGATTTCTTATGATTGGCTTATCACCATCCCTCCTTGGTGCTGTTTTTGCAACAGTGAG 

TGATTTCTTGTGAGATCCGGTTGTTTAAATCCAGAGGCACCTNCCCCTACCCTCTAGCTC 

CCATTCCTGCCATGTMGACACCTGCTCCCCCTTTTTCTTACCCCATGATTGGAAGCTTT 

CTGAGGCCTCCCCAGAAGCTGATGCCAGCCCTATGCTTNCT 

Sequence 2418 

CCGGGCAGGTCTTCGACCCACGCGTCCGCACATTTTGATGGTCAGTCAATAACTTAAGCA 

GNTACCAAAATACTAGGTATCCAAGGAGCGAGAGGTGGGCCGAGCATAAGAAACACATTT 

CTNATGGCACAGCTCTGCCAMGCCCTGCAGAATCATTTACACATAGGTCTTTGGTTAGT 

AGCCCCTGGCACAGAANTCTGATCTTAAACAAATATTGTCTATAATCAAGTAGAGCAATG 

CAATTMAAAMAAAAAAGCACAGGNTTTTGGGGCCATNGCTGAAATCCCAGCCTTGCTA

TTTTGCTTGGCTGNGTGACCGGGGTTTCCT 

Sequence 2419 

AGGTGTACAAGCTTCGACCCACGCGTCCGGGATGAGTTTGTATGTGTAAAGTGCTTGAAA 
CAGTGCCTGCCACATACTMGTGTTGGATAAGTGTTTGTTATTAAAAAAAAAAAAAAAAA

AAAGTTGCGGCNCGGCCGCCCGGNCAGGTACTGGCGTGGATTCTGCATANTGGNGATCAC 

ACGTTCCACCTCATCCTCAGTGAGTTCTCCCGCCCTCTTGGAGAGGTCAATGTATGCTTT 

CNCTCAACACCACATGAGCATATCTTCGGCCCACACCCTTTAATGGGCAGTGATGGCAAA 

GGCTATTTTCCGCCG 

Sequence 2420 

AGGTATTCAACAAGGGCCCTGAGAGAGGGACAGGCAGCCCCTGTGAATCTTGCTGTTCAG 

CANAGACAGGANTCAGCACGTGTGAGGGCAGCAGGGAAGTCTTCCTGGAGGAGTGAGACC 

TGGCGATGAGGAGGCACGGCAGGGAGGTGGAACAGGCAGGAGAGACTCTTCANGAATTGA 

GGAGATAGAATAGAGGACACTAAAGCCTTAGAGAGGCCAGGGGTGGTGGCTTGGCAGGAN 

CATCGCTTGAGGCTAGGAGTTTAAAAGCAAGCCTGGGCAACATATCCGAGACCCCCATCT 

CTAAACACAAAAAATAAAAAA 

Sequence 2421 

CCGGGCAGGTACCCAGTAATCACATAAATTCTGCMTCATCTGTTTATTTAGCTTAACTG
I I I I 1 1 1 I ATTNGTTGAAGNTGTTGTTGTNATTTCAGTCTTTTTTCTTATTGGGGTTGAC 
CAGACTTGGGTAAAATCTGTAAGAAAAGTNCCATAATTATGGGGGGAAGATTTCCTCTTG 
AATTGGCTTAAATNCCTGTTAGCTGAAAAAAAAAAAAAAAAAA 
Sequence 2422 

CCGGGCAGGTCTTNGACCCACGCGTCCGAGCAAAATTCAACTAAAAATACAATCTGGATT 

CCATAGCCMGGGTTTTATTTACMTNTCCTAGTAGGMGTCTTTATTTTAGCTTTCAAT 

GTGTTGAACTTATMGGAAATTTAACGTATACATGAGTATTATATTTATGGAATGTGAAG 

ATATACAGMTGGAAATGGAAAATMTGTTAATTCGTATTGACTTTGAGGAATCTTANAA 

TCATGTAGCCCTGTTGCAACAAGAAATAGGGAACTTCTGAA 

Sequence 2423 

AGGTCAAGCTTCGACCCACGCGTCCGGTTTGTTTTTTTCTTACGGCAACTCAAAGCAAAG 

AGCTGGAGGAGCCAGCCATTATAATTGCTTACTCTCATCGCTTAGCGCCCCAGGTGGGAT 

GTGTTTCCAAAACACATTmGTATTTATAAGGAMTGTAGTTAGGATTM 

CCTMTTAGMCTCACATTTTGGTTAAATCCTCMTTTCATTAAAAAAAAAAAAAAAAAA 

AAGTGCGGCCGGCCGCTCGACCTCGGC 

Sequence 2424 

AGGTGCTTCGACCCACGCGTCCGACTTAATTGAGAAGGTGGAATCCTCCTATCCCTGAAC 

TCGGGGGAATGGAATCTCGCTGATCTTCCAGGACTAGCTCCCTGATCATTCCAGCCCCTC 

TGAACAACAGGGCCCCTGGAGATAGAAGTAGTCCTATTTCACCCCAACTACAACATTAAT 

GGGAAAAAAGAAGCAGGMTTCCTGMTTTTATGACTATGACGTTGCCCTGATCAAGCTC 

AAGAATAAGCTGAAATATGGCCAGACTATCAGGCCCATTTGTCTCCCCTGCACCGAGGGA 

ACAACTCGAGCTTTGAGGCTTCCTCCAACTACCACTTGCCAGCA 

Sequence 2425 

AGGTACAACTTCTCTTTTTTGGAGTTTTAC 

TTT CCATGACGTT GGAGTTTGGCTCACTTCCAACAGGGAAAAGGAGTG I MUN I I GTT 

TTGTTTTGTTTTTCCTGCTTCTTGAATGGTAGAGAGCAGTCTATAGCCAGAGACTCGTCC 

CTAGGTAAGTAACTGAATTGGGGTTTGTCTTGGTTAMGTTAAGATTAACGACCAACTGG 

TCTTAATTTCTCCTTACCATTAGAGCACTCAGTTATCATATAMTTGCGCCATTGTTTGT 

TTTGCCTAA 

Sequence 2426 

AGGTCAACCGCCAGGGTCAAACGGAACACAACCCACTCTCAGGAAGACATCCCTAACACA 

AATCCAGGGACTTTGTTTCTTAACCTTAAAATTGGAAACACTTCTTGCTACCAGGGATGG 

GGGGTGGGGCTCAGCAGTTTGGGGAAACGGAGTGGGAGTCTTTTGCTGAACCGGACGCGT 

GGGTCGAAGCTGGACCTGCCCGGGCGGCCGGCCGCAC l l I I I H I I I I I I II I I I I I GAG 

AGTGGCTATTTCATTAANATTTAATAG I I I I I I I I GG ACTAAGTAGTGG AAAAACTTTTA 

TACTTAACTGAGACATTTTGTCAAGGCTAAAAAAAAGT 

Sequence 2427 
AGGTACATCCCACTATTTCTTTTCTTTTAGCTAGAMGGTATMCGTTAAMCCCTTTTA
CCAMTAAAATGATTTTATTTAGAAAATGCCGGGCACTAAAAAAAAAAAAAAAAAAAAAA 
AAGTAC 
Sequence 2428 

CCGCGGTGGCGGCCGAGGTACCTATTTTGTATATGTGAGATGTTTAAATAAATTGTGAAA 
AAMTGAMTAMGCATGTTTGGTTTTCCAAAAAAAAAAAAAAAAAAAAGTGCGGCCGGC 
CGCTACCTGCCCGGGCT 
Sequence 2429 

AGGACAATGCTGTAGATAATGCAGCCCATGCAATACACCCAAGAACACTAGAGTCCTACA 

CCCAAGTACAATATGATAAAGCAGCCCTCTGCAAGTGGTGCTGGATACCACTAAGAAGTC 

TACTGCAGCCATGTTGGTTATGATTTTCCATGCAGMGGGTACAGTTAGTTCATATTTAT 

GTATTGCACATAATCATGCTATTCAGCATTGATGCTATATTGTATTATGTAAATAATAAA 

AGCCATGTACAGAGGGAAAAAAAAAAAAAAAAAAAAA 

Sequence 2430 

CCGCGGTGGCGGCCGAGGTATTCGACCCACGCGTCCGTAGTTTTTATCTTTGACCAACCG 
AACATGACCAAAAACCAAAAGTGCATTCAACCTTACCAAAAAAAAAAAAAAAAAAAAAAA 
AGTGCGGCCGNTCTAGAACTAGTTGGATCCCCCGGGCTGGAGGAATTC 
Sequence 2431 

CTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTGAGTCGACCCACGCG 
TCCGAGTCAAGAGAACAGCACATTAGTTCCAGAAGAAAGATGGAAATTCTGAAAACTGAA 
TGTCAAGAAAAGGAGTCAAGAACAATTCACAGTATGAGAAGAAAAATGGAAAAAAAAACT 
TTATTTAAMAAGAAAAAAGTCCAGATTGTAGTTATACTTTTGCTTC 

Sequence 2432 

CTNCTATAGGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGCTGTTCGTATTTCTTATTCT 

ACMCAAGGGTCAGCCTACAGGCAAMCACATCCCATTGTCATTTTTTTGTAAATAAAGT 

TGTATTGGMCATGGCCACTCTCATTTGTTTTCTATTATTTATGGCTGCTTTCACTTACA 

ACCTGAGTGGTTGCCGCAGAAACTGTATGGCCTGCAAAGTCTAAAATATTTACTATGTAG

CTTTTTCTTTTCTTTTTGGAGACAGTGTGCCACTCTATTGCCCAGGCTGGAGTGCGGTGG 

TGTGATCATGGCTCATTGCAGCCTCAAACTCCTGGGCTCAAGCAATNCTCCCGCCTTGGT 

CTNCCAAGTAGTTNGGGACTACAGGGCATGAGCCCNCCATACCCCGGNTAATTTT 

Sequence 2433 

CCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCGTCCGGAAAATGTTTTCTTTTTTT 
AATTTMGGTTTAAAATTCCTTTGCCAAATCAAAAAAAAAAAAAAAAAAAAAAAAACGNT 
NGCNTGCCNGGCCGGCCGGCCGNNCTTAC I I I I 1 I I I I I I 1 1 I 
Sequence 2434 

CTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCTTGAGTCGACCCA 

CGCGTCCGATMAACACCGCCTTCAGTGTAMAATTTTACATGGTATCTGAACAACATTT 

ATCCAGAGGTGTATGTGCCAGACCTTAATCCTGTTATATCTGGATACGTGAGTATTTTCC

TGTTCCTTTTTTAATTMTATACTCCTTGCCACCAACCTTTATGTGGTTCTMGAAMTT 

GCTGA MTACm CTTTATTGCTTTTGAGATmACATTATAATCATTMCATTA 

TAMTTATTTTAAAAATATATAAATCACATGGATTAAMAACTTTTCATAACTTGAAAAT 

TTTCCTTTAGATTAAAAGCGTTGGTCAGCCTCTATGTCTGGGATGTTGGAGAAAACAATC 

AAGGAGGCMACCATTAATTATGTATACATGTCATGGACTTGGGGGAAACCA 

Sequence 2435 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCI I I I I M I I I < 1 I I I f I I 1 1 AATG 

GTTMGCTACTCAAATTGTTTTCAGAGCCAGAGTAAAAACCAAAAAAAATACTCATCAAT 

ATCMTTGCCAAATTCAGTCTGAAAACATTTCACACACAGCTTACCCAAGTATAAAGCTG 

CTGGGGGGACTTCTGAAAANTTGGCMCATTCATTNGGGGCTTNGAAATGCTTTACAAGG

GGAAGGNTi i i i I AANGCAGGGCTTACNTGGGTTTTCCCCCAAGGCCCCTGGNNANGTTT 

NCCNAAAATNAAAGGGGGGGGCCCCC.CCTNGNNGNGGGTTTTTTCCCCNNNCCNCCNNAN 
CATNANGGGGNGNGCCCCNCNCNNTTTTrNTTTTTTAMGGGGNAAANTTTTTNCGNGNN
TNTCAAAAAAGGGTTTTTTTAAMCCCCGGGGGGGTTT 

TTTTTAAAAAGGGGGGGGGGNNGNTGGGNNTTTTTNTGGNGGGAAAANNTNTNACACCCC 
NNTAAAAANGGGGTTTTTTNCCNNA 1 1 I I I I I I I I NCNNCCCCAAAANANGGGGGGGGGG 
GGGGGGG 
Sequence 2436 

CTNCTATAGGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGAGGTGTCATTTTATTTTTAC 
GTTGTTACGATATGGGGAGTAGTGTGATTGAGGTGGAGTAGATTAGGCGTAGGTAGAAGT 
AGAGGTTAAGGAGGGT 
Sequence 2437 

TNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGC 

GTCCGGCCTGGGACAGATCCTTAGTCTTTCCTTGACTTTTATGACCCAGAGGTGAAAGGC

CMATGTTTTGTAGMTGTCCTTAMCTTGGGTTTATCTGAGGTTTCCTTGTGATTGAAT 

TCAGGTTAGACATCTTTGATGGGACTGTCATAGAACTGATGCTGTGTTCTAATTGCATCT 

TATCAGGTGACTTATGATTTCTGTTTGTCCCATTATTGATGCTGTTACTTAGATCACTTG 

ATTAAGGTGGTGTCTGCCTGGCTTCTCCAGTGTGAMTTTCTTTTCTTCTTTGTAATATT 

TTGTGGGGGMGTMCTTTMGACTATGTAACTGTTTCATTTCTTTTCTTTTCTT^ 

Sequence 2438 

TACTATAGGGCNATTGGAGCTCCCCGCGGTGGCGGCCGAGGTGTACAAGCTTCGACCCAC 

GCGTCCGAAAGATTCTTGGCTGAGCATGGTGGCTCATGTCTGTCATCTCAGCAATTTGGG 

AGGCMGGGCAGGAGGATAGTTTGAGCCCATGAGTTTGAGACCAGCCTGGACAACATAGT 

GAGACCCCATCTCAGCAAAAAAAAAAAAAAAAAAAAAAAAAAAAANGTGCGGCCGGCCGC 

CCGGGCAGGTACATGACTATATCAGGATTTCAAATTGAGGAAACCATTGACCGCGAGACT 

TNTGGCAATTTANAGCAACT 

Sequence 2439 

CCGCGGTGGCGGCCGAGGTACCTATTAAGCTCATGAACCATAGAGGTATCTCGGTGGCCC 

CTCATTACCATCTGCTGTTCTTTCAGCTGTTTAGCTACATCTTTGGCTGAGGAACCAGAC 

ACTTCAATCCATGTCTTAGAGAAGAATGCACATGACCCCAACATGAAGATGATATAAACA 

ACGACATGGACAG 

Sequence 2440 

TNCTATAGG GCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTGGCCGCA C I I I I I I 1 1 I < 
1 1 I I M I i I GAAAATAGTTTATTATTTTATTG IT II I I GTTTATACATGTTAAGTTTCAA 
CTTTCAATAATAMATTCMTAMTTTGATTCCTTMTCATAMAACTTGCTTTACA 
Sequence 2441 

CTATAGGGCGNTTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCAAGCTTCGACCC 
ACGCGTCCGATMTTTATACTAAATTTAGTAAAATGGACTTCTTATTCAAAGCATCAATA
ATTAAAAGAATTATTTTAAAAAAAAAAAAAAAAAGTGACCT 
Sequence 2442 

GGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGACAAGCTTCGACCCACGCGTCCGCTT 

MTAGCAAAGGATAATTGAAATCCCAAACTTACAAGGTTTTCAACAAAAGTGAAGTTTGC 

TTAAAGTTAACAGTGTAACATGTATTATGGTAACTTCTAATCTTGTGGCCTTAGACAGTC 

TAGTCCAAAGGCATAMGMAGTTTGCTTTAAAAAAAAAAAAAAGGAATGGTTATCTTCA 

AAAAAAAAAAAAAGTGGGGGGAGACAGAATTTATGTAAAAAGAGTGTTATATGGTAAATT 

CTTGTCCTGAAATAAACTAACTGGTGTTTAAAGAAAAAAAAAAAAAAAAAANGT 

Sequence 2443 

CTNCTA TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCAC I I I I II I I I I I 1 1 I I I I 
Mill lACCTGAAMTGCTTATTCTAGCTTCACATTTGATTGTTTGGCTAAGAAGAAAAT 
TATTTATTAGACTTAATTTTCCTCACGAGTTTAMGATTGCTTCAGATCTTAAACTTCTA 
ATGAGGGAAAGCTGAGAAGTCCAATGCCATTCTGATTTTGGCAACTTACAAGTAGTCTTT
TTTTNNNTAGACCCTTTTCAGGACC I II II I I II I I CCTAAGTCAGGGGGTTTNCAAAAC 
CTTTCNAAAGGGGNTNTTT1TTNGGGNNNCCTGGGCCGGTTTTTAAAAANTAGGGGGGAN 
CCCCCCCGGGGNTGGGGGGGMTTTGATNTNAAAGTTTTNTNGATCCCCGCCCCCCNTGG

GGGGG 

Sequence 2444 

TACTTAGG GCGAATTGGAGCTCCCCGCGGTGGCGGCCGCACI I I I 11 I 11 1 1 I M 1 I I 1 1 

I I I I I 1 1 I GAAATGTTTGMGTTMCTCATTTTATTTCTAGGATTTGG ATTTCMCATTT 
TMTTTCTTTGGMTATMGTCACTTTTTGCAAGCTAAAAAATAGAATCAAACTAAGGTG 
ATCTAAGTCCTCTAGGCATCCAGGCTGATCCTTGGAATCATGAGCAGAATGATGACATAC 
TACANGGGGCTAGCMTACCGGNTNTAMCTNTTAAATAATANCCCTNCATGGTTTTATT 
AGGGAACCAGCCAAAAGTCCCGNCCCTTTTAAAACTNGGGGGANCCCCCGGGCTTNNNGG 
GANTTCGATATTNAACTTTTTTNGAAACCCGCCCNCCNCNNGGGGGGGGGGCCCCGGGCC
CCNACNTTTTTTGTTNCTTTTTAANGGGGGGGNNNNAMTTCGCCCCNCTGG 
Sequence 2445 

ACTTAGGGC GAATTGGAGCTCCCCGCGGTGGCGGCCGCAC1 1 I 1 1 I M 1 1 I 1 1 I I I II I I 

II I I I 11 I 1 I 1 ATTGGTATTTAGTTTTTATTTCATAATCATAAACTTAACTCTGCAATCC 
AGCTAGGCATGGGAGGGAACAAGGAAAACATGGAACCCAAAGGGAACTGCAGCGAGAGCA 
CAAAAGATTCTAGGTTNNTGCGAGCAAATGGGGTNGGNGGGGNGCTTTNNTTGGNTAAAN 
AAGGGATGNNNTTGGGGGTTTAAMAMACCCNNGGCANNCNTTTTTNGGGTNNNCCCCN 
ANCCCCAANNNGGGGGGNNTNTTTTGTGGGGGGGGTNGNNTNTTTTGGGGGGACNCAAAA 
ANGNNNNTNTTGGNCCCCCNAAAAAAAANTTTAANGGCN I 1 I I 11 1 I I I II I I I NTAAAA 
AAAANNNAAAGTTNNNTTGCCCNANNNCNCCCCTNI I I I I 1 1 I GNGGGGGGNGGNGAAAA 
MAAAAANNNGNNGNCCCCNNNTTTTTNNNGGGGGGGGCCCNAANTTTTTNTCNhfTNAAA 
AAAAAAATCTCCCCCC 

Sequence 2446 

CCGCGGTGGCGGCCGCCCGGGCAGGACACAGGAGGCCTTATTACTTTTAAATTATACAAC

ATTTTTTGCTTAAAA I I I I I I I AATAAAA I I I I 1 1 CTTTTATGACTTTTGCAG ACAATTT 

TTTAAACATGTTTTAACTTTTTGACTTATTACAAACATTCTTTTCT^ 

AATTTATTTCAGGACAAGAATTTATCATATAACTCTTTTTATATAAATTGTGCCTCTCCC

CTTTA"! I I I TG AAG ATAACCATTG I Mill I AAAGCAAACTTTCTTTATGTGTTTTGACT 

AGACTGTCTAAGGCCACAAGATTAGMGTTACCATAATACATGTTACACTGTTAACTTTT 

AGCAAACTTCACTTTTGTTGAAMCCTTATAAGGTTTGGGGATTTCAATTATCCTTTGCT 

ATTAATAAGACCTTGTTCAGTCTAAATTAACTTAAA 

Sequence 2447 

NCACTACTTAGGGCNATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCAC 

GCGTCCGGGAGAATTACCCGAAAAACAACCACACTGCTTCAATCCTGGACAGGATGCAGG 

CAGATTTTAAGTGCTGTGGGGCTGCTAACTACACAGATTGGGAGAAAATCCCTTCCATGT 

CGAAGAACCGAGTCCCCGACTCCTGCTGCATTAATGTTACTGTGGGCTGTGGGATTAATT 

TCAACGAGAAGGCGATCCATAAGGAGGGCTGTGTGGAGAANATTGGGGGCTTGGCTGAGG 

AAMATGTGCTGGTTGGTAGCTGCMCANCCCTTGGMTTGCTTTTGTCGAGGTTTTGGG 

AAATTGNCTTTGCCCTGCTTGCCTCGTGMGAGTTTTAANAAGTNNCTTCCNNAGGNTNA 

NNTAAGGGTNTCTTNGGTCTTTNTNANNNCCTTCNTTANTTTGGGGGGNG 

Sequence 2448 

ACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTGCCGCACTTTTTT

I I I I I I I I I I I I I I II I I 1 IAGMTTTAGCCACTTCTCAGGCCTTNTTCCCCATAATTTG 

GAACTTTCCTTTGGATTTGATCAAGTTGGATAGAGTTGATCAAACCCTGATCAAGTTGGA 

TAGAGTTGATCAAACCCAATGGGAAAAAGACCAAAACAACAATAAAAAACAGACAACAAC 

AACAAAAAACAGTTAGGCAAAACAAACAATGGCACAATTTATATGATAACTGAGTGCTCT 

MTGGTMGGAGAAATTAAGACCAGTTGGTCGTTAATNTTAACTTTAACCAAGACAAACC 

CCAA 

Sequence 2449 

TACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTTACAAGCTTCGACCCACGC 
GTCCGCGAAAGCGAAGAAGGAAGCTCCTGCCCCTCCTAAAGCTGAAGCCAAAGCGAAGGC 
TTTAAAGGCCAAGAAGGCAGTGTTGAAAGGTGTCCACAGCCACAAAAAGAAGAAGATCCG

CACGTCACCCACCTTCC 

Sequence 2450 

CCGGGCAGGTACTTTTTCTMTATACTTTTCNATTACACATGAAAGCCATGACAGGAACT 
GAGATAAGATTTCTTTGTTTTTTGAMCATCTTATCTAC^ 

TTNACTTNAMGCTATTAGTAGTTTTATACTCNCTTMTAAGTATTMTAMTTTACATA 

CTNGACTTAGTMNCTAAGCAATTTGGNTAACGTNTTTNTTTATTNGAGNGANTTTTTGC 

CANTTGGATATTTTTNCTACCTTACTATTACNTTATAAATATATTTCCCCAAATATATCN 

TTCCTCTTTAAAAANTATGTTTTGNCAACNAACCTTNAAA 

Sequence 2451 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTCAAAGTTATTGCAC 

ATACACTTGTTTACTTTGTATGTTTTGCAGGATTAAACTTTGTATAATCT7TTTA 

I I 1 1 1 1 I I ICAGTATGCAAGCTTGCAAGATGAAAATAAAACCTGTTTGCCTGATAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAATGT 

Sequence 2452 

CTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAGCTGATTACACCAACTT 

GAATGAAACGACTTCTCTTGTGAACTATCAAGGGGCCGCCAGAATCACCTCTGCAAGTAT 

TGGGGTCAGCATAGGGACTCACTCCTCCAGTACAAAGGAACCGAGGGGTGACCACCTCTG 

AGATGTCCTTGACTTTGTCATAGCCTGGGGCATATTGAGCATCTCTCTCACAGCTGCCTT 

TCTTATCCCCATTCTTGATGTAGACCGGCCGCCCGGGCAGGTGCACATACACCAAATGTC 

TGAACCTGCGGTTCCTCTCGTACTGAGCAGGATTACCATGGCAACAACACATCATCAGTA 

GGGTAAAACTAACCTGTCTCACGACGGTCTAAACCCAGCTCACGTT 

Sequence 2453 

CCGCGGTGGCGGCCGCAC I I I I I I I 1 1 1 1 I I I I II I I I AAGAAATAGGGTCTCACTCTGT 

CCCCCAGGCTGGAGCCATTATAGCTCACTACAGCTTCTGACTCCTAGGCTCAAGGGATCC 

TGCCACCTCAGCCTCCCTGGTAGCTGGGACTATAGGCAGGAGATCGCTTGAACCGGGAGG 

CGGAGGTTGCTGTAAGCTGAGATCGCGCCATTGCTTTCCAGCCTGGGTGCCAGAGCAAAA 

CTTTGTCTN AAAAAAAAAATTN I I I I I I 1 I AATTAAATAAAAAGGGNCAGGGGA I I 1 1 I I 

GGGAAAAGGTCNAAAANAAAATTGNNCTTTTTGAAAAAANCCTTGGNAAAAANCCAAAAA 

AAAAATTNGGNGGGAAAAAAATNNTTTNNTTNNGGGANAAAAAANAAAANAAAAAAANNT 

ANGGGGGGGG I 1 1 I I I I I I NGGGGNGNGGGGGGGGGGNAAACNCAANCCCCCCCCNNTTT 

ANAAAAAAAAAGGGNGCCCCCCCCCCCNNGNGGNAAANGNGNTNTTNNTAAI I I II I I I I 

TTTNCCCCCCCCCCCCCCNGGGGGGGGGGGN 

Sequence 2454 

GAGACACAGTCTCACTCTTGCCCAGGTTGGTCTAAAACTCCTGGGCTCAAGCAATCCTCC 

CGCTTTCAGCCTCCCAAAGTGCTGGGGTTACAGCCGTGTGCCACTGTGTCTGGCCCTTTT 

CTTTTTCATAGGAGAAGGGTTGTTGACTCCCAGGAAACGTCACCTGGAACCAAGAATGTG 

AACTCAAGGACCCCCGCCTGTTGGCAGCTGCATTTACTTGACTCCTGTTCACTGTTTCTT 

AGCCTTGTCCTTTCTCTCCTGCCAGTTCTAGGGGACACTGCTTCTCCTGGTTGACCTCAT 

CAATGCC 

Sequence 2455 

CCGCGGTGGCGGCCGAGGTTTCAAGGACCAGCCTGGCCAACATGGTGAAACCCCATCTCT 

ACTAAAAATATAAAAATCAGCCGGGCATGGTGGCATGTGCCTGTAATCCCAGCTACTCAG 

GAGTCTGAGGAGGAGAATCACTTGAACCTGGAGGCAGAGGTTGCAGTGAGTCGAGGTTGC 

GCTACTGCACTCCAGCCTGGACAACAGAGGGAGACTCTGTCTCAAAAAAAAAAAAACCTA 

CAGCTGTTCMGGACCAGCTGACAGGTCAAGTGTGGCCTTTTCTGGTCTTTGAACACATC 

ATAGAAAGTGACAAATGCTGCAAAGCCATGAAGAACATGAACTATAAACGGGTAGACTAA 

CTGCCCAGCTTAGACACTTATCTATGCCACAAAACAGCTGAATTTGTCACATTTATATAT 

TGCAATAT 

Sequence 2456 

AGGTCTTCGACCCACGCGTCCGGTGGCTTATGCCTGTAATCCCAGCACTTTGGGAGGCCG 
AGGCAGGCGGATCACAAGGTCAAGAGATGGAGACCATCCTGGCCAACGTGGTGAAACCCC

ATCTCTACTAAAAATACAAAAAGTAGCTGGGCGTGGTGGCACACGCCTGTAGTCCCGGCT 

ACTCGGGAGGCTAAGGCAGGAGAATCGCTTGAACCTGGGAGGCGGAGGTTGCAGTGAGCG 

GAGACCACGTCGCTGCACTCCAGCCTGGTTGACAGACCGAGACTTCTTTTTCAAAAAAAA 

AAAAAAAAAAAGTGCGACCTGCCTNGGCGGNCGGTTTAAAAAATNGTGGATTNCCCCGGG 

CTGNAGGAATTTTCGATNTTCAMGCTTTATTNNATTACCGTNCGACCTTTNGGGGGGGG 

GCCCCGGTACCCCAACI I I I I I 

Sequence 2457 

CTATACACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGA 

CCCACGCGTGCGCMTTTTTAGGCCCACAAGGAGTCAAGCACCTCAAGGAGATCTTCAGT 

TTGAACTTGGTGTAGACACAGGGATACTGATGAATCAATATTCAAA7TAGCTGTTACCTA 

CTTAAGAAAGAGAGGAGACCTTGGGGATTTCGAGGAAG 

Sequence 2458 

GCTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTAGAGTCGGGAT 

GCACAACTTCAACCACCGACTTATCAATGCAGCCGCCTGTGTATTGCAATTGGCCGTTAC 

CTTAAGCACTGAGCCACCCGGGTTTAGTTCAGCCATTTCAAGAAGTATATTTAACGTCGG 

TAGTTCTGCTTTATTAAMTGCAGCAGAGGTACCTGCCCGGGCGGCCGGCCGCACTTTTT

I I I 1 1 I I I I I I I I I I AATGAAATTGAGGATTTAACCAAAATGTGAGTTCTAATTAGGACA 

ATAMATTAATCCTAACTACATTTCCTTATAAATACAAAAATGTGTTTTGGAAACACATC 

CCACCTGGGGCGCTAAGCGATGAGAGTAAGCAATTATAATGGCTGACTCCTCCAGCTCTT 

TGCTTTGAGTTGCCGTAAGAAAAAA 

Sequence 2459 

CCGCGGTGGCGGCCGCTACCTGCCCGGCGGCCGGCCGCCCGGGCAGGTCTTCGACCCACG 

CGTCCGAACTAATTGGCTTTTTAGAAACACCCCACAAAAGCTCAGAAATTGGCTTTAAAA 

AAACAACCACCAAAAAAAATCAATTGGCTAAAAAAAAAAAAAAAAAAAAAAGTGGGCCGC 

TCACCT 

Sequence 2460 

ACTACTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTCTTCGACC 

CACGCGTCCGGACAGCTCGTGTCCACAGGGGTCATGGGGTCTCTTGCTGCTAGGATTCTG 

GAGTTTCATGGTAAGAGCGGGCCACTCCCCACCTATTCAACTCACCCCTTCCCCAGGAGT 

TAATGGGGGCTAGTAAGGAATGCTAGTGCTTGGAAGCCCTGTGCAGGCTTCTGAGATTCC 

TGCCCCCTCAGCCCATGCTCTGCATCCTCCCTTCATCCACCCTCAATGTTTTCCTTCAAA 

AGATCTGCTCAGAGTGTGCCAGTCTTCCCAAATTCCTGGTCTCTCCATGAGAGATGTTCT 

TCCTGGCTGCTTCTAGTTGGCCATCTGGTCTTGAGTCCTNTGTACCTCGGCC 

Sequence 2461 

AGGGC GAATTGGAGCTCCCCGCGGTGGCGGCCGCACI I II I I H I I I I I II I I I II I I I I 
TTTTTTGGGTTTCTCTTTGAAAGNTTATTC 

TTTATATTTTACATTCACCTNTCANAATATTTAATGGNACCTGCCCG 
Sequence 2462 

ACGACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGAGCAGGAT 
TACCATGGCAACAACACATCATCAGTAGGGTAAAACTAACCTGTCTCACGACGGNCTAAA 
CCCAGTAGAAACAAAGT 
Sequence 2463 

ACGACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTAAAGTTA
TTTTAGTCATGAAATTTTATATGCAGAGAGAAAAAGTTACCGAGACAGAAAACAAATCTA 
AGTCGA 
Sequence 2464 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGNGGCGGCCGAGGTTTCAAGACCAGCCTGGC 
CAACATGGTGAAACCCCATCTCTACTAAAAATATAAAAATCAGCCGGGCATGGTGGCATG 
TGCCTGTAATCCCAGCTACTCAGGAGTCTGAGGAGGAGAATCACTTGAACCTGGAGGCAG 
AGGTTGCAGTGAGTCGAGGTTGCGCTACTGCACTCCAGCCTGGACAACAGAGGGAGACTC 
TGTCTCAAAAAAAAAAAAACCTACAGCTGTTCAAGGACCAGCTGACAGGTCAAGTGTGGC
CTTTTCTGGTCTTTGAACACATCATAGAAAGTGNCAAATGCTGCAAAGCCATNAAGAACA 
TGAACTATAAACGGGT 
Sequence 2465 

CCGCGGTGGCGGCCGAGGTACAAGATAGGTATGGATTCCAGNCTGGAGAAACCCNTAAAC 
CACTACACCCTGCCTCANAGNAGGGNAGAATTNTCAGTATGTATGTGGAGACAGGCTCGG 
ACGCGTGGGTCGAAGACCTGCCCGNN 
Sequence 2466 

CCGCGGTGGCGGCCGAGGTCTGATTACAAAAATTTAAAAAATATGCAACTTCAGGCATAA 
AGGGTTAAGGGGCAGATCTTCACAAACTCCACACAAGCCAGCAAGCACAGCACCTGCCCG 

Sequence 2467 

TACT TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCAC I M I I I I II I I I I 1 I 1 1 1 » 

TTTTMCCTTTCCTTATGAGCATGCCTGTGTTGGGTTGACAGNGAGGGTAATAATGGCTT 

GTTGGTTGATTGTAGCGGACGCGTGGGTNGAAGCTTGACCTNGGCCGCTCTAGAACTAGT 

Sequence 2468 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGCTTGTACAGTATGTGAAGAAGTTGCAC 

CTCATGAGCTGATCACTGATCAAGTTAAAAACACCAGACATACATTATACGTCTAATAAA 

TTTCCTTCAGGTCATGACCAGCATGAAAAAATNAGACACAGACACTCAGCAACCATTCTG 

TAAATGTGTACCT 

Sequence 2469 

CTACTTAGGGCGATTGGNNCCTCNCNGNGGGGGCGGCCGAGGGCCTGATCTAACTGCTGC 

CAAGACATGCCAAGGTCACCAGCTTCCCGTCCTGACCAACAGCCACCTCTCAGAAGGCTG 

GCTTAACTACTCANCCCCAGTCTTNTTAGCAGATGGCAAGGAGCTCTGGCCAAGATTTTA 

GTCTAAACAGAACCCTAGGCTGCTGATGCAACATCAAGCAC 

Sequence 2470 

GGGATGTGGTTTAAGTGGATTACTGCCTANCTGAGCCAAAATGTTTGCTTGTACCTNnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnACTTNTTTCCTGGNTTAAAATGTTATTTCGGTTTAN 

AAl I 1 1 iCCCCANNCATACCAANNCCCGGGGGGNTTNAAMTTTNTNAAAANCCGGGGGG 

GG CCTTNATG GGGGNGGNGCNANNCNCCCNNATTAATTANNGNNNGNGGNTCNCN^n^NGC 

CCCI I 1 1 I I I IAAAAMNGGGAAAAAACNNTTTTCCNCGNCCNANTTTGGNNTNTATTTN 

AAATTNGGCCCMCCCCCCCGGGGGGNGGGGGGCCNTTTTGGNNTTTTTGGCCCCNNNTT 

CCCCNTTTTNTTTTTTTNTANAAANNANGGGGGGGGGGNGGG 

Sequence 2471 

TACTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTGCGGCCGGCCGCCCGGN 

CAGGGTCACGCGTCCGGCCTGCTGCTTTGGACATGATGCTGACTGGTAGAAGCATTCGTG 

CAGACAGGGCAAAGAAAATGGGACTGGTTGACCAACTGGTGGAACCCCTGGGACCAGGAC 

TAAMCCTNCAGAGGAACGGAC AATAG MTACCTAGAAGMGTTGCMTTACTTTTGCCA 

MGGACTAGCTGATAAGAAGATCTTTTCCAAAGAGAAGACAAGGGTTTGGTGGGAAAAAT 

TGNANAGGGGTTNTTGCNNTNACTTTTTTCAATTTGTTTAGGGCMCMGGTTTTCCNAN 

AAAAAAAAAAAAAA 

Sequence 2472 

CTACTTAGGGCGAATTGGANCTCNCCNCCGGGCGGGCGGCCGAGNACTTACACNCNNGCN 

ATCGNTTTCNTGNCNGCAGNNGGATNCACTAGTTNTAGCTTCGGNCGCCACCGNANCCNN 

CNNGAGCATGCATGGAACACATATACCAAACATCTTTCTGATMCATTAMCATTTTTAA 

AAGATGTTAMTGTTCTTTTCATTGTGGTGCTTCAGATTCCTGATTCTAGAACTTGTGTG 

TGTGGAACCTGTGTGCTAACTATTCTGTTGGAATTTACCAGCAAAGAATTATCTAAGAAT 

TTTCAAACTAAATGATGGGGGGAAGGAACTA 
Sequence 2473

CCGCGGTGGCGGCCGCAC I I I I II I I I I I I I I I I II I I I ! I II ! I I I GGGGTTTCTCTTT 
GMAGNTTATTGNTTTCTTTAAAAAAAAAAAAAACCTATACCTTTTATATTTTACATTC^ 
CCTCTCANAATATTTAATGGTACCTGCCCG 
Sequence 2474 

CCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCGTCCGATCCTCTTCTGCTGTTCTG 

GAAAGAAACAGACCAGTGGTATGGAGTATAAGAAAACTGATGCACCTCAACCGGATGTGA 

AGGAAGAGGAAGAAGAGAAGGAAGAGGAAAAGGACAAGGGAGATGAGGAGGAGGAAGGAG 

MGAGAMCTTGMGAGAATCATGCTGTTGCATTTAGMCTTTCTGCTTTGCACAGGAAA 

G AGTC ACACMTTMTCAACATGTATATTTTCTCTATACATAGAGCTCTATTTCTCTACG 

GTTTTATAAAAGCCTTGGGTTCCAACCAGGCAGTAGATGTGCTTCTGAACCGCANGGAGC 

AAACACTGAAATAAAATAGTTTAT 

Sequence 2475 

CGAATTGGAGCTCCCCGCGGTGGCGGCCGCAC I I I I I I I I I 1 1 I I 1 1 I I I NNGTATTTTN 

AGNANAGATGAGGTTTCACCATGTTGACCAGGCTGGTCTCGAACTCCTGACCTCAGGTGA 

TCCACCCACCTCAGCCTCCCAAAATGCTGGGATTACAGCGTGAGCCACCANGCCCGGCCA

Ai 1 1 I i GTAACTTTTACAMGATATTTMTTTAAMTTGATTTTAATAAAAGGTAGACAT 

CCAAAAACACAGGATGATGAATGCACTTCAATGTTAGGGGAATATC 

Sequence 2476 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTGTTACTTTAAACAAAAAA 

TGTTTACTTNTAAGGATATACGCACAAAGGGAACATAATATACAGATAAATGAGAAGTTT 

CGATTCTGCATCAAGCATTATTCAATCGGACGCGTGGGTCGAAGCTTGTAAACCTGCCCG 

GGCGGCCGCTCGAGGCCGCAC I \ I I I I I I I I II I II I I I I G AG ACGGAGTCTCGCTCTGT 

CACCCAGGCTGGAGTGCAGNGGCGCAATCTNGGCTCACTGCAACCTCCACCTCCCAGGTT 

CAAGTGATTCTCCTGCCTCAGCCTCCTGAGTAGCTGGGATTACAG 

Sequence 2477 

CTAACTATAGGGCGANTGGANCTCNACCGCGNGGNGGCGGCCGNGGNCAAGGCTTCGACC 

CACGCGTCCGNAAAAATTANCCAGGTGTGGTGGCACACTCCTGTAATCCCAGCTACTCAG 

GAGGCTGAGGCAGGAGAATCGCTTGAATCCAGGAGGNGGAGGTTGCAGTAAGCCGAGAAC 

CTACTGCACTCTGGCCTGGGCGACAGAGCAAGACTGTCTTGGGAAAAAAAAAAAAAAAAA 

AGTGCGGCCGNCCGNCCGGGCAGGTCCTAGCTTGAGTCGACCCACGCGGTCC 

Sequence 2478 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAATTGCTAAAACAGCTCCAGG 

GNMGNNATCTTATTTAGCATTAGCTCCCTCACMCTNTTTTCATTCATACTTNTATTGG 

CCAGNCTCATACCTGAAGTATTTTAAATGAGTTNACAATTATTNCACTTACCNTCAGAAA 

AAAAAAGGAGCAAAAACTCTTAATGACTGGTNACATGCACATTTGGTGTAGGAAATTATT 

ATGNGGTAAMTTTATATATTTCTATTTATTTTTATTAATTTATTNTTNACACATTA 

CAC 

Sequence 2479 

ACTATAGGGC G AATNG GAGCTCCCCGCGGTGGCGGCCGCACI I I I I I I I II I I I I I I I I I 
i n i u i I I iTGGTTTTCTCTTTGAMGTTTATTGTTTTCTTTAAAAAAAAAAAAMACC 
TATACCTTTTATATTTTACATTCACCTNTNAAAATATTTAATGGTACCTGCCCG 
Sequence 2480 

ACTACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACG 
CGTCCGTGGTGAACACAGAGAAGACAGGTCTTGTATATATTCCTCTGTATTCTGGGGAGC 
TTTGACCTTGGAGCTTTGTACCTGCCCG 
Sequence 2481 

CCGGGCAGGTCTACTCAAGTAGTCTTTACCCCCTACTCAAGTAGGGGGTAAAGTGTAGAA 
CAAGGAGTTTGATCTGTGTTCAACTGATTGTGAACCATCAATTGAGATAACTCACTACCT 
TCAAGGCCAGCCAGNTACATACTTTTTGNMAAGCCAAGAGTGGMNCAGGGTTGGTTTT 
TAATCCAATTTTTGGGC 
Sequence 2482

AGGNCGACNTTNTCACAGGCAGNNAAACCGGCCAGNTNAAAACACTATGCTANCTCGCGG 

GGCCANTNTTAGGATGGGTGAGGCAGATGAANCCATTCTCCNANTGGCCAAGGCCGAGGG 

CATCAGCCTCAAAGAACNTTTGACNGGAGAGAATCACANACGTGNNNTATTCGTCATAAA 

NAAANAATGAAAAACCNACC 

Sequence 2483 

AGGTATTCGACCCACGCGTCCGTAG7TTTTATCTTTGACCAACCGAACATGACCAAAAAC 

CAAAAGTGCATTCAACCTTACCAAAAAAAAAAAAAAAAAAAAAAAAAAGTGCGGCCGGCC 

GCCCGGGCAGGTACCAGTAGCAACATATGAGTATTTCTCTAGATAAC I I I I I I I 1 GACAA 

GGTCTCACTCTGTTGCCCAGGCTGGAGTGCAATGGTGCAATCTTGGCTCACTGCAGCCTT 

GACCTTCCCTAGCTCAGCTGAACCTCCCATCTCAGGACACCATTGCCTCCACTGCCCATC 

CTGCATCTGCCTGCCTACCCCAAAAGT 

Sequence 2484 

AGGTACATT7TCACTCCTGCTCTAAAACTTGCCTCAGTCTCTCACTGTGCCTTATGCCCC 

TCAGCTGAATTCTTTCTTCTGAGCAGGCAGGAATTGAGGTTGCTGCAGACGTGTATGCAT 

TTGCCACCAGTAACATACTTGGTGCCACATGACTAGGATATGTTCTCTAGTGCTAACATG 

TTCGTTTACAGTTCTTAGGACTCCCTGATAGAAAAAAACACAAAAAAAAACACAAAAAAA 

CCCAACCAACCAAACAAACAAACAAAAAAAAACACAGGAGTTTCTCCCAAAAAGAAGTCT

GCAGTGTCTTTTCTGTTTTCTCTGAAGGTATCCCAGGGTGTTAGAT 

Sequence 2485 

AATGTTGCAGGCTACTCTCTGCCGACCAGGCCGCCGCCCGTCCCTCGTGACTACAGCAGG 

TACCTGCCCGGANGNNNNNNANGNAACGNATGTTTCCACCTNCTTCTCCAACCTCTACCC 

CACCATTAGTNGTATNTTNACTNTNAAAACAGTGGAACCACAGCCCTAAAGACCTGCTNA

TNNAMGTNCTTTTGTCTTAATTGTATTTAAAAAAAAAAA 

Sequence 2486 

GGGGAAAACCCAGCTCCACCGCGGNGGCGGCCGCCCGGGCTTTTCAAGCNTCGACCNACC 
CGTCCGATA ATTTA TACTAMTTTAGTAAAANGGACTTNTTATTCAAAGCATCAATAATT 
AA AAG AATT ATTTT AAAAAAAAAAAAAAAAAAAG G G ACCT 
Sequence 2487 

AGGTACCCTCACCTTGGTCATCTATCCTGAAATAAGGCTTAGTTAGTATTGGCCTGAATG
TTTTGTGI I I I I I I I I I G I I I I I I I I I I I I I I I I I ACTGTTACTTTG AAAAATATGTATG 
TATACCTTATCATATCTGCCTATATCACTTACTTTGGGGAGATACTCAGAGCTTTGTGGT 
TATCAGTATACTAAAAAAAAAAAAAAAAAAAGTGCGGCCACCTGCCCG 
Sequence 2488 

AGGTACGCGGGGGAGCCTGTCCAGCTGGCCCGGGCCCTGGCCTGGTTCTCAAGTGTTTCC 
TAGACAGAGAGGCACCTGGGTCAGTATTAGTCTATTTATCAGAGGTGTAAATAATCTATG
TATAAG i i i I I CTCCTTTTAGATTATTTTGTATTTGTTTAAAAGMGTTTTGTCAAAATA 
CAAA MTATA MGAAATGACTGAAAGTTGTTGACAGGGTTTTTAAGAAATAANTTATTCT 
AATTGi i i i I GTTTGGTTTGTTTTTGCCTTGTAAACTAGCGCCAAGGAAC 
Sequence 2489 

CGGGCAGGTACGATGGGAGGACAGCTTTGTAGAAAGGACATTATNCAGCTNATAGCAAAC 

TTTGTGGATCCCAATCCGAGATTTNCCTGCTGAAAGACAAGAAGTNTCTNAAATAAAAGN 

GCTGTANCAGNATTTGTATACTCCAGAATAAGNTTCTGTGATTCTTANCTGCCAATGTGT 

TCAAGGCGTGATGACTNGGTNTCTGTTTCTNTGAACATNAATACTAGGGTCTGTATAAAT 

TTCMTGCATGCCACCAGCTNATCMCCCTTTTGGCTTTGATTTTTGNATGNNGNATTNT 

TATTCCCTANGANTTCCGGCCAAGTACCTTTGGNCGCCACCCGTGGTGGGAGCTCCAATT 

TCGCCCTTATMGTGAAGTCCGNMATTACGCGCCGCCTCANTTGGNNCGGTNAGTTTTT 

ACAACGCCNGANGACCTGGGGAAAAACCCTTGNCCGTTACCCCAACTT 

Sequence 2490 

GNCGGGCAGGACGCGGGACCAGAATGCAGTTCCAGCTTAGGAAGCCACAAACAAGCCACC 
CAGGAGGAACAAAACACCGNAGCGTGGATTTTTCAAATTTCCCCCGGGAAAGTAAGTCTC
GCTTCCTGCCAAANAA 
Sequence 2491 

CG GGCAG GTNCCTTGCTMGTTCAGACCTTCTCTTCCTTTNCTTTCCTTTCCTCTCCTGC 

CCATTTTCCTGTTCTTCTGTCCTTCAATACTTCTGNAGCTTCCCATTCATGTTCTCTTCT 

CCCACGCAGGCCNCATTGTGTGCAAAAACTGNGGNGGGGGCTGTGCTGNACTCNTNCCTG 

CCTCCTGCCTCCTGCGGCTGTTGGATTTGGGAATGACCTTTGGTGAGAGNCTCACTGCTC

C AGGGTC TATTTTTTGGTCCAMGGCTAGAACCTATANMGTNGGGAATCACCTNTTTTT 

TCllll l CNNGGGTGAAAATAAMTGGGTTTTTTTCAANTTCNAACAAAAANAAAAAA 

Sequence 2492 

AGGTACAGAGAACTGMTTTACACAATAAAGTGTTACCCTATACCAGTGATTCTAAAATT 

TTGG TCTGGGGAACCTTTCATGGGGTTCATGAGGTCAAMCTATTTTCATGATAATGTTA 

TTTTGCCTGTTATATTCTCATTCTCGAGTATACAGTAGAGTTTTCCAGAGGCCACATGAT 

CCATGACATCACAACAAAATGAATAGAGACAGATGAAAATCTAGCTGGCTTCTGCTACAT 

GAATCAGACATTAAGGGGTCCAGAGACCATAAAGTGTGACAACCACCACTCTTCACAACC 

TATATATAATATCTCAAAATAATGTTAATTCTCTATCCTCAAAGTTTATTTCCTTATATC

TACATTTTCTATGATCAACACACTCACTACAAAA 

Sequence 2493 

CCGGGCAGGTACCACAAAAACAGTTACATGGTAGAGTTCCAATCACACAGAAAGGAATCC 

ATTCAGCAAATTTCCAAAGTTGCAGACTGTTTGGGCCCACCCAGAATTCCACCAGGACTG 

GACGGCCCCCACCAGGCTATACTGGATATGTGGACACAGAGCTTATGCTAAGCTGCCTGA 

TCAGGACAGGTAGCTGTATMTTGGCACCATTAAGCCATCTTTCTTCTTACTGCCCATAA 

AAACAGGTGAACTTCTAGGCTTCCCAGTCTATGCTTCCTGCAAAAAATGAAGCATAGCCA 

TAAGTGATTAGAAAAATAATGAATGGCCCTCTGAAAGAATCATACAATACTATAGACCCG 

CCACTTAGGCACAAGAAGGCTCATGAGGATATC 

Sequence 2494 

AGGTACCTTGGGATTGCAGGTGCCCACCCTTGCGCCTGGCTMTTTTTGTATTTTTAGTA 

NAGACGGGGTTTTGTCATGTTGTCCAGGCTGGTCTCGAAGTCGAGCTCAGGTGATCCGCC 

CATCTCAGCCTTCCAAAATGCTGGGATTACAGGTGTGTGCCACCATGCCCAGCCAACACA 

CACATTTATTTMTGCMGTTTTACCTAGCACAGAGAAGCAGTGAGAGTCAGTTACTTAT 

ATATTGAATTGGACCAAGTAACTTGTGAAGAAGCTAGTAAATATATGGAGGCTAAAAAGC 

TGAGTGGTTCTGTTCTAACAAGGTCTGCACAGTAATCTCTTGGCCTCGACTTCTCATCCT 

TAAAAATAAGGAGATCGTCCTATGTTTC 

Sequence 2495 

NNGG GGGGGGGGGGCCCCCCCCCNGGGNNNAAAAAAAAAANA I WWW ] INTTNTTTTT 
i I I I iNAAAAAMAAAAAAAMNNNGGGGGGGGGGGGGGGNGGGGGNGGGTCTTTTTTTN 
AAAAAAAAAAAAAAAAAAAAAACCCCCCCCCCTNNN II I I I l I I I I I I I I I NTNTATNAA 
AANNGGGGGGGGGNNNNNNNNNNNNNNNANATNTNTTTTTNNNNNGGNGGGGGGGGGGGG 

Sequence 2496 

TGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACAGCGGGGGCGGAGGTCANGGG 

ACAAGATGGTGCCACCGGTGCAGGTCTCTCCGCTCATCAAGCT 

Sequence 2497 

GGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACAGCGTGGAAGGGTTTANGGCAG 

CAGTGTTCTGATTCTTTGCGGGACGGCGAGCGCATTTGTGCTTTGCCCGCCGCGGCCTAG

GAGGCCTTTTGAGGCCGCGTAGTCGGTGTTTTTGAACTGACTCTACAGCTTCTGGCAGGC 

CGTGCGGCGCCCTGACCCGGCCTCACCATGTTGGTGCTGTTTGAAACGTCTGTGGGTTAC 

GCCATCTTTAAGGTTCTAMTGAGAAGAAACTTCAAGAGGTTGATAGTTTATGGAAAGAA 

TTTGAAACTCCAGAGAAAGCAAAAAAAAAAAAAAAAAAANANAGTACCTGCCCG 
Sequence 2498 

AGCTCNCCGCGGTGGCGGCCGNGGACNAGGGNCTGANTGTCTGNGTNTCAGAATGGGATN 
AGTGNCCTTATAAATGAGGGAGCTNGNTTGTCCCTNCCACNACATGAGGTTACAGCAAAA 
AGATGGCTGTCTATNAACCAGCAAGTTNGCCTTTGNCANACCCCAGATCTGCANACTACC

TTGATNTTGGACTTTNCATCCTGCACAMTCTMGANANAMTTACTGNTGTTTATCAAC 

CACTCNGTTNATGGTNTTNTTCGTTATAGCAGCCTGAACTAAGACAACAGGTNGATCTTA 

AGGCATNGCTACNATNAAGTCTTTCCNTGCTCAGAATCTCC 

Sequence 2499 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGGGAANTGGATGACTT 

TTCTTGTCCATATCACCATGGAAATCTGTGTGCTGGGCATGGAGAGTGNGAAGCANGCAG 

ATGCCAATGCTTCAGTGGCTGGGAAGGTGATCGATGCCAGTGCCCTTCAGCAGCNGCCCA 

GNACTGTGTCAATNCAAAGGGCCAAGTGTGCAGNGGAAGAGGCACTTGTGTGTGTGGAAG 

GTGTGAGTGCACCGATNCCAGGAGCAT 

Sequence 2500 

TGGAGCTCNCCGCGGTGGCGGCCGAGGTACTATATGTCTGAATGTCTGTGTCTCAGAATG 

GGATTAGTGACCTTATAMTGAGGGAGCTTGTTTGTCCCTTCCACTACATGAGGTTACAG 

CAAAAAGATGGCTGTCTATGAACCAGCAAGTAGGCCTTTGCCAGACCCCAAATCTGCAGA 

CTACCTTGATCTTGGACTTTCCATCCTGCACAMTCTAAGAMTAAATTACTGTTGTTTA 

TCMCCACTCAGTTTATGGTATTTTTGTTATAGCAGCCTGAACTAAGACAACAGGTAGAT 

CTTAAGGCATAGCTACAATTAAGTCTTTCCATGCTCAGAATCTCCATCTGCTGGCCAAGC 

ATAGTGGTTCGCACTTGTMTCCTGGCACTTTGGGAGGCCAAGGGCGGGTGGGTCACCTG 

AGGTCAGGAGTTTGAGACCAGCTTGGCTAACATGGCAAAACTCTGTCTCTACTAAAAATA 

CAAAAAATTAGCCGGGTGTGGCGGNGGGTGCCTGTAATCCCAGC 

Sequence 2501 

NTGGGCG MTTGGAGCTCNCCGCGGTGGCGGCCGCCCGGCCGGGCCATTNTTCTTTCTTT 
1 1 1 1 1 1 1 INGCGGATGGGGACTTGTGAATTNTTCTAAAGGCGCTATTTAACATGGGAG 
Sequence 2502 

AGGGGCGAAATTGGGAGCCTCCACCGCGGTGGGCCGGGCCCGAGGTACTTTTCTTTTTTT
TTTTGNTGGATGGGGGACTTGGNGAATTTTTTCTAAAAGGGGGCTATTTNAACATGGGGA 
AGGANAAGCGTTGTGCCGGTTTCCA 
Sequence 2503 

CCGGGCAATCTAAGAAGACATGATCACTAAATGTGATGTGGGATCCCAGATGGGATCCTG 

GACCAGGTMAAACTAAAGTAATGTTTCAACTTCAGTAAATAATAATGTATCAATATTGG 

TCCATTAATTGTGGCAAATGTGCCACACTAATGCAAAGCGTTAGTAACAGGGAAAACTGG 

GAGCAGGGTATATGAGMCTTTTTGMCTGTTTTCACAATTCTCCTGTAAATCTAAAACT 

CTTCTGAAAATAGAGTTTATTCTTTAAAAGTGTCTGGAGGATGTGCACAAGGGTGTGGCA 

GCAGAGGGGCTACAGGTAAAAAATCATGACATCTGGMTA7TTCCTTCAATTTTTGCTCC 

ACACGGTGACTATCTTACCCTGCTCCC 

Sequence 2504 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGGTACCCGGGGGGTCTGAGG 

ACCCGAGGTCGTAGGTGGATCTTTTTCACGGAGCAAAGAGCAGGAGGACAGGGGATTGAT 

CTCCCAAGGGAGGTCCCCCCGATCCGAGTCACAGCACCAAATTTCATGCGCATCTGTGTA 

MGAGACCAACAAGCAGGCTTTGTGTGAGCAGCAAGGCTGTTTATTTCACCTGGGTGCAG 

GCGGGCTGAGTCTGAAAAGAGAGTCAGTGAAGGAAGATGGGGTGGGGCCGTTTTGTAAGA 

TTTGGGTAGGTAAAGGAAAATTACAGTCAAAGGGGGGTTGTTCTCTGGCAGGAGTGGGGG 

TCACMGGTGCTCAGTAGGGGAGCTTTTGAGCCAGGATGAGCCAGGAGAAGGAATTTCAC 

AAGATAATGTCATCAGTTAAGGCAAGAACAGGCCATTTTCATTT 

Sequence 2505 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTAACAAATGCAGTAGCCAACAAGATTACCATG 

CMTCATTAAGGAGAACCAAAGTAAGAGAGCCACTCAAACCAGATTTTGAACGCTACTAA 

AATTAAAGTAGTTCTTTGATGAATATGAATGAGTAGGGAMGGATTCTTTGTAATAGTGA 

TACCTCTGTGGTAAGAGAAGGGTGGTATGTGAGTTTTAGTCTACAGATTATGGCAAATTC 

AGTGACAACAATCAAATGGTCTAAGATTGACAGTAGCACAGTTTTACTCTGTGAAGGTAA 

TGTTCAGGACAAATTTCMGAAAACTAGAAAACCATTCTTTACAGCTGAAATCTTTCCCT 
AACCATTGTTATTTCCACTTTTAAGTCCTCAAGAGATGAGAAAAGGGAGGTAAGGCTTCC
TTATACATTTCCTGCACMTGAAACATTTTTCCTCCTCCAGGCAAAAGATTCAAGCAGAA 
CTGGCAAATATCTTATCTTGCTCTTCTTAATAATATAATGGTGGTAGGATATAAAGGTCT 
ATACAATTAACCTANAT 
Sequence 2506 

ACTACTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGNCNGGCCAGGTACACNTTC 

TTAGACCTCAACCTCGAACTNTCAAAATTNAGGATGTCTCANCCCTACTGAGGCCGGGAG 

TCACCTNNACACTGANGGCCCTNGGTGNGAAGATGAACCTTNCACCGTCTNTANTGCATT 

CTGGAGTGCAAAAATAAAATCCACTNAAGAGTCACAAGGCCCGCTGTGCATAATNGGNTT

CACTTTTACC I I I I I! I 

Sequence 2507 

CCGCGGTGGCGGCCGGCCGGGCAGGTACATGTAATGCTCCTGAACTGTATGCTTCGCACG 

GCTGACATGCTMGNTTTGTTCTGTGTATTTTATGACTATTTTTTAAAAAGTAAACAAAA 

AAGAATTAGCTGGAAATACCAGCACAGGCAAACCCCTGGAGACAGAAAGCAGGTGAGTGG 

NTGCTGGGGCTTGAGCAGGAGGAAGGGCGAGGGACTGCANAATGGCCATGGGCTTTGCCT 

TCTAGCATGATGAGAATGTTCTGGAATTAGACAGTGGTAACGCTTGTTCAACACTGCCAG 

TGTA GTTM TGTCACTGAATTATACACTTTAAATGGCTAACATGACCAATTTTATGTTAT 

ATATATTTTACTACCACAAAAAAACTANCTGGCACCTAAAAACATTCCATTGAACAGGCC 

CCTTCAGATCTGTGTCTTTCCTGCATGCAAATTACNCCACAGAGCAAGCACCTATGGCAN 

CGTGGATCACAGGCTCTGTTTTANGATAGANAAAGGACACAAGGNGTCCCCC 

Sequence 2508 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTCTTTTCTTTACTTTT

I I I I 1 1 I I IGAGACGGAGNCTCACTCTGTCGCCAGGCTGGCGCGATCTTGGCTCACTGCA 

ACCTCTGCCTCACGGGTTCMGCGATTTTCCTGCTTCAGCCTCCTGAGTAGCTGGGACTA 

CAGGCACACGCCACCATGCCCAGCTCATTITTGTATTTTTAGTGGAGATGGGGTTTCACC 

ATGTTGGCCAGGATGGCCTCCATCTNTTGACCTTGTGATCCGCCCGACTCGGCCTNCCAA 

AATGCTGGGATTACAGGCGTGAGCCATCAAGTCTGGCGAGAGAGATTGTTTCTAGATGAG 

GGNGGGGGCCGGNTGTCCTTANCCCAAAGCTTGTGCCAGTCTCTATCAGAAATAAATGCC 

CCCAAAACCAAAAAAAAAAAAAAATAAGGTACCT 

Sequence 2509 

GCCGGGCAGGTACTACNTCAGCAATTTCTCCANNGNGCNGNNCGACAGCATATGGCACCA

GCCCATTTTCAATTTGCTGAGCATCGGCCAAAGCCTGTATGCGAAAGCCAAGGAGCTGGA 

CAGAGTGAAGGAAATTCAGGAGCAGCTCTTCCATATCAAGAAGCTGTTGAAGACCTGTAG 

GTTTGCTAACAGNGCATTAAAGGAGTTCGAGCAGGTGCCGGGACACTTGACTGATGAGCT 

CCACCTGTTCTCCCTTGAGGACCTGGTCAGGATCMGAAAGGGCTGCTGGCACCCTTACT 

CAAGGACATTCTGAAAGCTTCCCTTGCACATGTGGCTGGCTGTGAGCTGTGTCAAGGAAA 

GGGCTTTATTTGTG 

Sequence 2510 

CCGGGCAGGTACMTTGNTTTAGMGATANTTTGTTTTTCTCTCTTCAGTTTCNCATA^ 

ACTAMGACAAATCATGGTAGGATTGGNTTGTTTATTATACTTGGCCTAACTATTTGTAT 

ACMTGCAGCMGAATGATTATTTTTTACTTAGGCTTTTAAGTAGGCTCTGATGGAACTT 

TGTTCCATAGCAGGAATCTCAGATAAGACTTTGTAAAACCCGTAAAACTCANCCGAGCCA 

TGGATTTATGCCATTAMTACCCATGAGTTGGGTGAMTTTCCTNTCCTTTNGAGGGCCC 

AAGATAAACCTGGGGCGTCTGCACTTGNCAAAAAGTGATATTCTTTACTTACACAG 
Sequence 2511

AGGTACACNGNTNNAATCTTACTTCAACTTTNAANGGGCCCACNNAACCCTCTATATCCC 

CTTGTAAATTTAACTGCTAGTCCACAAGAGGAACAGCTCTTTNTACACTAGGAAAAAACC 

TNGTNGAGAGAGTANAAAATTTAACACCCATANTNNGCCTAAAAGCAGCCACCAATTAAG 

AAAGCGTTCANGCTNAACACCCACTACCTAAAANATCCCAAAAAAAANAAAANAAAGNAC 

GCGNGGANGTGTNMATTTTNANAGAAGAATTTNTNTrrTAGTTCTTTGCACGAAGGNANA 

GATAAAGACACTTTTTCAAAA 



Sequence 2512

CCGGGCAGGTACGCGGGGACTGAAAATNGGACTGTTCAACTCACCTGGCAGCCACTCCCA 

GAGCTCCTGGAACTCTGGCCCAAGGTTCTCTGACTGACTCCTTCTTGGCTTACTGGCTGA 

AGACTGACGCTGCCTGATCGCCTCAGAAGCCCCGCAGACCATCATGGACGCCGAGCTTTA 

GGTAACTCACAGTGGAGGCCCGCCTGCACCCAGGTGAAATAAACAGCCTTGTTGCTCACA 

CAAAGCCTGTTTGGTGGTCTCTTCACACAGACGCGCATGAAAGGGAAGACATACAAAAAC 

AAGGCCTCTGAGGTAGGTACCT 

Sequence 2513

CCGCGGTGGCGGCCGAGGTACTTN I I I I 1 1 I I I I I I I I I I GGGTTTCCGGGGTCGTNTTG 
GGGGTCTCAATATTTTTGGCTCCTCT 

TTGGAGT CNMT TCCTGATCAGAAGTTTGAATGGTTGTTACTGCTGTGTTTTTCATGTCA 

ACAATTTCTTTT^ ^TTGT TNCCTGACMGGTGTCCTCTTGCAGCTGACTGTATTATTATA 

GCGCTTTCTTTCCTTTTCTCCTGTATTTTTTGGTATCTTNTTGAACACAANM 

ACACAAGCTTGAATCAAAATAAGCTTGTCMTAGCTTCCTTTCGCATTAGATTTAACTGC 

TC CACGN GATMTACTTMGGAACACTTTTGTTTTTCCAAGAGCCCAGTTATCGAGACCA 

GCTTTTTCCAAAATGGTGGCACAAGTGTCAGGGCTCATGCGGG 

Sequence 2514 

TACTT A GGGCG AATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC I M I I I I I I I I 1 1 I I I 

TTTTTACTTTTTAAACTTCTTGTTAAAAATGAAGACACAAACAAACACATTAGCCTAGGC 

TTACACAGGGTCAGGATTATCMGATGTCACTTAGGCGATTGAAATTTTTCAGCTCCATT 

ACCATCTTATGGGACCACCATCCTATAAGCAGTCTGTCATTGACCTAAACATCATTATTC 

AGCACA AGCGTATTTCAAATTTAGAGTTTTACTTTGATGTTCTTC ) I I I I \ I CTTTTTCT 

TTTTTTGAGACGGAGTCTCACTNTGTCGCCCAGGCTGCAGNGCAGTGGTGCAATCTCGGC 

TCACTGCAACCTCCAGTTTGGGCCACAGAGCGAGACAGCGAGACTCGGTCTCAAAAAAAA 

AAAGAAAAGAAAAGTACCTGCC 

Sequence 2515

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGAAATAAAC 

TCAAAGGATAGGTTCAACACCATAATAGAGAGGACAGACAAAAAGAATCTGTGAACTTGA 

AAATTGAACAATAGGAATGACCCAATTTGAATAACAGAGATAAATGAACAGAGCCTCAGG 

GAATTGTGGGAGTATAACAAAAGATCTAACACTGATGTCGTCAAAGTCCCAAAGGAGAGA 

GAGGGGATGGGGCTGAAAAAGCACCTGAAGAAGTAATGGCTGGAAGCTCCCCAAATTTGG 

CAAAACACATAAGCCTACAGATTCAAGAAGCTGGGTGAATCCCAAATGAGAGAAATCCCT 

TCAAATTCACACAAAGACACATTATAGTCAAATTTNTGAAMCTNNCANATAAAAAGAAN 

TNGNCAAATNNGGTACCTGCCCG 

Sequence 2516 

CCGCGGTGGCGGCCGAGGTAC I I II 1 1 I CI I I I I I I I I I I I II I I I M I I I I GAGACGGA 

GTCTCACTCTTGTCACCCAGGCTGGAGTGCAGCGGTGTGATCTTGGCTCACTGCAAGCTC 

CACCTCCCAGGTTCAAGCCGTTCTCCTGCCTCAGCCTCCCAAGTAGCTGGGACTATAGGC 

GCCTGCCACCACGCACAGCTMTTTTTTTGTATTTTTAGTAGANACAGGTTTTCACCAC 

TTGGCCAGACTGGTCTCAAAACTTNTAACCTCAGGNGATCCACCCGCCTCAAAGNGCTGG 

GATTACAGGCGTGAGCCACCGCACCCGGCCCAGACTCCTTAAAATGTGANAAGTAGCACT 

GAGGMTGTGATCAGATTATGGCCTTGATTGGCACATGGGGTCGCTTTCCACGGTTGGCC 

TTCTTGTTCTCCACGGCATCTTGTGCATMGCCATTTGCCATTTCAGGAGCTCAGCATGC 

ACATCCCGGGATTTCTGNGCTTGGGT 

Sequence 2517 

CCGCGGTGGCGGCCGAGGTACCTGTGACATCATAATTGCACCCTCCGACATGATATCTCT 

TCCAAATGCTTGATGAAAAAGGTGGGAGGATCACTTGAGCCAGGGAGTTGAAGGCTGCAG 

TGAGCCCTGTTTTTGCCACCACACTCCAGCTTGGGATTGATTCCTAAAGACTCATGTTAC 

GTGAGGAAGCAGCTCAGAAGAGGAAAGGAAAGGAGCCAGGCATGGCTCTTCCTCAGGGAC 

GCCTGACTTTCCGGGATGTGGCTATAGAATTCTCATTGGCAGAGTGGAAATGCCTGAACC 

CTTCACAGAGGGCTTTGTACCTGCCCG 
Sequence 2518

CCGCGGTGGCGGCCGAGGTAC I I I I I I I I I I I I 1 1 I i I I I I I I I I I GGGGCTGG AAGGGG 

GGCAGAGTCTCACTCTGTTGTCTAGGCTGGAGNGCAGTGACAGAATCACCGCTCACTGCA 

ACCTCTGCCTTCCAGGTTCAAGTGATTCTNGCGCCTTAGCCTCCGGAGTAGCTGGGATTA 

CMGCTAATCCCANCTAGGCGTGCGCCACCACACTCGGCTMTTTTNGTTATTTTTATTA 

TAGTANAGACGGGGTTTCACCATGTTGGCCAGGCTTGTCTCAAACTCCAGACCTCAAGTG 

ATCCACCCGCCTTGGCCTCCCAMGTGCTGGGATTACAGGTGCGAGTATATGCTTTTAAA 

GGGTATCCAATCAAGCTMCTATGGTGATGGAATGTCTCCAGTTCCTCTGTAATACACGT 

ATCGTCCAGCCCGCGTACCTGCCCG 

Sequence 2519 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTGAGA 

TTANGGGAGTGGTGATGACTCTTAACMGCATGCTGCCTTCMGCATTTGTTTAACAAAG 

CACACCTTGCACAGCCCTTAAGCCATTTAACCCTGAGTTGACACAGCACATGTCTCAGGG 

AGCACAGGGTTGGGGGTAGGGTTACAGATTAACAGCATCTTAAGGCAGAAGAATTTTTCT 

TACAGAACAAAATGGAGTCTCCTATGTCTACTTCTTTCTACACAGACACAGTAACAATCT

GATCTCTCTTGCTTTTCCCCACAACCTCAGCCTCTCAGAGTGCTGGGATTACAGGCATGA 

GCCACCGCGCCCAGCCTCCCTTTTAAAGCACTTTCTGAAGTCAAGCCTGATTCAGGATTG 

CAAGCCTGCAGAGAACTATGGTGGTAAAGCCTAAAAAGATAGAATCCTTCCCACACCTGA 

GMGGCAGGTATTTTTAGAAGGAAACACCAGAATCACACTTAAGTCACTGCAAAGGCATT 

CATGTTTATACATTTCTGMCTGTCTTACTTGGAACTTTATGNGG 

Sequence 2520 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGGGCATGGTGGCTCACGCCTGTAAT 

CCCTCCACTTTGGGAGGCCAAGGCAGGTGGATCACCTGAGGTTGGGAGTTCGAGACCAGC 

CTGACCAACATGGAGAAACCCCGTCTCTACTAAAAATACAAAATTAGCTAGGCNTGGCGG 

CACATGCCCGTAATCCCAGCTACTCCGGAGGCTGAGGCAGGAGAATCACTCGAACTCGGG 

AGGCANAGGTTGNGGTGAGCTNANATCACACCATTGCACTNCAGCCTGGCCAACAAGAGT 

GAAACTCCATCTCAAANAAAAAAAAAAAGGAAACATGAAGCCTTCCTTNAATGATGATAG 

TTTCTAAAGTGAATTATTTGMTCTCTTTGCATGTTTTGGCTCTGTTAATCTMCTC^ 

TCTCTAAATAGATGCTGAAAGTGTAAATCTAGATGACTATAACATACACGTNATTGCAAG 

TGTATTCAA 

Sequence 2521 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGGGCAGGTACTTATTC l M 11 I I 1 I I I 



Mill IGGTTATGAAAACTTAGGGACTAAAATTAATATAAAAATTGGCATAATGTTGGAT 
TGAATCTACATTTTGGCAGAAGTTAAACATTCCCACATAATGTCAAAATTATACATCATG
CAGTTCTGTTTTTTTGTTTGTTn'ATTTTGT^ 

GGCTGGAGTGCAGTGGCGTGATCTGCAACCTCTGCCCCCCGGGTTCAAGCGATTCTCCTG 

CCTCAGCCTCCCGAGTAGCTGAGATTACGGGTGCGCGCCACCACACTTGGCTAATTTTTG 

TATTATTAGTAGAGACGGGGTTTCAGCATGTTGGCTAGGCCGGTCTNTCCTGACCTCAGG 

GTGATCAGCCCACCTCGGCCTCACAAAGTGCTGGGATTACAGGCGTGAGCCACCTTGCCC 

AGCCCACATCATACAGTTTGAMTGAAACTTTGCCACAACCAGCCTTTGCTGTAGCACAC 

ACATATATCACTGAACCTGTTTGNAATAAAAG I I 1 1 I I I < 

Sequence 2522 

CCGCGGTGGCGGCCGCCCGGGCAGGTAC I I 1 M I 1 I 11 ) 1 11 I I 1 1 1 11 I M I 1 1 U 1 1 I 

TTTNGGTCATTTGNCCTTAGTTTTTAAMACAAMGTGTTAACTAGAATNTAACAACAGA 

TCAMTCCAAACNCA GCAG TCCAGNGGAGAATCAAAACTTTTCCGGCTTTATTNTNTGGG 

AAAACCCCTGGTCTGTTTTCATTCCTATTGGNCCAGGCCACCATTNTATGATATGAAGGC 

CTAMTTAGGAMGCTAGGNGAGCTGNGCAMTTNTGGGTGTCTGANCCNCCTGTTGTTT 

GGCGTGTGATGGGGGTGGAGGCCCNCACAGGGGTGTTCTCGCTAGNGTTCAAATCACAAA 

AACAGGGACCGTAACTAGGGGGAGGNGAGCAAAGCNCTCACCTTGGGCACAAAATTTAAG 

GNGTGCCAAAAMCCCAGT MCCAA AGATAAATACTNTTTTMTGCAACATTTTTAAAAA 

ATCCMTTAAATGTAAAAAGGTTTTTGATGGACAATGTTTCNAAAATTTTNAATAAAAGG 
GTTCCCCCCGTACI MINIMUM
Sequence 2523 

ACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACACACATTACACTATCCC 

TTATCAAATATACTGAATGTAAGTAGGCTGGGTGCAGTGGCTCATGCCTGTAATTTCAGC 

AACTTGGGAGGCCAAGGTGGGTGGATCACCTGAGGTGGGGAGTTGGAGACCAGCCTGGCC 

AACATGGTGAAACCCCATCTCTACTAAAAATACAAAAATTAGCCAGGCGTGGTGGCACAC 

GCCTGTAATCCCAGCTACTCGGGAGGCTGAGGCAGGAGAACTGCTTGAACTCGGGAGGTA 

GAGGTTGCAATGAGCTGAGACTGCACCACCGCACTCCACCCTGGGTGACAGAGCAAGACG 

ACGTCTCAAAAAAAAAAAAGTMTTAAAGCTTGTTTCGATTACTTAGGCTCATCAATAGT 

AAGATCAACATGGTACCTGCCCG 

Sequence 2524 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGNCCGGGGTCCTTTGTCCTNAAAACGT 

TAGATCTGGGCTCTGATTCCTAACTGCCACTCGTGATCACAGAGGTGCAGAGAGTCATTA 

GCCATTGTTGCACTTCCATTGTTGCAAGCCCCTCACCTTCTGGTTGAAGTGACTTTCAGG 

AGAATTAAAAGAGCTGGNGTTTCCCTCCCCACTCCCATTCATTTTTCTCATTTTCCCCTT 

TTGGGAGCCAGACATAAACACAATTGCNTATATGAGAGAAATCAGAAAGTCATGGCACAG 

CCTAGGGGATGGTGCAGGCTCAGAAAAGACCTCTGAGAAGACCTTAGATTTATACTTCAG 

ACTCATCTTAGGCAAAGAGGGCTTACAACAATCAAAAAAACAAAAATAATAAAAACAGTA 

ACAAAAACAGCAAACCCTGNAGAAGAGGAAGAATCTGATTTCCAGAGTTAACACATTAGT 

AAATTCTMTGTCTGGTTTTCAACAAAAATATCACAAAGGGAtTACCAGAGAAACAGGGA 

AAGTGTAACCCCATTCAAATTGAAACCTAAAGAGACTTAANAAACAANAGGGATTTAAAG 

GTCAAG AAAA I I I I MA 

Sequence 2525 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCGGAAGGTCACGGGCTT 

GAGTCGTAGCCAGGGATAGNATTATGGGTGAGAATAGACAGGATGGAGGACACGAGGCCA 

CAAAGCCCCACAATCCCAGGGCCACAGCGCCAGAGGCCTAGTTTGTCCAGAGGAATGAAG 

AGATCACAGGCATTTCTGACCACGTNTAGCANAAGTGGGGGATGACCTNTAAGGGCTCGA 

GCCAGGAGCNTGACTTGCAGCCCNAGTTTCAGAGCCAGTTGGGGCANACCTCCTCCTGGA 

GTCCTGGTCCCCCMGTCCCCCAGTTTTNATTTCCTClsrTGGGACTCNTCTTCAAAANCNT 

TTMTNCNCCGNTTNNANGNMANNATTTTGGTTCATTNATANGNCNANTTTNTNAACCT 

TNCNGGTTAAATTNATNANTNNGGAAAACNAATTGNACCNNCCCNNGCNNNAGCNGACCC 

CCNGCCCGTACN 1 M M M M M M M M I 

Sequence 2526 

CCGGGCAGGTACGCGGGGCTTCTGTTGGGCGTTTCTGCTGAGAGGCGGGAGGCGCCGAGA 

GTCTGTGCGMGGTCCGTGGACAGACTGCTTTGCCTGTTGTTGCTCTTCGGAGGCGGCGA 

TCCCCGAAGGCGAGCTGAAATACGGCTGCAGGCTACAATTTGCAGCCGACCATTATGGAT 

GACAAGGAGCCGAAGAGGTGGCCCACCCTCAGGGACCGCTTGTGCTCGGATGGCTTCTTA 

TTTCCCCAATACCCCATTAAACCGTATCATCTGAAGGGGATCCACAGAGTTGTCTTCTAT 

CGTGATCTGGAGGAACTGAAGTTCGTTCTGCTCACCGCCTTATGACATCAATAAGAGAGA 

CAGGAAGGAAAGGACCGCCCTACATTTGGCCTGTGCCACTGGCCAACCGG 

Sequence 2527 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTCAAGAAGTAAGTGGAA 

CACCTTTCCCTGTCATAGTTATTTTCATCCAGACATCTGGTGGAAGCATCAGATTCCGAC 

AAACAAGGATTTATGTCAGGATCTCTCAGCCTCTGTGTTACCGAGGGCATTCTAACAGTC 

TTCTTACTCCGGCCTCCGCTTCGCCAGCACCCAGGCCGTCTCCAGCTCCAAACACCCCCG 

CGTACCTGCCCG 

Sequence 2528 

ACACTACTTAGG GCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACI M M M M M I 
i m i i ii i u i i i NGGAANGGAGTCTCGCTCTGTCACCCAGGCTGGAGTCCAGNGGCACG 
ATCTCGACTCACTGCAACCTCCACCTCCCGGGTTCAAGCTATTCTCCTGCCTCAGCCTCC 
CGAGTAGCTGGGACTACAAGGGATCTGCCACCACNCCCGGCTAATTTTTGTMTTTTAGT 
AGAGACGGACTTTCACCATATTGGCCAGGCTGGTCTTGAACTCCTGACCTTGNGATCCAC

CCACCTTGGCCTCCCAAAGNGTTGGGATTACAGGCGTGAGCCACCGCGCCCGGCTTACTA 

CMGTCATMGTTTCTTAAAGGCAATGTAAACTCCGAAAACTTAATGCACTCTTATATTG 

NT MTAC ATTAAMTCCACTGGCCTGGCTTACACTTTTGAATCAATCTTTGAGCCATGCA 

TGATTTTGTAACATTCACGTACCTGCCCCGGCGGGCCGNTTNTAAAACTAGNGGATCCCC 

CGGCTGNAGGMTTCGAATTTAAAGCTTTATCGATCCCGCNNCCTCGANGGGGGGGNCCC 

Sequence 2529 

CTATAGGGCGAATTGGAGCTCC CCGCGGTGGCGGCCGCCCGGGCAGGTAC 1 II i I I I I I I 

I I 1 1 I I I I I I GTTTTTATTTTTTTATTTATGTTGGCAGAAAAGCTGAGGCAGGGCTTCTG 

ACATAAGGTAAAAGTGTCTTGGAACATGTCCTGGGTCCAGGGTCTATAACCCCTTGNGGC 

CTATGGAACACCAAGCTCTGTGCCAAAGGGTGGAAGGCTGCCCTGCCNCACTACAATNTA 

AGCCCAGAGCATAAAACCCCTTGTAGCCTGNGGAATATATNCANACTCGCTGGCCCCTTG 

CTCNTTGCTCTNCCAAGATCACAAATNGATTGCATNTNNAATTAAAAAAACCTGNTCTCC 

CTTATCNNAAGGTAGCAAGANNCATAGCCAAACCCGTNCAGGNTACCGNTTGNNGCACCA 

ATTA CCTTTN TNTCNTNACGTCCTAACCTGGCNACCCTTACNTCANAACATCCTAATTAC 

CTGGI I I 1 I I I I I I I GGATTCCCAATAAAAAGGGGGG 

Sequence 2530 

CCGCGGTGGCGGCCGCCCGGGCATGGTACGCGGGGTCCCTACAAATGCAACGTCTGCAAT 

AAAGTCTTCACCCAGCGCTGCTCTCTGGAGTCCCACCTGAAGAAAATCCATGGGGTGTAG 

CAGCAGTATGCCTATAAGCAGCGGCGGGACAAGCTCTACGTCTGCGAGGATTGCGGCTAC 

ACGGGCCCCACCCAGGAGGACCTGTACCT 

Sequence 2531 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTTGGAAGACCTTAT 

TCGATATCTTGAACCAGAGAGATGGCAGTTGGACTTAGAAGATCTATATAGGCCAACTTG 

GCMCTTCTTGGCAAGGCTTTTGTTTTTGGAAGAAAATCCAGAGTGGTGGATCTGAACCT 

TCTAACAGAGGAGGTAAGATTATACAGCTGCACACCTCGTAACTTCTCAGTGTCCATAAG 

GGMGAACTAAAGAGAACCGATACCATTTTCTGGCCAGGTtGTCTCCTGGTTAAACGCTG 

TGGTGGGAACTGTGCCTGTTGTCTCCACAATTGCA 

Sequence 2532 

GGAGCTCNCCGCGGTGGCGGCCGNCCGGGCAGGTACCNCGGGGGTGCCCCGNAAGCAGTT 

GTTGTTGGTTGGGGGCCTTTGGGCCGGTGACGGANACTGCCCAGGTGTTGGNCACCATGT 

TCCTCTNCGCGGTCTTTTTTGCCMGAGCAAGTNANATGNMCAMTAGTCCTTTTCGTG 

GMAAGAAMAAATNCGCTCCCTTTNAACGGTGGATTGAAAATGACTNTGNTTTATAAAG 

AGAANACTGAGGGCGGGGATACTGATTCANAAATNCTGTANCGTGTAATAAAAG 

Sequence 2533 

CCGGGCAGGTACAGCTGCATCAGCTGCTCGTAGGACATGTCCAGCAGCTGGTCGAGGTCC 

ACGCCGCGGTAGGTGAACTTGCGGAAGGTCCGCTTCTTCTTCTGCTCTACTTCTGCCACC 

CGCGTACCACGGCTATCCTTATAGCTTTTTAATTAMTGAAGCCAAGTGGGATTTGCATA 

MGTGAATGTTTACCATGMGATAMCTGTTCCTGACTTTATACTATTTTGAATTCATTC 

ATTTCATTGTGATCAGCTAGCTTATTCTTGTGTACCT 

Sequence 2534 

GACATGG CGCCCGCCG CCGCTTCCCCCCCGCGTAC I I 1 1 I M I I M I I I I I I M 1 1 I I I I 

TTTCTTTCI I I I I I I I I I I I I I I I I I I H I I I I I I I I I I I I I I | I | | || I I I I I I 1 1 I I I 

I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I NGGGGGAAATTANAAANCNTTNN 

TTCNAAAAAAAAAAAAAAAAAAAAAAANTAAAAAANTNTAAANAAAAAAAAAAAGGGGGC 

CCCNGNNCNNTAAAAAAAAANNGGNNCCCCCGNGNGGGNGNAANANTTANAAAAATNTTN 

NNANCCCNNNGANAANGGGGGGGGGNNCNNANNNAAMTTTTTTTNNTTTNANGNGGGGA 

AAAANGCNCCCTNGGGNAAAAAAA 

Sequence 2535 

CCGCGGTGGCGGCCGAGGTACTTCTCI I I I I I I I I I I I I I I I II I ACATCCCATATGACA 
TTGACTGATTAAAMGGCCTGATCGGTTTTGTTATCTTCCCACACTTCGTTTMTTTAAC

TTGTGTTTTGTATCTCTTATAAACCATAAGATAGGCCAGGCGCAGTGCCTCATGCCTGTA 

ATTGCAGCACTTTGGGAGGCTGAGGCGGGTGAATCACCCAAGGTCAGGAGTTCAAGACCA 

GCCTGACAACATGGTGAAACACTGTTTTACTAAAAATACAAAAAATTACCCCGGCGGNGG 

GGGNATGCCCCCTGGAAAATCCCACTTCNTCGGGANGGNTGANACAGGGAAAAATCCNTT 

TGMCCCTNGGMGGCAAAAGGTTTCNANNGNCTTANAAANCCNNCCAT7TGTTTNCCCN 

AGNCTNGGGNGAAAAAGGGCAAAAATTTCCTTCNCCAAAAAAAAAAAAAAAAA 

Sequence 2536 

GACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCCCGCCCGGGCAGGACAAAACAAA 

GACTTTTCATCMCTCTTTTAGATATGCTAGAAGAGCTAAAGGAAACCATGGACAGAGAA 

CAAAAAAATTAGGAAAGCAATGCCTCATCCAATACAGAATATCAATAAAGAGATTGAAAT 

TGTAGAAAAGAACCAAATAGAAATTCTGGAGTTGAAAAGTATTATAACTAAAACTGAAAA 

TTCACTAGAGGTATTCAGCAGCAGACTGGAGAAGTCAGAAGAAAGAATCAACAGGCTTCA 

AGATAGGTCAATTAAGATTATACAGTCTGAGGAGCAGAAAGGAAAAAGAATGAAGAAAAA 

TGAACAGAGCATAAAAGACCTCTGGGACTCTATCAAGCA 

Sequence 2537 

CCGCGGTGGCGGCCGCCCGGGCAGGTACAGAGTGTAAGCGAGCTACACCAAAGAATGGTG 

ATTAGCGCTCCCAGGGTGGGGAAAAATGAGGATTGTTTATATAGGCAAAATGGAGGTGCC 

MACAGAATTATMCATTTTCGGAACTAAGGCTCATTTAMGATACAAATTTGATTGGCT 

ACTATTGATTACACTTGMGGGGATTTGGTTTMCTTTCTTTTGTNAAAGAAAGNACNAG 

NNNNTAAGGNNNTTTANTTTCCCANCCNTTTAAGTCTTAGGTTTTTGAAAAAAAAANAAT 

ANGGGGGGCCTGGGNTTAAAATTTTNACCCTNNTAAACCCCNAAAAGGNCCAGAAAAAAN 

AMTAAAMNGTTTGGTNGNTTTTTCCCACCGACCCCAGGCTTGGCAAGGCNCCCTAACT 

TTTTTCNTCTTTNGGCCNAMTNCNATTTGGGGM 

TTTACCATTCCCCTNTTNTANTGGGNAAAANNTT 

Sequence 2538 

CCGCGGTGGCGGCCGCCCGGGCAGGTACATCACMCAGTMMTTTTGCTCTTTGCTTCT 

GGAGGAACACCCTACMCAGGTMTTAAAAAATTTAAAAATAATATGAAAGGGGAGCTAA 

ATACCTGAAAATTTAAACAAAATGAAGCAAAAAAAAGGGAAACAAATTCATTGCAAGAGA 

TGGAAATTACATTTTAAAGGCTGAAAATAATAAAGAAATTCACTAAACAACNGGCCCTAT 

TGNCCTGGCTCCTTCAGAAAAGATTACTGGAANGGAACCTGGGAAGGGGGTTTGNGGAAA 

TACAAAATTTTTGGGANTTNGGGGGGAAANTTANCACNGNGNGGAAANCCCCCCNGNACC 

CCCNI I I I I 1 I I I INGNGGNCCTTTTTTNAAAAAAAAANNGGGTTCCCCNGGGNCCCNAA 

AAAAAAAAANTAAAAAAA I I I I I I I I CCCCCCCCCCCTTTNAAAAANNGGGGGCCCCCCC 

CCNNNNGGNGGGGNATTTTTNNNTNNNN I I I I I I I INCCCCCCCCC 

Sequence 2539 

AGCCTTGGAGCAGGCAATGATCTCCACCCTCCTGGGTGCGGGTGCAGACTCAGGCATCCC 

TGCACTTTTGGGGGCCTGCTTTTGCAGGCTCGGAAGTGCTTGCCTCTGCAGCTTGGCTTC 

TCCCTGCTATTGGCACCAGCTCTCAGATCAAAGCAGGGGGGTAAGCCTGGAAGCCATGAA 

CAGGGAGGGCCTGAAGGCTGAGGGCCAGGCTGCCAGTCCCACCTGTAGGAGTGGGAACTG 

GTGCCT7TTCCAGGCCAGTAAGTATGCACTTNCTCCCCCGAGGCCCGTAAAATGCCCCAG 

GCTCAGCCAGAACTGAGCAGAGGGTGGGACGACCAGCAACAGG 

Sequence 2540 

CCGCGGTGGCGGCCGAGGTACTAGTCTCAAAAGCTGGGGACTCTGAGCCTTACCTAGAGT 

CTCAGCAGGTGGACCATTAAGATTAACATTTCTAGTAGGTGAGTTCAATCACAAAAATAT 

TTC TTGT TCCATAGATTTTATTGTGGCCATGTCAGTGMCACCCACMGTTTTGCTCAGA 

ATATTTTAGGTGTAAGCTAAATCCCTAAATTGTTCAGAGTTCCCACAGCCCTGTAGCAGC 

AGAGCGAGMCTTTAACCAGACTTTTTCAATCCCAAAGCTAATCTGGAGGCCAACAGTGT 

TCAAAACCTTGGTGACTGAGGAACCATTTAGAGTTTTTTCAGGCTCAGGAATCACATGGG 

TCGTTGTTGGGCTTGGGGGTAAGTTTCACAGGCGATGAAGCTTGACGTTGAGTCACTTGA 

CTTCTGGAGCCATAATTTATTTTCTCCCAGCAACCTCCT 
Sequence 2541

CCGCGGNGGCGGCCGAGGTACTCAATGGAGTTCCTTGGGGGGATCCAAAACCNNGGAGAG 

GAACACAGGCGCTGAGTATGGAGCAGNNTNGAGACCCACCCCACCTGCAGGCCTGACNAG 

AGCGCCNGGATNTCTGGTTCNGATGCCAATACTAGACACCCCAAGNAGNCNGCTCACAGN 

AACCNTTACCTNNTTTTNAACGCCCCTGGGCCTCGCNACGCGCACAGAAANGGGANCATA 

ANCNNCNGAAAGGNACCCACTGAAGCCCATTTCTCCGNNGAGNCANCNNGNAANNNNCAG 

CTNNCNCCCCGCTAAGGAAGANANCNAGNGNNGGGGANCCNGGCCCCNCAACCNCCNGNG 

GAACCCAGCAGNNCNGNNANANCCCCNGCNACCCNAGAAGAANNAAGACGGNNAAANACN 

GGGNGGAACNNNNAANAGANAACACACNNNNNAANAGAAAACGCNNAAAGGAAGNNAGNA 

CCNGNCCGGGCGGCCGNNCNAGAACAAGGGANCCCCCGGGCNGAGGAANNNGANACAAGC 

NAANNGAACCGNCGACCNNGAGGGGGGGNCCGGACCCAGNNNNGGNCCCNNAANGAGGGN 

AAANGNGCGCNNGGNGAANCANGGGCANAGCNGNNCCNGGGNGAAANGNAANCGNNANAA 

NNNNNANAAAAANNANCCGGAGCAAAANGGGAAAG 

Sequence 2542 

ACTATAGGNTTTTATNGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAGGACATTATTTTT 

T CTAT TCATCTAGCACNGAAGCCAAATTTGGGTCTAAAATCTTGTGATGGCAACCAGGAG 

CTTTTAAACTTCCTTCGTAGTGATCTCATTGAAGTTGCAACAAGGTTACAAAAAGGAGGA 

CTTGG ATAT GTGGAAGAMCATCAGAATTTGMGCCCGNTTCATTTCATTAGAGAAATTG 

AAGGATTTTGGTGAGTGTGTGATTGCCCTTCAGGCCAGTGTCATAAAGAAATTTCTCCAA 

GGCTTCATGGCTCCCAAACAAAAAAAAAAAAAGAAAAAAAAAAAAAAAAGTACAAACTTA 

GAAGAAAATTGGMGATAGAAACANGATAGAAAATGAAMTATTGNCAAGAGTTTCAGAT 

AGAAAATGAAAAACANGCTAAGACAAGTTTGGGAGAAGTTTAGAAAGATAGAAAAATNTA 

AANGCCCAAAATTGGGATAAATAGCNCTGAAGAAAAAAAAAAAAAAAAAAAAAA 

Sequence 2543 

CCGGGCAGGTACTAGANNACCTTCCTCGCCACTCTCTCCACATGAGAGAGTCAGCTGCCC 

TTTCTCCTGTGCCTCTGCAGGAAGAACTCTCTTGCATGGCACATCTCAGCTCCTCATTGA

GGGATAGTTTTCTTTGATMGAMCCTGGAGTCCATTTACTCTGACCTCTCTTTAAATCT 

ATATCCAGAGCCACTAGCCCAGGAAAAACTTGGGTGACCCGTAATTTCTCTTCTCCTGCT

GTCCTTTTGCTCTTACGCCCCACCCCAACTCCCCTTAAATTTTACAGGCTTATGACAGTT 

TGTATGTGCTCAGCCAATGAGCAGAAAACCTGGAAAGAATTTCTGGACTTTAGCCCACCA 

GTTTGTCTGGTTGACTAACCTGCTGAGAGCTAAAAATTGGCACCCATTGCCCCGTGCCTT 

CAGGCAGTCTCCTGGGGCAGAGTATGCCACCATCCGAATATCAGGCACTGAGTGGGGATG 

TGGGTG ATGCTCAACATGACTGGCTAGAGCTTTGGGGGTGGGNNGGGGGTTAAACTACTA 

I I 1 1 I I I GGCCATG ANCTNTTTCCCCTTCC I I I I I I I I I I AATTAAATAAANGGNTCAAN 

TAAATANTTCAAGGCCTGCCTTTNAAAANAAAT1 I I I I ! I I A 

Sequence 2544 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTATTTCA 

TATATTGTGTGAGCCCCACAAATGTCTATTTTAAAAAGAGTATAGTCCCTGGCCAGGCGC 

GGTGGCTCACGCCTGTAATCCCAGCAGTTTGGGAGGCCGAGGTGGGCGGATCACCTGAGG 

TCTGGAGTTCGAGACCAGCCTGACCAATATGGTGAAACCCCGTTTCTACTAAAAATACAA 

AATTAGCTGGGCATTGGGGGAGCATGCCTTGTAATCCCAGCTACTCGGGNGGGTTGNGGN 

GNGGAA AANAN CTTTGAACCCCCCNANGGCCAAAGGTTTTTTATTTGGGGCCCAAAAAAA 

ACCNCCCTTTTGCCCCTTCANCCCNNGGGGNNAANAAAAAAGGNGGAAACCNTTCCNCTN 

CCCCCAAAAAAAAAAA I I I II II I AAAAAAAATTTTAAAAATTTCNAAATTTAAA 

Sequence 2545 

ACTTAG G GCGAA TTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC I I I I I 1 1 I C I I 1 1 I II I 

Mil i i AGTTTTTGAAATCCCGTCTCTACTAAAAATACAAAAAAAAAAAAAAAAAAAAAT 

TGGCCGGGCATGGNGGCGCATGCCTGTAATCCCAGCTACTCAGGAGGCTGAGGCAGGATA 

ATTGCTTGAACCCGGGAGGNGGAGGTTGCAGTGAGCCGANATCGTGCCACTGCACTCCAG 

CCTGGGCAACANAGTGAGACTTTNTNTCGGAAAAAAAAAAAAAAAGATCTGGNGGGTGAA 

AATMCCNGAATGMAATAGCTTGAAAACTCACANGNGGGAAGCTCCCTTTTACCCTTTT 
TNTTCC CTTGGCCNGATGGAATCCNNNCCCTTTTTTANAAAAAAAAGGGAAAGNCCNCTT
Mill (NAAAAMNANTGGTTNTNAANATTTTNCNCNCCCCGGGGGGGGGGGNGTTATTA 
AAAAAMNGGGGNCCCCCCCCGGGGNGGGGMMTNNNTTNAAATTTNTTTNCCCCCCCC 
CCCCCNGGGG 
Sequence 2546 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCTCCATTGTTTCTAGGAAGTAACTAACTTGCT 

TTTGATT TTACA GGCTTGTAGGTGGAATGGGCTTGCCTTGTCTCAGATGAGACTTTGGAC 

TGTGGACTTTTGAGTTAATGCTGAMTGAGTTGAGACTTTGGGGGACTATTGGGAAGGCA 

TGATTGGTTTTGAAATGTGAAGATAGGAGATTTGGGAGGGACCGGGGTGAAATGATATGG 

TTTCGTCCTGTGTCCCTCACCCAAGTCTCATGTTGCAAGTCCCACAATTCCCACGTATTG 

TTAGAGGTGATTGAATAATGGGGGTGGGTCTTTCCTGTGCTGTTCTTGGGATAGNGAATG 

GGTCTCATGAGATCTGATGGTTTTAAAMCGGGAGTTTNTNTGCACMGCTNTCTCTTTT 

NCTGCTGCCATCCATGTAANATGTGACTGGC 

Sequence 2547 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGACCATGACACCA 

AGCATTCCTGTCC CCTCC CCCGCAATGGCAATCAAGTGTCACAGAAAAATAGCCTTTTAA 

MTTACCATGAGCCTTTTATTTTTATTTATTTTTACACCCA^ 

GATCACGGCTCACTGTAGCCTCAAACTCCTGGGCTCAAGCAATCCTCCTGCCTTGGCCTC 

CCAMGTTGGNTNAGGGTTGATGAGCCACTGTGCTCAGCCMCCATGAGGTTTTTAGAGA 

TGATCTTGTNTAMCCCTTCTCGCCTGTTTTGCAGAAAAGAAAATTGAGATCCCAGAGAA 

ATGAAGTAACTTGCCCAAGGTCATTCAGCAGGCAAGATAGAACTAGATCCCCAAATTGCA 

AACTAGCTGNCCAGAGTTCTTTCCTCAATGAGCAATTTAAAAGC 

Sequence 2548 

CCGCGGTGGCGGCCGAGGTACTAACTGTCTGGGATCGTAGTCGATTAAACAGAGCCACCT 

TTGTTCCTGA GGCA ATGCATAAGTCAGCATTTTTCAATGACTGCTTCTTTTTGGMGGTT 

TGGAGATGACTTTTATCCGCTTGCTGAGGAACACACCAATGTCATCACTGTTGCCATAGA 

ACATCTTTACAGACAACATGAAGTGCTTTCGCTTGTCTGAGTCAGATATATACAATGGTT 

TGGCTGGTGCCATMGTTCTTTTCNTTCCAGGTTMGCTGGCTGCATTTTNTTGGTCACT 

ATTTCCTATNCCAATAAATGCACACNGGTGAGACTCTTGTTCAAAAACAACCATCGCGGT 

CCATTTGGTC1 1 I 1 I I I I I rC TTCCATTCNCTGGCCATAAGATATCCAAGGGGGGNGGGC 

AMAAACTGGGAGTTATTTTGTATGCCAAAAAGACACAGCCAAGAGGACTTGNGGATCAT 

GCCCCC 

Sequence 2549 

ACTTAGGGCG AATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACI I I I I 1 1 I I I 

I I I M 1 1 I I I TGG CATAAAAAGTGCACMGTTTGTCCTATGTGTTAAAACATCTTAATTT 

GGATGTATTTTTATCCAAAATGATGCATACTTGGATGCATTAAAAGCACACCAACAAAAA 

TNCNCNCTNTMCAATMNGGGNNNNTTTNNAANGGNGNCGGGGAGNGGNAACAANATTT 

N GGTANAN GGGANNTTNTTNTTTTTTTCNAAANAAMGAGMTTTGCTTTNTAAAACAAN 

AAl I I I I I ICCCCNNCANATTTAATTTGAAAATNTGAAAGNTATNGGAAGGACANNCNCC 

AATTGGAACACNNTGTGCAAAAGTTCNANAACNAAANAAAAGATNNTTTTCTNGGTTNGG 

GCNCTNANNCCNTGNAATTCAANATTTGGGGNNGGNNGGNNCCCCN H 11 1 1 1 I GGGGGG 

GNGGGGGNTTTCCCNCCCCCCCCCCCCCANNNNGGGGGGAAANCN 1 I I 1 1 1 I I IAANNAA 

AAAAM 

Sequence 2550 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGGTGCTGAAATC 

CGGCATGTTCTTGTCACACTGGGTGAGAAGATGACAGAGGAAGAAGTAGAGATGCTGGTG 

GCAGGGCATGAGGACAGCAATGGTTGTATCAACTATGAAGCGTTTGTGAGGCATNTCCTG 

TCGG GGTG ACGGCCCCNTGGGNGGACNNCCCCCANNGGNCNTAAANGGGTNANAACCNTT 

CCNGTTTTCCCCAAANGCCCGNCCCCTTTTC 

AAANGTTNCCCTAGGNTTTNTTGTCNCANNMNTTTCCCATTTTGTTTNTNTGGGANGAT 
GTTTNGCCCGTCANNTTCCACCAAATAANANTTNCTTTTTGGNAAAAAAATNNTAAANTN 
NNNNCNTNNNNNNAAAA NCCCCC CCCCCCCCCTATNAAAANTNAGGGCCCCCCCCGGNNN
GNAAMMTMNANTAAATTTTTTNCCCCCCCCNCNCCNGGGGGGGNGGCCCCCCCCNCT 



I I I 1 1 I I I I I I NTAANNNANCCCCCCCCCCC 
Sequence 2551 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGTGTCTCACCACATCCTGGCTCCA 

GTGTGGATGCAGAGAGAGTGTGACAGAGGATCTGCCTGCGAACCACCTGGGATTAGTCAA 

GTCCCAAGGTGCCCAGAGTGGGACTAGTTCTTCACAGTGTGGCAGCTGCACTAATCTGTT 

TGTGAGGGMTA TCCAT TCCCTCACTCTACTCTCNTCATTATGGGMTTTNTTTTTGTTN 

CAAAATAAANCCCTTTTGTATAGANAAAAAAAAAAAAAAAAAACCNCCCCCCCNNTTNAA 

ANAANGGGCCCCCCCCNGGGGGNGNANNNNTTNNAAI I i I 1 1 I I I NCCCCCCCCCCCNGG 

GGG 

Sequence 2552 

ACTTAGGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGGGATCTTTTTGGCAN 
GAGCCCCAAGTAAAGCTGTCAATCCTGATGAGGCTGTGGCCATTGGAGCTGCCATTCAGG 
GAGGTGTGTTGGCCGGCGATGTCACGGATGTGCTGCTCCTTGATGTCACTCCCCTGTCTC 
TGGGTATTGAMCTCTAGGAGGTCTTTTTNCCAACCTTN 

NNCCCAAAAAGNGCCGGGTTTTTTTNNTCCCCCNTNNNGGGNAACCCNGGGGGGAAAAAA 
MANGTGGTTCNNGGGNAMAAMAAANTNTTTTAAMAAAAAMNATNTTTTGGNNNNN 
A i i m i ii i i iGGGANAGANNNCCCCCCCCCCCCCCNGGGGGGGNNTNTNTNAAAGAANN 
TTTNNNTTTTNCA 1 I I I FNNCCCCNNNNNGATAAATNCCCCCCCCGGNNGCCTATATAAA 
AAAAAAANCNCCCCCCCCGGGNNAANAANTTTTNAMTATTTTTTCCCCCCC 
Sequence 2553 

CTTAGGGCGAATTG GAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTC I I I 1 1 I I I I 

I I I I I I I I I I I I I 1 1 AAGACGGAGTCTCGCTCTGTCGCCCAGGCTGGAGTGCAGTGGCGG 

GATCTCGGCTCAGTTCAAGCTCCGCCTCCCAGGTTCACGCCATTCTCCTGCCTCAGCCTC 

CCGAGCAGCTGGGACTACAGGCTCCCATCACCACGCTCGGCTAAGTTTTTGTAATTTTAG 

TAGAGACAGGGTTTCACCATGTTAGCCAGGATGGTCTCGATCfCCTGACCTCGNGATCTG 

CCCGCCTCGGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACCGTGCCAGGCCAATA 

TGAMGTTTTTAACTTATTGGCACAANAAGTTTTCATC 

AAGTATTCTTCAATAACATTCCCTT 

Sequence 2554 

CACTACTTAG GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTTTTT 

I I I I I I I 1 1 I IGAGACGGAGTTTCACTCTTGTCACCCAGGCTGGAGTGCAATGGCGCAAT 

TAGGGTTCACTGCAACCTCTGCCTCCCGGGTTCAAGCAGTTCTCCTGCCTCAGCCTCCTG 

AGTAGCTGGGATTACAGGCATCCACCACCGTGCCCAGCTMTTTTTGTATTTTTAGTAGA 

GACGGGGTTTTGCCATGTTGGACAGGTTGATCTCAAACTCCTGACCTCAGGTGATCTACC 

CTCCT CGGCC TCCCAGAGTGTTGGGATTACAGGCATGAGCCACCATGCCAGGCTGCTAAT 

TCTCCi I I I I AGGNGAGTTAGGGGAACTGAGCCTCAAAAAACTTAAACGATTTCTCAAAA 

MCACCTCAAGTGATMAGTGGCCACATTGNMAGGGAGTTTTTATCTTTTTATTGNNGG 

CCCAGGGGTCATTGGACAAAATCATGCTACCTNTTGGATTTTAAAATATTCAATTGGCAA 

A 

Sequence 2555 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGACTCAGAGGC 

CGCCATCAACCCGCCAGATCAACCTGGAGCTCTACGCCTCCTACGTTTACCTGTCCATGT 

CTTACTACTTTGACCGCGATGATGTGGCTTTGAAGMCTTTGCCAMTACTTTCTTCACC 

AATCTCATGAGGAGAGGGAACATGCTGAGAAACTGATGAAGCTGCAAAACCAACGNNGGN 

GGCCGMTCTTTCTTCAGGATATCAAGAAACCNAACTTGTNATTACTGGGAAAAGCGGGC 

TTAATCAAGGGGGGTGGGCCTTTANNTTTTGGNAAAMMGGGNGAATTNATTTCTNTTT 

TTGGGAACAAGNAAAAACCTGGGCCAAAAAAAAA 

Sequence 2556 

CACTACTATAGGGCGAATTGGAGCTCACCGCGGTGGCGGCCGCCCGGGCAGGTACTTTTT 
i n N 1 1 1 1 1 1 I I I I I I 1 1 INAGACAGGGTCTCCCTNTGTCACCCAGGCTGGAGTGCAGT

GGCACAATCATGGCTCACGGCAACCTCGACCCCCTGGGTCCAAGTGATCCTTCCACCTCA 

GCATT CCACAAGATGATGGMCCACAGGCATGCACTACTATGCCTGGCTMTTCTTTTAT 

TTTTTGNGCAGACAGAGGTCTCCCTATGTTGCCCAGGCTGGTNTTGAACCCCTGGGCTNA 

AGCTATCCTCCCGCCTTGGCCTCCCAMGNGTTGMTTANNNGGAATGAGCCACTTTTTG 

GGCCTNGGCCTCNANTTAATTTNAAMNGGGNGTTTGNTTTNAACCNCCGCCCCTTTAAA 

AAAMGGGGCACCCCCCCGGGNGNGGGGNAATTTTNMNNNNMTTTTTTCCCCCCCCCC 

CCCCCGGGGGGGGGGGNGCCCCCCCCCC I I I 1 I I I 1 I I 1 1 1 I AANNAAAAAAAAANCCCC 

CCCCCCCAAAAAAAAAAAA 

Sequence 2557 

ACTNCTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACTTCTTGTCAAATT 

CCTTCTTGATATGAGCCGCAATGCCCCTCTCTATGTTGTATTTCTCCAGCGCCTGAGTAG 

CGCACTCCACCGAGTCCTGTTGCATCTCTTCCGACATGTCCGCATTTTTGATCACGGCCT 

TTCGGTCGCCCCGCGTACCTGCC 

Sequence 2558 

CTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACTACTGCTGAGGTCTNCAG 
GACAGAAGNCACCTCCTNTGGTAGAACATNCATCCCTGGNCCTTNTCAGNCCACAGTTTT 
GCCAGAAATATCCACANGAACAATGACAAGGCTTTTtGCCT 
Sequence 2559 

CCGGGCAGGTAC I I I II I 1 1 I 1 1 ! i 1 1 i ! I I II 1 1 I 1 1 AGGGGACACGGTCTATCCGCAA 

GCTGNGGATTCTGCCATTCTCATCATAGTTTTGAAGCCGGGGTTCTGAAATCTCATGACA 

GATCTCCAGCTTGAACTCACGGAATGGTTTTTCTAGGCCCTGCTCATAATACAGCTGCAG 

GTAACCAGNGTCTGTCAGTTTGACGAANATCGGTCCCCAGTGCCTGGAGGACATGATGTT 

TTTNTTCTCAGGGATCCTCAACATNATTGGCCACCCGTCACGAGGCTGGGACCGTGCTGA 

TCCAGGCGGGTGAGCATNTAGTTCAATCCAGGCTACTGGGTCATCATCAGGTAGAGTTGC 

ACTGCCAAAGTG ATCA GGGTCATTMTTTGGAGTTGTTTGAGTTTTTCAACAGCATCAGA 

AGTGNGACTGGGTTTTGCTTTGAATCATCNAAACTGGATGGCATCCTTGGNAGATGACAA 

TGAAGGGAATNTTNTTTGGCTTT 

Sequence 2560 

CCGCGGTGGCGGCCGCCCGGCAGGTACTGGGCCGGGTGCAGTGGTTCACGCCTGTAATCC 

CAGCACTTTGGGAGGCCGAGGCAGGCAGATCACGAGGTCAGGAGATCGAGACCATCCTGG 

CTAACACGGTGAAACCCTGTCTNTACTAAAATTACAAAAANTTANCCGAGCNTGGNNATG 

GNNGCTTGNATNCCAANTTNTTCGGAAGCTTAGCNAGAAAATNGCTNAACCTTGGANGNG 

GAG CTTTGAN TGAGCCAAAATCCCNCCNTTGNACTCANCCTTGGGGGAAAAAACCNGNAA 

NTCCi till I AA AA AAA AA A A A AA A A AA AAA AAG G G N CC N NTCG G CCCT NTTTT AAAN N N 

TTGGGGACNCCCCCCGNCNNNNNGGNAA I I I I 1 I I I NAAACTTTTTTTNTNCCCCCCCCC 

CCCCCNGGG 

Sequence 2561 

NCTACTTAGGGCG AATTGG AGCTCCCCGCGGTGGCGGCCGAGGTA CI 1 M I 1 I I 1 1 I I I I 
TTGATTGTTATtTAGTTTTTATTTCATAATCATAAACTTAACTCTGCMTCCAGCTAGGC 
ATGGGAGGGAACAAGGAAAACATGGAACCCAAAGGGAACTGCAGCGAGAGCACAAAGATT 
CTAGGNTCCTGCGACCAANGGGGGGGNGGGNGCCNNNNNNNCCCNAAAAAAAAAANNTTN 
GGGGGTT NAAAAAAANCCCCNTNN I I 1 I I I I I 1 1 1 GNCCCNCCCCCANCCCNGGGGGGGG 
it I I 1 1 I iGNGGGG GGGNTTTG CCNATNNCCGGNANCCCAAAGCGCTCATGCCNCANAAA 
- NAANANTC CGNGC CI HUM ICNCTTGNGNCAAACCNCCANTGNGCTTTNGACNNCCAC 
CCNC TTCATTT TTGGGGGGGGGCAMAMMMANCCCGGGCNCNCTTTNGGGGGGGGGN 
CCCNI I 11 I I I NNAAAAAAAAAANANNCCCCCCCCCCCTTTTTTTGNGANNAGGATTTTT 
Mill 

Sequence 2562 

ACTTAGGGCGAATTG GAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTAC I I I I I I I I I I 
nTTTTTTTTTTTTl I AGGGGACACGGTCTATCCGCAAGCTGGGGATTCTGCCATTCTCA 
TCATA GTTTTGA AGCCGGGGTTCTGAAATCTCATGACAGATCTCCAGCTTGMCTCACGG
AANGGTTTTTTTAGGCCNTGNTMTMNNNAGNTGCAGGNAACCAGGGTTTTGCNATTTG 
NNCAAAAANCGNNCCCAANGNCTGGAGGACANNNTNNTTTTNTTTTNAAGGATNCTTAAC 
ANTATTGGCCNCCCGTAACMGGGTGGGACCGNCCTTATCCAGGCGGGNGGGCTTTTTAG 
TTCAATCCCGGCTCTGGGGNA NTAN TNAGNAAAAGGTNCCTTCCAAAGGAANNGGGGCAA 
NMTTGGGNGGTNNTTNGGNGTTTITCMCCCNNANAAMGGGGGCCGGGGGGGTTTTTT 
TTAAAANAAAAAAAAATGGGGGCNTNTTGGAAAAAANAAAAAAGGGAANN 1 1 1 I 1 1 1 I 
Sequence 2563 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCAGGTGGAGAGAGGGTGGAAAGTGA 

GCAGCGGGCTGGGCTGGAGCCGCACACGCTCTCCTCCCATGTTAAATAGCACCTTTAGAA 

AAATTCACAAGTCCCCATCCACAAAAAAAAAAAAAAAGAAAAAGTACTATTCTAGTCTTA 

AGMTAMGACCNTGTTCAGCCAGGCGNGGGGGTTGNCNCNTGTANTCCNACNCNTTNGG 

GNGNTTGGGGGGGCAAAATCCGGGGGTCAGGGCATTCGAGACCAGCANCCCGGGGTCAGG 

CTTTCGAGACCCAGCCTGGCCAACATGGTGAAACCCTGTCTNTACTAAAAATACAAAAAT 

TAGTTCGGCATGGGTGGTGCACACCTGGTAATCCTAGCTACTCAGGAAGGTGAGGCAGGA 

GAATCACTTGACCTAGGAGGCGGAGACTGCAGTGAGCCAAGATTATGCCAATGCACTCCA 

GTATGAGTGACAGAGCAAGACTTGTCTCCAAAAAAAAAAAAAAAAA 

Sequence 2564 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGGGAGCTC 

GGATTTTAAGGCAGTAGCCTGCTGATGCTCCCAGCTGAATAAAGCCCTTCCTTCTACAAT 

TTGGTGTCTGAGGGGTTTTGTCTGCGGCTCGTCCTGCTACATTTCTTGGTTCCCTGACCA 

GGAAACGAGGTAACTGATGGACAGCCGAGGCAGCCCCTTAGGCGGTTAGGCCTCCCTTGN 

GGAGCATCCTTGAGGCGGTNTCCCCCCCCCCGGGGCCCCCTTNAAAAANANNCNCCCCGN 

MAAAMNACCCCCGGGGGNNNCCCCNANNNMCGGNGGGGNCCCTTTTTTGGGGNNNAA 

NAAAANNCCNNCCCCCACCNCAAANAANNACTN I I I I I I I 11 TCNCNTNNTNNAAAAAAA 

AAAAAAAAAAAAAAMAAAMNNTTTCCNCGGGGGGGGGNTTTNNTAAAAAAAAAANNGN 

GCCCCCCCGNNNGGGNGAAANmTTTTNAAAATTTTTTNCCC^ 

Sequence 2565 

GCGAATNGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTG I I I I H I I I 11 11 1 I I I M I 
GCTGGTAGGAAATGCTTTAATAAAAATGCAATCTCTAAGGGGCCATGGCATCATTAAAAG 
AAAGGATGTCATGCCCAGATCCANAACTTGAAGGTGGCNGGCACCAGCAAGCACCATANT 
TTTGMTNGGCCTTNCTTTNCAGGGTCCTTANT^ 

CTTGAAAAMTGGANNAACNTGTTCNCCNCCCTTGGGTTTNTTAGTNGGGAGGGAACTTT 

TNGTCCANCTAAMAATNTTGNNCGNGGGCCCANTTTNGGGGGCNTTTNGTTTTTTNANN 

TTGG CCCCNNGGGGGGGCNGGGGGGGNNNCCCCTTTGTTNNNCNNNNTANGNNNAAAANT 

TTTTNNNTCNGGGGNAAAAAAAAAAAAA 

Sequence 2566 

NCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGAAGAAGAGG 

AAGAAGAAGAAGCAGAAGGAATGGAAAGCCTGGAGAAAGAGGATGAAATGACGGATGAAG 

CAGTTGGAGACTCTGCTGAGAAGCCTCCTTCTACTTTTGCCTCACCTGAGACTGCTCCAG 

AAGTGGAGACCAGCAGAACTCCACCAGNTTGTGAACCCCCAACCCTTCAATCAAGAAAAG 

ACCTTTGATCAGGAGAAGACTTCTCGTTTCATTTTGGGGACACATTCAGGATTTCTTCAA 

AAGCAGTGAGTAACATTGAACCCCTCTTGCTTGCTATTCTGGTCTNTCTTCTNATANAAA 

TTGGAANAAMCCCCCGGNCCCTTGGAGCCTTTTAAATGCCAMNGNNNGCNCTTTNAAA 

ANTTAGGGNANCCCCCCGGGCGGGGGGGNA 1 1 1 1 I J J I AAAANTTTTTTTCNCCCCCCCC 

CCCNTGGGGGGGGGGCCCCCCCCCNN II I II I I I I I I I NTTNNAAGNNNGNGNANANNCN 

CCCCNNGGGNAAAAAAAAAAA 

Sequence 2567 

CTATA GGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTNCTTTTTTT 

TTTTTNTNGNGGACTTGGGCTT CTCTG CCCATATTGCANTGTTGATGTTCCCAGAGTTCT 

ATCCTTACTCTAAANGATCTCCATTTTTGAGCTTATCCACACAGNGGACTGTGGNTTNCT 



WO 01/070979 



PCT/US01/09126 



TABLE 1 

419/467 

GGTCTTNGNNCTTNTGCCTAGGACNTNCTTTCCTAGGTANCCACATGCCTGCTCTCATTA 

TNTTCATGTCTTMCTCANAGGNCMCTTTGANTTMNGCTTTCTATNACTACATTNAAN 

ANNGTGNGACCTTTCNTGTCTNCCCANCTNGCCTGATNTTTNCTTNTrCCATTGCACTTTN 

CTTNTGACTTCT 

Sequence 2568 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGGATT 

TCCGNTGAGTGACCCTTACAAGTCCTTCTTGATCCTGAACTGGGTTAGGTGCCGCTGNTG 

CTGCTCGTGTTGAATCTAGAACCGTANCCAGACATGGNACTGGAGGACGAGCAAAAGATG 

CTTACCNAATCCGGAGATCCTGANGAGGANGAAGAGGAAGAGGAGGAATTAGTGGATCCC 

CTAACAACAGNGAGAAAGCAATGCNAGCAGTTGNAGAAATGTGNAAAGGCCCGGNAGCGG 

TTAGAGCTCTGTGATGAGCGTGTNTCCTCTCGATCACATACAG 

Sequence 2569 

CCGCGGTGGCGGCCGAGGTACATTATGTTTTCGTTTTCATTCATCTCMTAGTTTCTTCT 

TiGGAGATTTGGGGGGAAAATGATAGACAGGAGGCAGGACTAGATTGCAGCCCTCACCCGG 

ACAGACAGCAGCTCACAGGGACTCGCATCATGAACTTTGGCTCCAGAACTACTGCAGGAA 

TATACCAGGAAAGCCAAGAGAATCCACAGACCCTCTGAAGGAAGCAGATTGCTCCTTCAG 

GACCCAAGAGACACCCTAAATACTGTGTTGGTATCGTTGGCAGAGAAACCTCAAGACGGT 

TCACATTACAGGACTCTGTGCAGACAACCCTCGGTACCTGCCC 

Sequence 2570 

AGGTACTACAAGAACATCGGTCTGGGCTTCAAGACACCCAAGGAGGTGCGGGGAACCTCA 

GAAGAAAGAAGGGGAACCTGGCGTTCCTGCACGTGTGCCCACGACGAGTTGCCCTGCCTG 

CATCT AAGT GGCTTCTGGGGCTGCTGGGAATTGTAGTTGCTTCCCTGAGGCCACGCCCCT 

GGCTNTTTTAAGGMCCGCCCGCCCAAGGCTCACTCCTTTATCTTTCCTATCCTTTCAGG 

CTATTGAGGGCACCTACATTGACAAGAAATGCCCCTTCACTGGTAATGTGTCCATTCGAG 

GGCGGATCCTCTCTGGTAAGTGCGGGAGTTACTGGTGTNTGGGGCCTGAAATACTGAAAG 

AAGGGTCTTGGGGCCCA 

Sequence 2571 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCNGGCCGGGTACCCCCATNACAG 

NNGNCAANCNGNCTGAGGAAGTGGCNNCTACCANGCAGGNGATCTTNCNGGAGCGGTTGG 

CNGCAGNGCCNGAGTTNCACCGGTNTTGNGCCCCTGTTCAAGTNCTNACCTGANCCCATG 

GCCCTCAC CGNGT CANAGACGGAGTNTGTNATNCGNCTGCACCAANCACACNTTCNNCAA 

CCACATGGTTTTTNANNTNGNCTGCACAAACNCACTNAATGACCNTACCTTGGANAATGT 

NNCAGTGCNNATGGAGCCCANTGNGGCCTATGAGGTGCTCTGTTACGTGCCTGNCCGGAG 

CCTGCCCTACNANCANCCCGGGACCTGCTACACACTGGTGGCACTGCCCAAAGAAG 
Sequence 2572 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGNNGNGCCCQGGACGCACAGNAAANN 

CNTGTNTTNGTNGGNNTNTCTATNAAAAAGGCAATCAAGAAAGATAATGTGAAAAAGANA 

GGAATTNATAGGTCGGGAANANATGAATGTCAAGACATTTGAAGAACTATAGTAAAATGA 

TCAACACTAAATATACTNAGAGAAANCTTTGTTAATATGCCAATGAGGTNGGCCTGATCT 

TTGAAATAGTGAATAGGAATNCAATGCATTTCCTCAGTGATCACTGATTANGAATGAGTT 

GGTNNGGATCCTTGGGA 

Sequence 2573 

TAGG GCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAGGAAGCCTGGTTGAAGTC 

TTTTCCTTGAGAGGCCTGGATTTGTTCACATATTCAGATTTCMCTGGCTTTATAACAGG 

TTTCCT TATGT GCCTCATATTGGCTCCAGGTTCACTGGGTTTAAAAGGAGCCAAAGCACC 

ATMGGTTTTGGCAAAGGAAGAGTGGCATCTGCTTCTGGGATGTAGGCATTCGGACGCTG 

CTCT GCAG TTGTGTAAACACCATTGGGCAGCTGGCGATTATCTGCTTCCAAGAACTCCTG 

AGACTTTTCAGCGATGGCCAAGGCGTGCATTTCTTCATCTCGAAGGACTTTCAACCATTC 

TTTCTCAATTTCCTTATTGAGTGGCAGACCTTTTTCTA 

Sequence 2574 

TGCCGGTGGCGCCCCGCAGTCCGCTTGTCCGTCCTTNCTCTCTGACTGNGGTACNNCGGG 
GCCTGTCTCTGCAGAGGTCANGGGAGGCNGGGGCCCCAGCACACGTTCCNCAGTGGCAGC

GNGGAGCGGCAGCTTACGGGTTCGCGGGAGCCCCCGACCCCCCAAGGGCTAGAGGAGCNC 

TCGGCGGACCAAAGAAAGCCCGCGAAGCGGNTGCGCGCCNACCAATAGGTGCGGGGCTCG 

GAGCCNNACAAACTTGCGCCGTACCTCCNTCCGCCCCCGCCCCGNCNCCNCCGCCGCCGC 

CGACTCCCCCTACTCCTGCTCCTGCCTTGGCTCCTCCGCCNAACGTNTCGCACTCCGAGA 

GCCGNAGNGGCAGCGGNNCCGTCCTGCCTGCAAAGAG 

Sequence 2575 

CCACGCGTCCGTTCCCAGNCAGTTCCATCCAAGGGAGAATTAAGTAGAGAMTTTGTCTG 

CAATCTCANNGGNNNGACAAATCTACGACACCAGGAGGAACAGGAATTAAGCCTTTCCTG 

GAACGCTTTGGAGAGCGTTGTCAAGAACATAGCAAAGAAAGTCCAGCTCGTAGCACACCC 

CACAGAACCCCCATTATTACTCCAAATACAAAGGCCATCCAAGAAAGATTATTCAAGCAA 

GACACATCTTCATCTACTACCCATTTAGCACAACAGCTCAAGCAGGAACGTCAAAAAGAA 

CTAGCATGTCTTCGTGGCCGATTTGACAAGGGCAATATATGGAGTGCAGAAAAAGGCGGA 

AACTCAAAAAGCAAACAACTAGAAACCAAACAGGAAACTCACTGTCAGAGCACTCCCCTC 

AAAAAACACCAA 

Sequence 2576 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAGCCTCCTCTATTCCTCCCA 

ATCTCAAGGTTACTCTTAAATACTAGTAAATGCAAAAAGAACTTGTAAAGTGGCAAGGCA 

TGGCCTATCAAAAGTCAGCCCAAGGGCAGTTTTCAGCCCTGCCTCACCTGGGTCTAGTTC 

AGCTGACGGATGAGCTGATTGATGCGTTCACCCCGATAGCCAGGTGTGCCCATCTCCTTG 

AGGAAGCCCACTCTATTTTTGGTAGCATGACGGGCCACTGAGAGGTGGAAAGGGCACAAG 

AACCATGAGATCTCCTGGAAATGCTTCCCTGGGAAGGCAATTTCATGAATGAGGTCTTCC 

AAGCAAATGACGCCAAACTTCCCCAGGTGCTCCTCAATCACTGTACCTGCCCGGGCGGC 

Sequence 2577 

AGGTACAGAGTCTTTTGCTTCCTCCCACCCCTAGGGGGAAAAACTGCTTTGTGCTTTGGG 

AAGTTGTCTCTGAAACCCGGGGACAGAGGACGCAGGACAGACTAGGAGGGAGCCGGGAGC 

TGCCCGGCGGGTCATGGGAATAACGCCGCCGCATCGCCCGGTCGGCATCGTTTATGGTCG 

GAACTACGACGGTATCTGATCGTCTTCGAACCTCCGACTTTCGTTCTTGATTAATGAAAA 

CATTCTTGGCAAATGCTTTCGCTCTGGTCCCGTCTTGCCGCCGGTCCAAGGAATTTCACC 

TTCTAGCGGGCGCAATACGAAATGCCCCCCGGCCGCTCTAGAAACTAGTGGGATCCCCCG 

GGGCCTGCAGGGAAATTTCGGATATTCAAAGCCTTATTCGAATACCCGTCGACCTCGAGG 

G 

Sequence 2578 

CCGCGGTGGCGGCCGCGCGGGCAGGTACCCCGAGTCCAGCGGAGACAAAGGAGTTAGAAA 

GAGACAGAATAAGAGTTTAAAAGGCAGGTCCAGGGGACCGGAGCGTTGGAGGCTTGCTCA 

TGGCCCAGAGCTCTTTGGCTCCGCCCAATTTATTGATTTACMGCTCTTTGTTCTTAGGG 

CAGATGGGAGGGGTAGGAAGGGATGAGGAAAAGGATTAATCAGTGAAGGAGAACTCGTGA 

GTCATTCAATAATATGTATAGTAGTGGTGGTTTCTGTGAATTTCCTTGAGTAAAGGCGTG 

TGTCTAAACTACTCAAGATCTTTAACTTATCGGNATTGAAATGGATGGG 

Sequence 2579 

AGGTACGCGGGATAGTGAAACCCCGTCTTTACTAA I MM I GTATGTTTGGTAGAGACAG 

GGTTTCACCGTGTTGGCCAATATGGTCTCGATCTCCTGGCTCATGCCTGTGGNCCCAGCA 

CTGTGGGAGGCTGANGCAGGAGGATCATNTTGAGGCCAAAGAGTTCGGGATCAGCCTGGG 

CAACATAGTGATACCCTATCTCTTAAAMAGAAGMGTTTTTAMTTTGAAATAATAANA 

GGTACCTTGCCCGGGCCGGCCGCCTCTAGAAACTAGTGGGATCNCCCCCGGGCTGCAAGG 

GAATTTCGATATCAAGCCTTATCGGATACCGTCCGACCTTCGGAGGG 

Sequence 2580 

CCGCGGTGGCGGCCGGGCAGGTACGCGGGATGATTGAATTTTGTTTCCGCCTAAAATAGT 
AATCTATMGATATAMCTCGAGTTAGGGTTTACATTTTTTACTTATGAACACAGGGCAC 
TAGGGCCACTTCAGTCTMTTTCCTGCTTTTTAATTACTTTAACACTCCACAGGAGGAGG 
ACTGGTTTTCCTCTGTGACTTCCTAATGTATGGCAAGCAGGACTTCTTCTAATCCACTAC 




CCTCTTCCCCTAGCTTMCTMGGCTTGCAGTAAAATTATAMTTTCCACTTTCTTTCCT 

ACATTCTCAAATGTAGGAAATGAGGACAAACAACTTCCTCTCTCCAATTTACAACACTAT 

CMCTAT7TGTCCTTTATTGTGCATTTCAGACACAGGTGTTTTAATTGNTAATCATGTTT 

TTACACTGCAGTGGATGGCAGGTTTT 

Sequence 2581 

CCGGGCAGGTAC 1 1 I I I 1 1 I I M I I I GAGATGGMTTTTGCTCTTTTGCCCAGTCTGGAG 

TGCAGTGGCATGNNCTCAGCTCACTGCAACCTCCACCCACTGGGTTCAAGCGATTCTTNT 

GCCTTAGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACCCGTGCCTGGCCCTGTCT 

CTCTTAAGAGTAGGTTCATTGTCTGTCTTAGAGTCACTTCTATTGCMCTCATTTTCTTT 

TTCCAGGGCACAGATCGACCAAGCTGCCGGTTCCCTATTCTGCAGGGACAGGGACTATTT 

CTAGCATACCTGCTTTCGTCCACCCAGGCAGGGGTTTGGGGGTGGGTCTNTTCTGTGCCT 

GCAGTCCCCCATTTTGACACTTTGGGTTGCNCACCCATTTTTTGGGANMTNATTTC 

GGGAAATGAAGGCTTCCATTGGG 

Sequence 2582 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTCTTTC1 I I 1 I I I I t I I I I I I 1 

TTTGAGATGGAGTCTCGCTCTATCGCCCAGGCTGGAGTGCAGNGGCGCGATCTTGGCGCA 

CTGCAAGCTCTACCTCCTGGGTTCACACCATTCTCCTGCCTCAGCCTCCCAAGTAGCTGG 

GACTACAGGCGCCTGCCACCATGCCTGGCTAATTTTTAGTAGAGACGGGGTTTCGCAGTG 

TTAGCCAGGAAGGTCTCAATCTCCTGACCTCCTGATCCGCCCGCCTCGGCCTCCCAAAGT 

GCTGGGATTACAGGCGTGAGCCACCGCGCCCAGTTGTGCAT7TCTGGTTTCTAAGAATCA 

AACCACTTG^CTGTTTTTAGAAGTTA 

1 I 1 1 1 1 1 I II I IGAGACAGAGTCTTTGTCNCCCAAGCTTGGAGTGCANTGGTGCAATCTT 
AGCTCCCGGGGTTCMGCAATTTTTCTGCCTTAAGNCTTCTGAGTAAGCTAAAAATACAG 
GNGNGCCNCCACCCCGNTNGGCTTATTTTT 
Sequence 2583 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGCCAGGTACGCGGGGGCACTCAG 

GGAGCTCAGATTTTGAGACAGTNGCTGGCCGATGCTCCCAGNTGAATAAAGCCCTTCCTT 

CTACNAAAAAAGAAANNGAAAAAAGAANACAGGATATCTGAAATTAAGACNGCNGATGGA 

GNNGTTTCTNNAAATGACAGGGNCCAAGGNGNGACCACGGGACCAAGNGGCTGAACTGGN 

ATGAAGTTAAGAAGCAGNAANAAACATCCNATAATATGGTGATCAGNTCAACAGAATGAC 

ATATTNACCATGTNCCCNAGGAGGNGATGACTGAGATTTCAAATT 

Sequence 2584 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCTGGATAGCCTCCAGGCC 
AGAAAGAGAGAGTAAGCGCGAGCACAGCTAAGGCCACGGAGCGAGACATCTNGGCCCGAA 
TGCTGTCAGCTTCAGGMTGCCCCCCCGCGTACTTTTTTT 
Sequence 2585 

CGGCTGCGGCGAGCCGGTATCAGCCTNACTCAAAGGGCGGGTAATACCGGGTTATCCACA 

AGAATCAGGGGAATAACCGCAAGGAAAAGAAACATGTGGAGCCNAAAAGGCCAGCAAAAG 

GCCAGGGAAACCCGTTNAMAAGGCCCGCGGTTGCTTGGGCGTTTTTTCCAATAGGGCTT 

NCGNCCCCC 

Sequence 2586 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGGATT 

TCCGCTGAGTGACCCTTACAAGTCCTTCTTGATCCTGAACTGGGTTAGGTGCCGCTGTTG 

CTGCTCGTGTTGGATGTAGAACCGTAGCCAGACATGGGACTGGAGGACGAGCAAAAGATG 

CTTACCGAATCCCGGAGATCCTGAGGAGGAGGAAGAGGAAGAGGAGGAATTAGTGGATCC 

CCTAACAACAGTGAGAGAGCAATGCGAGCAGTTGGAGAAATGTGTAA 

Sequence 2587 

CTATAG GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC I I I I I I I I I I I I I I I I I 
MINI NGGGT ATGA AAACTTAGGGACTAAAATTAATATAAAAATTGGCATAATGTTGGA 
TTGMTCTA CATTTTG GCAGAAGTTAAACA TTCC CACATAATGTCAAAATTATACATCAT 
GCAGTTCTG i I I I l I I GTTTGTTTTATTTTGTTTTGTTTTTGAGTCTGGCTCTGTCACCC 
AGGCTGGAGTGCAGTGGCGTGATCTGCAACCTCTGCCCCCCGGGTTCAAGCGATTCTCCT

GCCTNAGCCTCCCGAGTAGCTGAGATTACAGGTGCGCGCCACCACACTTGGCTMTTTTT 

GTATTATTAGTAGAGACGGGGTTTCGGCATGTTGGCTAGGCCGGTCTCTCCTGACCTCAG 

GGGGATCAGCCC 

Sequence 2588 

TAGGGCGAATTGGAGCTCCCCGCGGNGGCGGNCGNNGNACANAATAANGCCTGTCACATA 

TTMGTNTGTMTMCGCATTTATTATTACTTATCAGGGTATGATTTATGAATTGNGGAA 

C CTGN GATTATGGGAGAGTCTGGCTTCAATCAAGGGCTGAAATTCCATTTCCACTGACAT 

CTTTTNCTTCCCCATCCCCCGATTCTGTCCTGCAACAGGGTAACAAGAAGGGGCCCTTAG 

GCCGTTGGGACTTTGATACCCAGNAAGAATACAGCGAGTATATGAACAACAAAGAAGCTT 

TNCCCAANGNTGCATTNCAGTATGGTNTCAAAATGTCTGAAGGGCGGAAAACCA 

Sequence 2589 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATAGTGTCGCGAACTCA 

AATCGGCATTTAGATAGATCCAGTGGTTTAAACGGCACGTTTTTGCCTATAAAAAAAGTG 

C AAAA AAGATGTGGTTTACMGTTAAAGCTACAGMTCCCTTTTTGCTGTAATTGCACCA 

GTTTTAAAGCCTCTGGACAGAGCAGTATTTCGTTTAAAACTTTGTTTTrCTTAAAAGC^ 

ACAGTGTTTGGCTAATTCTCCTCCCCTTTTTACAAGACGGGGGCCGGAGGGTGGACACTG 

GTGGCAGGTTAAGGGATACTGTCACTTTAAGAAGCCTGCAGATTGAAGTGTAAACATGGA 

GAMTTAGGGGCTGATTTTTTAAACTGTGTGAGATATTAACCAGCCGCCCTGTTATAAAA 

TCAGGGAMTCCAMCAGCGATTTACACCCGATTMCACCCCGTTTATATATTTTTTACA 

AAAATACACTGGGGAAAATAATCNAACCGNTTTCATCTCTCTTGG C I I II 1 1 GG I I 1 1 1 1 

AA 

Sequence 2590 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGC^ 

I I I I I I I I I AATGTGAGACAGGATCTCATTCTGTTGCCTAGGCTGGAGTGCNGNGGCGCA 

ATCTCGGCTCACTGCAACCTCTGCCTCCTGGGCTCAAGCAATTCTCCCACCTCAGCCTCC 

CAAATAGCTGGGATCACTGGCACAMCCACCATGCCCAGCTMTTTTGTATTTTTTGTAG 

AGACAGGGTTTCACCATGTTGCCTAGGCTGGTCTCAACCTCCTGGGCTCAAGCAATCCTC 

CTGCCTCGGCCTCCCAAAGTGCTGGGATTACAGATGTGAGCCACCGCATCCAGCCCCACA 

CCCTCATTTATACCMTTACCTGCCCAGTMCTGTGGACTTTTGCTTCCTCACCCCTGCT 

CTGATCTGGAAGGAGAGGGATTATGTTATAGCTTGTCAGCACAAGTCCCAAGTTCAATAT 

TTCTGCGGCAAAAACTTCTTCAAAAAATAAATGTAC 

Sequence 2591 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTCCCTCTTCCCGCTGAAG 

GGAAATTCACAGAAGCTATAGTTGATGCAGAGCCGAAATATCTGATAGTTGNGCGACCTG 

CTCCACCTCCAAGTCAAAAGAAGTCATGTTCAGGTAAAACTCGTTCTCGCAAACCTCTGC 

AGCTGGTGGTTGGCACTCTGACACCGAGCTCGGTCTTCCTGTCATGGGGTTTCCTCATCA 

ACCCACACCATGACTGGACATTGCCAAGTCACTGTCCCAATGACAGATTTTATACAATTC 

GCTATCGAGAAMGGATMAGAAAAGAAGTGGATTTTTCAAATCTGTCCAGCCACTGAAA 

CAATTGTGGAAMCCTAMGCCCMCACAGTTTATGAATTTGGAGTGAAAGACAATGTGG 

AAGGTGGAATTTGGAGTMGATTTTCAATCACAAGACTGTTGTTGGAAGTAAAAAAGTAA 

ATGGGAAAATCCA 

Sequence 2592 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGCTG 

ACTCCTTTTTCGGACTCAGTCTGCCTGCACCCATGTGATTAAAMGCTTTATTGNTCACA 

CMAGCCTGTTTGGTTATCTCTTCACACAGACATGCGTGACACTTGGTGCTGAAGACCCG 

GGATGGGGGACTCCTTCGGGAGACTGGTCCCCTGTCCTCACCCTCACTCCATGAGGAGAT 

CCACCTACAACCTCGGGTCCTCAGTCCAACCAGCCTAAGGAACATNTNACCAATTTCAAA 

TCAGATCTTGGCTTAGTGGCTGAAGACTGATGCTGCCCAATTACCTCGGAAGCCTCCTGG 

ACCATCACAGATACTTTGAGTAATCTCTTATAGTGGAAGGATGCAAAGTTGGAATAA 

Sequence 2593 




CCGGGCANGGTACCATATGTTTCAAAGTAGCTGTTCATCCACAGATAAAGAGATCAAGAA 

ACTCTCATATACATACACTAGGAAATATTCTCCAGCCATCAAAATAATGAAGCAGTGTCA 

TTTAGAGCAACACAGATGAACCGCGGAGACCTGCCTCCTACTCCACCATCACATGGAACC 

CACCACTGCTTCTCCGAAGCTCGCTCTGACCACGCCGCTGCTGCTGCAGGGGCCTCGCAG 

GAAGTGCAGTCTCCCCCGCGTACCTN 

Sequence 2594 

TAGGGCG AATTG GAGCTCCCCGCGGTGGCGGCCGAGGTACTrTGGTATGTTTTTGTCTTG 

AAAAGAGGTTTTCTAGCTAGAAATTCGGGGCCCAGAAGGTCAAGGCTCCAGAGATCCCCT 

TCTCATCATCCTTCAGTTGTCTGAGGAGACAGAGGATCACATGTGTCACAGTGAGTGCTG 

ATTCCCCAGAATCTGCAGTCTTAAGCTCCCCTGCTGACGCACACAGCCAGTTCTCTGCTT 

CATATTCATGTGACCCACTGTGAAGTGACAGGCAGCTGCCAATTCTGTCATATAAGAAGC 

AGCTCTGCCCATGCAACTTTGAAAGTTCCTGCACTTCTCCATAGACTGCTCCTTCATGGA 

MTAAACACATCATCACACTGTGGGTGCCTAAGCTAAGTTTGTTTGCAGTTTGATGATGC 

CCAGCTCCTTTTCAACTCAGCCTTAATTGGCACTTTA 

Sequence 2595 

CGAATTGGAGCTCCCCGCGGTGGCGGCCGCCGGGCAGGTAC I I I I I I I 1 1 1 II 1 1 I I I GA 

GACAGAGTCTAGCTCTGCGCCCAGGCTGGAGTGCAGNGGCACCATCTCGGCTCACTGCAA 

CCTCCACCTCCTGGGTTCAAGCAAGTCTCCTGCCTCAGCCTCCTGAGTATCTGGGATTAC 

AGGCACATGCCACCACGCCCGGCTAATTTTTGTATTTTTAGNGGAGACGGGGTTTCACTT 

GTTGGTCAGGCTGGTCTCAATCTCCTGACCTCGGGATCCATCTGCCTGGTCCTCCCAAAG 

NGCTGGGATTACAAGGCGTGAGTCACCGTGCCCGGCGAGAAGCAACTCTTAAAATACTTT 

ATTCCTTCCTCTAGGACCCTTTAAATGGTGAAAATGGGCAGATGAGTAGCAATAAGTGGA 

CCTTTGTTACTCTTCTGAGTTAGAAAMTTCTMTTTAAGTACCTCGGCCGNTCTAAAAC 

TAAGTGGATCCCCCGGGC 

Sequence 2596 

ATTG GACCT NCACCGCGGTGGCCCGCCCGGGNANNAACCATAAACCGTNGCAGNCTCAGC 

ANATATTTTCCCTTCCTTMGTCAGTAACTTTCACCTTTTCACTTACAGGMGCACTTTA 

CGGCTTCTCTTTAGCATATGCAMTTGCCAGCATTACCACTCTTGGACTTTGGGGCCACT 

GTTAAGTAAAGTAAGGGTTACTTGAACATAAGCACTGTAGGCCGGGGCATGGTGGCTCAC 

GCCTGTAATCCCAGCACTTGNGGTAGGCCAAGGTGGGTAGATCACCCGAGCTCAGGGAGT 

TCGAGGACC ANCCT GGGCCAACATGGTGGAAAACCCCACCTTTACTAAAAAATACAAAAA 

TTAGCCCAGGTTTTGGTGGGCTGTATACCTGGNGATGCGCANCCTATTCAAGGAAGGGCT 

ANAGGGCAGGGAGAAATCGCTTGGAACCTGGGGAGGCCGGNANGTTGCAGNTGAGCCCAA 

GAATCCGTGGCCCACTGCACTCCAANCCTGGGGTGACAAGAAGCCGAGGACTCTGGNCTC 

AAAAAAAAAAAAA 

Sequence 2597 

CCGGGCAGGTACTTCTTGATTTCATCATACAAGACAAGCACAAAAGCACCACCCATGCCT 

CTGAGAACATNGGACCATGCACCCTTGAAAAAAGCTTTGCCTNCTTCATCACGAGCAATC 

TTCCGCCAGCAGTCAAGCCGTGCCTGTGTACAGATGGGGTTTTGCCATGTGGACCAGGCT 

GGTCTCGAACTCCTGGCCTCAAGTGATGCACCTGCCTCGGCCTNCCGAAGTGCTGGGATT 

ATAGGAGTGAGCCACCACGCCCGGCTACAGAGTTGGGTTTTAACAGAAGAGGACCTTGAA 

TGCTGAAGCTTNACAGGGCGGCCAMCTAACTCGCTGATTTTTGCAAGACCACAGTGTAA 

AGGTCGGATGTCCACCTGAAGAAGGGGTGGGTGCAACTCTCTGGGTGCTGCACACACCAT 

GAC CAAGC CTGGGCATGCAGCACCCCAGCTCCCATNCATTCACACTGGTTGCTTTGTGAG 

GTCATTTTGAGAGGGCTTTCANAGCCTTTTAATGAGAA 

Sequence 2598 

AGGTACCTTTGACCCATCATCTTGGGAGGTGGGGAGGACCNCGAGGGNCCAGGCAGGGTG 

TAGGGGAATGTATTAGNCCAANGAGATTTCCCTCTTCATCCGCAGCAGNGTATCTATTCT 

ATACCTGGCTATGGGAGAAGACCCCTTGCATGGGAGGGACCCCTTGCTATGGCCCCTTTA 

AGCCAGGCAGTGGGGATCTACCTGNGGCCCCGGCCTCCCTAAAGTCATTCACATTGAATG 

GGGGGATGAAGGNTCGGGACAGTGGCTCATAAGAGCCCGAGTATTGAGCCCTAANCTGTG 
GGGCCTMNAAATGTTCCTTTAATAAMCMTCCTTATTTTTTTMNAAAAAAAAAAA^
TNAAAAAAAANNAAANGTACCTTGCCCGGGCGGGCCGCTCTANAACTAAGNGGGATCCC 
Sequence 2599 

GGCAGGTACGCGGGTGGACACAATTTCCACAACCTTTCAGATGCTGATGTAGAGCTATTG 

GGAAAGAACTTCCAAACTCAGGAAGTTTGCAGAGAGCAGACAGCTAGAGATAACTCGGGA 

CCCAGAGTTGGTCGACAGATGTTAGGATGTATCCTAGCTTTTAGCTATAAACCACTCAAA 

GATTCAGCCCCCAGATCCCACAGTCAGAACTGAATCTGCGTTGTTGGGGAAGCCAGCANT 

GGGCCT TGGGG GAAGGAAAGCCCATGGCTGTGGTTCAAGAAGAGGGGTGGGGCCTGGGCA 

AGCCCACTTTTCGGGGGAAMACTCCTTCCGCCCCAGGGTTTCTTCTTCTCTTAAGGGAG 

GAG ATTAT TTNTTCACCAAACCCCGCTGCCTTTCATGCCTGCCTTTCAAAAGCTAAGATC 

CATGTTTTG CCTTT GCTT MGAG NMTTACCTGCAMATCANGCCCCAAGTGGCTTTGGG 

CGMTNCCATTTTAAAAGATTTTCTAGGCCCCTTCMGGGTTTTTGTAGAAGTGTNAAGC

CCTTGGTGGGGCAAGGGNTTG 

Sequence 2600 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGGACTTNN I 1 I II 1 I 1 I I I 

I I I I I I 1 1 I I I I I ACGGAGTGTNTGTGGACAGTGCCAACTTTATTGAAGACAGNTCCCTN 

TGAGACTACAGGCCCAGCCACTCANAAGGCTGAGGCAGGAGGATCGCATGAGCCCAGGAG 

TCTGAGGCCGCAGCGAGCCATGGTCACACCACCGCACCCCAGTCTGGCAACAGAGCAAGA 

CCCTGTCTCCAAAAAACAAACACAAAAACAAAACAAAACAGCCAAAAGCAGGCTAGAGGA 

TGCTAMTGAGACTTTCCTAAATCTCCATTTTGGCTGGGACCCAAAGCTGAGGACAAAAA 

TTGGCAMTATATCCATATGTTGCAGATTTGCTGCCNATCCATGAATGATTTGTTAAGGA 

Sequence 2601 

AGGTACATGGAGTTTTGATCAACGGACTTATNCCNCCTCTCCTAGGAACTAAAATGCCAG 

GGCCAGGCTGTGTATTTCTTTCCCAGGAMTTAGCTTTCCAGCCCCTTTATATATGGAAG 

GCTGGGTTAAAGTTATGGTTCCAGAAGCTTCCAMTCCTGAAATAGATGTTTTAAAGATG 

CAACCTCAAACACCAATGCTGTTGTTAAAGAGCCTATGGGGAATTGCTGCTGCTCTAGAA 

ACTAGTGGATCCCCCGGGCCTGCAAGGGAATTCGGATATCAAGCTTAATCGATACCCGTC 

GACCTCGAGGNGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnAGGGGTTTAN 

ATTTGCCGCCGCCTTGGGCGGTAATTCATGGGTNCATAGGCTGTTTCCTGGTGGTGGAAA 

ATTGGTTTATCCCGCTTCACAAATTCCCACACAACATNACCGAAGCCCGGGGAGCCATAA 

AGTGGTAAAGCCTGG 

Sequence 2602 

TGGAGCTCCACCGNGGNGGCGGCCGAGGNACCAAAAAGACTCTCAAAAACCAATACTCCC 

CCGGGCMGGGMTAGCCAAGTTTGTTGCGGTTTNCAATGAATGACATCAGCCCTGTGTA 

GGTCTCMTCAAAATGGGTTCAGTTAACACCATCAGTTTCTTTCCTCTTCCAGATCCAGT 

TGMTTCTTGTGGGCATTCTGGATAGCTGGAACAAGCTTAGACATGAACCCAGACAACTT 

GCAMTTTCAAGGAATTTCTCACTGGTGNATTTCATAGGATGCTCAGTGAAAGTAGNATA 

AGGAACTTCAGTGGACCATGGGTTCCAGCGGGGACAAGAAAGAGACTGCTCCTCCGGGAC 

TCCCCCAAGTAGAATCTTAAGGGCCTTCTCCTTGGTCTTCTTGTCCAGGGGACATCCCAG 

GGAAAGGTGGAACTTTGCCCAGGC 

Sequence 2603 

CCGGGCA GGTAC GCGGGGGGCATATCAGATACGAGGGCACCATGTAGCACAGCTGGACCC 

CCTGGGGATTTTGGATGCTGATCTGGACTCCTCCGTGCCCGCTGACATTATCTCATCCAC 

AGACAAACTTGGGTTCTATGGCCTGGGATGAGTCTGACCTCGACAAGGTCTTCCACTTGC 

CCACCACCACTTTCATCGGGGGACAGGAATCAGCACTTCCTCTGCGGGAGATCATCCGTC 

GGCTGGAGATGGCNCTACTGCCAGCCATATTGGGGGTGGGAGTTCATGTTCATCAATGGA 

CCTGGAGCAAGTGGCCAGGTGGGATCCCGGCAGGAAGNTTTGNAGACCCCTGGGATCATG 

GCAGGTTCACAAATGGAGGGAGGAAACCGGACCCTGCCTGGGCCAGGCTTGTGCCGGNTC 

C ACCC AANGTTTNGAAGGAAGTTTNCTACAGCCGGAAAGTGGGNCCCTCTGANGAAGCCG 

CTTTTGGTTCTAAGAAGGCTGGCGGAGGGNACCCTTCGGCCCGCTTCTAAGAACTAGNTG 
GGATCCCCCCCGGGCTGGCAGGGAAATTCGGATATCNAAGCCTTATCGATACCCGNCCGA

ACCCTCGANGGGGGGG 

Sequence 2604 

CTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAGCGTGGAGGGTTTAGGCAGCGTGTTCT 

GATTCTTTGCGGGACGGCGAGCGCATTTGTGCTTTGCCCGCCGCGGCCTAGGAGGCCTT^ 

TGAGGCCGCGTAGTCGGTGTTTTTGAACTTACTCTACAGCTTCTGGCAGGCCGTGCGGCG 

CCCTGACCCGGCCTCACCATGTTGGTGCTGTTTGAMCGTGTGTGGGTTACGCCATCTTT 

AAGGTTCTAAATGAGAAGAAACTTCAAGAGGTTGATAGTTTATGGAAAGAATTTGAAACT 

CCAGAGMAGCMACAMATAGTAMGCTAAAACATTTTGAGAAATTTCAGGATACAGCA 

GAAGCATTAGCAGCATTCACAGCTCTGATGGAGGGCAAAATCAATAAGCAGCTGAAAAAA 

GTTCTGAAGAAAATAGTAAAAGAAGCCCATAAACCGCTGGCAGTAG 

Sequence 2605 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTTGGGTCTGAAGGGGTCAC 

TCAAGCGCTCTGCTATCTCTGCAGCTAAAACGGGTGTCAGGTTTTCAGCTGCTACTAAAG 

ATAATGAGCATAAGCGTTCACCGACCAAGACTCCAGCCAGAAAGCCTGCACATGTGACCG 

TGTCTGGGGGCACCCCAAAAGGCGAGGCTGTGCTTGGGACACACAAATTAAAGACCATCA 

CGGGGAATTCTGCTGCTGTTATTACCCCATTCAAGTTGACAACTGAGGCAACGCAGACTC 

CAGTCTCCAATMGAAACCAGNGTTTGATCTTAAAGCAAGTTTGTCTCGTCCCCTCAACT 

ATGAACCACACAAAGGAAAGCTAAAACCATGGGGGCAATCTAAAGAAAATAATTATCTAA 

ATCMCATGTCAACAGMTTAACTTNTACAAGAAAACTTACAAACAACCCCATCTTCGGG 

ACAAAGGAAGAGCAACGGAAGAAACGCGAGCAAGAACGAAAGGAGAAAGAAAGCAAAGGT 

TTTGGGAATGCGAAGGGGCC 

Sequence 2606 

CCGGGCAGGTAC I ITTTTTTTTTTT! I AACATCTTTG I 1 I I I AAAATAG AATGATAG AAC 

T1TGCCAGTCTTTAAGATCTTGGCTTAATTTAATGTATTAATCTGTTTGTGCAAACATAA 

TACCACCATTTAAAAATGTTAGGGGGATGAGTTGCAGTTTTTATAATAG A I 1 1 I I I I I I A 

AAGTTTGGTATTGTAAAACATTCACACCTCTGTCCCTCAAAATTGATAATTACCGTTTAA 

AGNGCAGNCATTTGNGGTNTAGAMTCTTGTTTTGTTTTGCTTCCATTATTGAGTTCCTC 

CTAAGGGAAAATTGGAGGAGAAGGGGACTGGAATATGAAAGCCCCAAATTCATATAAAAA 

AGTTTGCGGTTNTMGGTTTGTATTAAAAAATAGNATATTATTAANGAAAAA AI I I 1 1 1 I 

TCACTTGATGTTTTGGTTAG 

Sequence 2607 

ATATGGGCGAATGGACTCCCCGCGGTGGCGGCCGGGCAGGA C I M I I I I II CI I I I I 1 I I 

TTTTTTTGGAAACTTCCTTTTCTTAGTTGTTGTATTCTTGAAGAGCCTGGGC 

GCTTGCCTAAGTTTTGGGCAGTGMCTCCTTGGATGTTCTGGGCANGTAAGTGTTTATCT 

TGGGCCTGCAATGAAGCCAGCCGAAGTCCATCCCTTGGGCAAGGGCCGGGCTTGTTGGTG 

GGGTTTTTGGAAAGAAGTTTGGGACCAAGGGGTCCTCCTCAAGGGGGAAGCCGGGGGGGG 

GGTTCTTCCCTCGGGCTCTNGGGCCGCCTGCCATATTTCTTCCTTGCCTTGGGCCGAACC 

GCCTGCTTGAATACCTTGGAATTGTTTTCTTGCCTTGCTTGGTTGTTTTACCTAAGGANA 

TTTCCCTCMTGGTMTGTGG1TTGGTAATTTTAAACCTAATAATCCTTTGGGCTCAATT 

TTCMTCCCACCTCTTGTCCAATCMGCMACTTGNTAANGATTCTTTCCCCCAAAAATN 

GAATTTGGCTGGCTT GGGCAA AGGGCCTTGAAGCAAATTCNATTGGTTTTAATCTTGCAA 

ACAAGCCTGAACTTTCTTTTTTCAAGGGTTCCCCACCATTTMGGGAACATTTTNAATCA 

AGGATGTGGAAATT 

Sequence 2608 

CGCGGTGGCGGCCGCCGGGCAGGTCAAACGATGTGTCCGTGATGACTGGTGGGGCTCATG 

TAATCCCCACCTAAGAAAAGTAGAAAGTGCAACTTTATCTTTTAGGTTAATAAGTGCTGA

GAGATGGAGGTTTTTCCTTCTCATTTTGATGGAGATGCCTAGAAAACCTCGCCTGACACT 

CTTTGTCCAACGCAGGATAGAGAACATAGCAACAGAAAGGGTGAGGCAAAAGGCATGGCT 

GGTNAAAAGGCACTGCATGTTATTAAGGATGNGGGGCCTGGTCCTGTTCGNTTCACATGT 

TTTTCTTNi I I I I ATACAGAAATAGGAATCTACCAGACAGTAATAAATGCCACTTCTCAC 
AGAAAGTCTGACAGGCTTCCACTGCCTCTGAGAGAACAACAACATGTTGGCTCCATAACA

TAAAGAAAAACAATGCTGGGTGCGGTGCGGTGGCTCGTGCCTATGATCCCAGCACTTTGG 

GTGGCTGAGGCAGGAGGACTGCTTGAGCACANGAGTTTGAGACCAGCCTGGCCAACANGG 

CGAAACCTTCTCTNTACTAAAAATNCAAAAAAAAAAAAATTAGCTG 

Sequence 2609 

TCACCGCGGTGGCGGGCGGCCGCCCGGGCAGGTACI I 1 1 11 1 I I 1 1 I 111 1 1 M I I I GTG 

GAGACAAGGATCTTGCTCTGTCACCCAGACAGATGTGCAGTGGCATAATCATGGCTCACT 

GCAGCCTCAACCTCTGGGGCTCAAGCAATCCTCCTGCCTCAGCCTCCCAAGTAGCTGGGA 

CCACAGGTGTGCACCCCCACTCCTGGCTAATTTTTTTAI 1 1 I I I GTAAAGATGGGGNCTT 

GCTATGTTTNCCAAGGTTAGTCTNAAACTCCTGGGGTCAAGCGATCCTCCTGCTTAAACT 

NCCAAAGCACTTGCGATTCAGGNGTGAGCCACCAAGCCCAGCCCCAGCACCACTTTTAAC 

TAGCTGCATAATCTTGGGTAAACTACTAACCTTTCCAAAGTACCTCGGCCGnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnNATTGCCCGCTTGGNGNAA 

TCATGGCAMAGCTGTTTCCGGGNGAAMTGNTTATCCCGTTACAATTNCCNAAANATAC 

NAACCCGGGAGCNTAAAGNGTAAAGCCNGGGGGCCCANTGGGGAGCTAACTACAATTAAT 

GNGTTGGGCTACTGGCCNNTTCAATGGGGAAAACCNGGNGCCACTTNNTANTGAATCGNC 

ACCCCCCGGGGGAGGGGGTGGTTTTGGCCTTTTCCTTCTNGTAATANTNNTNGCCGGGGT 

NG 

Sequence 2610 

GCTCCCCGCGGTGGCGGCCCGCCCGGGCAGGTACAATGAGATGGATACAATTAGTNAAAC 

CTTAAAATTAAAAAAGCTGTAGACAACAGAAGGNAAACTGGAAATCCATTTACAATTCAA 

MGAACTCACTAATAACAAMTTAATGTTCATCAACTTCATTTATAATCCATTTNGGCCT 

ACATNGCNTAACTAAANTGACACATGTCCCCGGGGGCTGCAGGCGTNGCNCCAATCTTCG 

CTCTGAGGNGCTNTCTTAACCGNNANACCCTGGAAGCGGGCAAGTCTCTTGCTGTGTCGG 

GACCTTGCAGNCCCTGGCCCTTCCGCCACCATGGGAATACCTNGGGCCGTTCTAGAACTA 

GGTGGGATCCCCCGGGCTGNAGGGAATNCNNTATCANGCTTATCGATNCCGTCGACCTTG 

AGGGGGGGGCCCG 

Sequence 2611 

CCGCGGTGGCGGCCGAGGTACCAAGTGTGGGAAGATGTTGAGCAACTGGAACTCATGCGT 

GGCAGGTAGGGATGTAAMTGGCACMAGACTTTGTAAMTACTTTGGCAAATTNNNAAA 

AAGNAAACACATAGCTACCATACAACCCAGCCATCCCACTCCAGTATTTAACCCAGGTGA 

AATGAAAACTTATGTCCAMCAAAGACTTGTACTTTTCTATGATGACCCGGGCCGGCTTC 

TTTMCGNTTTTNGGTGCGAACCGCNGCCCATGTTGGCGGGTCCTTGGTAAAAGACCCCG 

CGTCCTGCCCGGGCGGCCGN 

Sequence 2612 

GGAGCTCCCCGCGGTGGCGGCCCGAGGTACANGAAAGTCTAGATGATCTTGTAGTGCCAG 

AAAGTAAGAAAGTAATAAAAAGATGACAGGTCTGTCACAATGATACAGAAGCCAATGTGA 

CMAGCTCTCAATAGTTAMTCTCGAATTTGAGTAGTAAAAGTGACACAGTTTTAGATTA 

TAACCCAAAGAACTAAATAAATATCCATGAGCCCTATTGATATAAATGACAATTAAGGGT 

TTTTTGTTTTGG I I I I I I I GGTTGCTTTGGTT 

Sequence 2613 

GAAACCTGTCGTGCCCAGCTGCATTTANTGAATCGGCCAACGCGCCGGGGAAGAGGCGGT 
TTGCGTATTGGGGCGCTCTTTCCGCTTTCCTCGCTCACTGACTCGCTTGCGCTCGGGTCC 
GTTCGG 
Sequence 2614 

GGAGGCGGCCGCCGGGCAGGTACTTTGTTTC I MUM ICCTTTTT CI I I I I I I AAGACA 

TGGTCTTGCTCTGTTGCCCAGGCTGCAATGCAATGGTGCCACCTCGGCTCACTGCAANCC 

TCGACCTCCTGGGTTCMGCMTTTTCCTGCCTCAGCTTCCAGGTAGCTGGGATTACAGG 

CGCCCGCCACCACACCAGGCCACATCTATGTATTTAGAGACAGAGTCTTGCTCCACCTGG 

GAGACAAAANCNGACCTCCGTTTAAAAAAAAAAAAAAAAGGGAAAGGAAAGGAGGAATTT 
TTGAACCCAGTGCCCGTTCCTTGCAGATCTGCGAACAGTGGGCACCAGCCTCTAGTCTCA

AGTGTGAGGTTTGACTCCAGGACGNTGAATTTA 

Sequence 2615 

CGCGGTGGCGGCCGCCGGGCAGGTACGCGGGTGCTACACGGAGGATGMGCTAAGGAGCT 

GGCTGCGGAGGTGGAGGGTTCAAGACGGCCCCAATGAAGATGGGGAGATGTTCATGCGGC 

CAGGGAACTTGTCGACTATTTCCCAAAACCATACCCCAACAGTGGAGGGCTGNTNGAGNT 

GCCAACAACGGAGCNTNGCCCCCTGACCTCAGCTACATCGTGCGAGCTAGGCATGGTGGN 

GANGACTTCCNTTTTTTTCCTTGTTNACGGGCTNTTTGGMCCNCCCCCCGGNNTTTTN^ 

NTGGGGNAAGGGTNTTTATTAAAMNCCNTTTTTTTNCNGGCAAGCCTTTTGANATGGNC 

CCNCCTTTNAACAAA I I I I I I I I ANNTTTTTNGANGNGGGCCCCCCATCTTNCTNTGTTC 

CAANATAGCCAAGGGTTGTGTGCNCCTTCCTGCCGCTGGGCATTTTGANCCAGAACNCCA 

CCATAAAAAACGCNTGGGCTAAANATTGTTNATAATGATGGNTTTGGGGNNNCCCCTNTC 

TANMNATAMGGGGGACATGGGNTTTTTTCT^ 

TNAAAAAATGCNCCCCCCGGGGGAAAAAATTTTTN I M I I I I I I I I CCCCCCCCCCCC 
Sequence 2616 

CCCGCGGTGGCGGCCGGGTACATTGCTATTTTTCCACCTATGTCCACACCTTCTCATGAA 

CTTTCAAAGTCAGAGGCAGNTACCAGTGCCATTAGAAATACAGATTCAACAACGTNGGAT 

CAGCACCTAGGAATCAGGAGTTTGGGCAGAACTGGGGACTTAACAACTGTTCCTATCACC 

CCACTGACAACCACGTGGACCAGTGTGATTGAACACTCAACACAAGCACAGGACACCCTT 

TNTGGAACCGATGAGTCCTACTACGTGACACAAGTCACTCAAAGATCAAACATCTATCCA 

GGCCTNAGCATTCCCTTTCCATCTTACTGGAAGTCTACCTGNGCTCGGGACACANGGGAG 

AAGCTCTNTTGAGGCACCACTTTTTGGGAAACCATCTACAGACACACTGTCCAGAGAGGA 

TTGAGACTGGCCCAACAAACATTCAATCCACTCCACCCAT 

Sequence 2617 

TATTNAGMGTGGTTCCAGGTCTTAGCACAAAACATGGATGCACTTTAGGCCANNCATGG 

TGGCTCACACCTGTAATCCCAGCACTCTAGGGGGCCAAGGTGGAAGGATCCCTTCAGCTC 

AAGAGTTTGAGACCAGCCTGAGCAACATAGGGAGACCCTATCTCTACCAAAAAAAAAAAA 

AAAAAAAAAAAAGTACCTGCCCGGGCGGCnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCTCGAGnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnTAATTGCCCCCTTGGGCGTAATCANTGnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnTCCGNTCACAATTNCACACAACATACNNGCCCGGG 

AGCATAAAAGTGTAAAAGCCTGGGGTGCCTAATGAGNGnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnGAAACCTGTCGNGCCCAGCTGCATTAAATGA 

ATCGGCCAACCCCCCGGGA 

Sequence 2618 

CCCGCGGTGGCGGCCGGGTCTTTAGGATGCTGGTAAACTATGGAATTTCAGACGTGGACA

TCTGGCAGACATTTTCTTTAAAATGAATAAAGTGAGCTTATCACCTCAAGGAAAACAACT 

NAAGCCATCTGTTGCCAATGATAAAAATATGAGATTTAAAAAAATGTTGAAGNTTCAGGC 

TAAAATTAAAATTTTGAAAMCTTATATTGGCCACCACANGTTTGACACCTTACCAATTA 

TTAATGGGANGGTTTTNTTMGAGGNCMCAGTGATNTTMCAM^ 

TNTTGGNTTATTGGGAGGTTTANCNATTGGGAAGATCTGTAAGACTCAAGNGAACCATGG 

NTNTCCAAATGACTAAGACATGATGGNTACAAAATCACCCCTGGGTAAAAGATCCATTCA 

AAGTGCAGGACAGACCAATGGAATTTAGTGTAACAGAGGTNTGAAAGGTTCATGAAGGCT 

GGGCATGGTGGCTCACGCCTGTNATCCTAGCACCTGGAGGGCCCG 

Sequence 2619 

CTCCCGCGGTGGCGGCCGGGTAC I > II I I I CTTTTCTTTTC I I I I I I I I I I I I I I I GAGA 
CGGMTCTTGCTCTGTCACCCAGGCTGGAGTGCAGTGGCGCAATCTCGGCTCACTGTAAG 
CTCCACCTCCCGGGTTCACGCCATTCTCCTGCCTCAGCCTCTTGAGTGGCTGGGACTACA 
GGTGCCCACCACCACGCCTGGCTACI I I I I I I IGTA I I I I I ATTAGAGACGGGGTTTCAC 
TGNGTTAGCCNNGAATAGGNTCGATCTCCTGCCTTGNGATCTGCCCCCCTTGGCCTTCGA
GCCCi I I l IGACCTNAAGGACCAGCACTGGAATAATGTTTGGGANNCCCNTGCTTTTNAG 
GCAGGCTTAGGGAGCAGGAAAGCATACAGGTGTAGCAGCCTTTCCAGCTGATCCCCATGC
CCTGCTGCACCTGGAGGGCTGGAACAAGCTATTCTCATATTGGGGAAAAGGGCTGA 
Sequence 2620 

CTCCCGCGGTGGCGGCCGGGTACTTNGTTCCTTTTCTTTN Ci I II I I I I 1 1 I I I 1 1 GAGA 

CGGAATCTTGCTCTGTCACCCAGGCTGGAGTGCAGNGGCGCAATCTCGGCTCACTGTNNG 

CTCCACNTCCCGGGTTCACGCCATTCTCCTGCCTCAGCCTCTNGAGTGGCTGGGACTACA 

GGTGCCCACCACCACGCCTGGCTACI I I I I I I I GTATTTTTATTAGAGACGGGGTTTCAC 

TGNGTTAGNCCCGGANAGANTNGATCNCCTGACCCTGGNAACTGGCCCCCTTGGCCCTCT 

AACCCTTTTTGACCCCTAAGGGMNGTACTGGAATATTGTN1TGGGNNCCCATGCTTNTG 

NGGCAAGGCTTAAGGGAGCAGGAAAAGCATACANGGTGTAGCANCCTTTCCAGCTGATCC 

CCATGCCCTGCTGCACCTGGAGGGCTGGAACAAGCTATTCTCATATTGGGGAAAAG 

Sequence 2621

NCCGCGGTGGCGGCCGCCGGGCNGGACCTGTTTTATCCCAGCTGAGAGGCAAGGAGAACC 

TTTGTTTCTTAAAAAATAAGCTGGTTTCAGCCAGGTGCGGTGGCTCACGCCTGTAATCCC 

AGCTCTTTGGGAGGCCGAGGCGGGCGGATCACCTGAAGCCAGGAGTTCGAGACCAGCCTG 

GACAACATGGTGAGACCTTGTCTTTATTAAAAATGCAAAAATTTGGCCAGGCGCCGNGGC 

TTTACCCCCTTATTCCCAGCACTTTGGGAGTCCCANGCAGGTGGATCACAAGGTCAAGAA 

ATTGAGACCCTTCTTGTTTACACCAGGGAACCCCCTTCTTTCCTAAAAATTTNTAAAAAC

CAAANTTGCTGGCATGGNGGGGGCACCTGTAGTCCTAGCTACTTGGGAGCCTGAGGCANG 

AGAATGGTGTGAACCCGGGAGGCGGAGCTTGCAGTGAGCCCGAGATCGCACCACTTGCAC 

TGCAGCCTGGGGGACAAGAGTGAGACTCCCTCTCNAAAACANAACAAAACAAAATTNNCC 

NAGCGTGGTGGCAGGCCCCTTGAGGCANGANGAATCACTTGAACCCCGGGG 

Sequence 2622 

GGGCNAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTGANGACCTTAGAAAAAGGAG 

GAAAGGAGGAGAGGCAGATMTTTGGATGAATTCCTCAAAGAATTTGAAAATCCAGAGGT 

TCCTAGAGAGGACCAGCAACAGCAGCATCAGCAGCGTGATGTTATCGATGAGCCCATTAT 

TGAAGAGCCAAGCCGCCTCCAGGAGTCAGTGATGGAGGCCAGCAGAACAAACATAGATGA 

GTCAAGCTTTGCCTCCCCCCCCCCTTAGGGAGTTAAGCCAAAAGCTGGACAAATTGCCCA 

GAGCCTGTGATGCCTTCTTAGCAGGGAGAGCAGATGGAAATCCACCTTGTAGAGGCTTCC 

CCAGAAGAACCTCCAAATTCTGTCAGCTNATACCAGAAGTTAGAACTTCTGCCAGAAAAA 

AAAAAAAAAAAAAGTACCTGCCCC 

Sequence 2623 

TGGAGCTCNCCGCGGTGGCGGCCGAGGTACTTCN I I I I I 1 1 I I I I I I I I I M I CTTATTT 

AGGGGAGMCTTTTACCTTTTCACTTMTGCACTTCTCTTTGGTGTATCTGTNTTGNGNG 

CAACACTACTCTTGCTCTTNAGGGCCATTAANTAAAATAAGAGTTACTCAGGCTGGGTGC 

AGTGGCTCATGCCTGTAATCCCAGCACTCTGGGAGGCCAAGGCGGGCGGATCATGAGGTC 

AGGAGTCAAGGGCAANCCGGCNAAATNGGGGAMCCCCMNTTTTTCTAAAAATACAAAA 

ATTCNTCCNGGCAATGGGGGGCNNATGCCTNAAATCCCAANANACCCTNAAAAGGCTNGG 

GNCNGGGAAAANTNTTTNTAANCCAGAANCCCCGGGNGGNNGGGGNGNTGNANATANGCC 

CAATANNCCCCCNTTNNCNTNTNCNCCCGGGGGAANNNTTGGAANNCCTTTNTTNTNAAA 

AAAAAAACNAAAAAGGNTNTCNCTNTNCCCCCCNNGGGNAANNNGANACCTC I I I I I ll I 

CTCCNCCCNNNTTTCNCTGGGGGGGGGGNNGAATNTTANTAAAANAANCCNCCCCNGGGN 

G ANANAATNTNAN 1 1 I I I I I I CACCCCCCCNCGGGG 

Sequence 2624 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAACATACTGCTTTGATTGATAT 

ATACAAAACACCACCTNCTTTTGATGGAAACGGGGCCCAGAAGCAGATCCTGNGATGAGG 

ATTAATGTGCAATTCATGTAATNTATANCACAGTNTTCAGGANGTGCTCACAGGATAAAC 

CAATGAGGGAGNGGGTTCAGGGGCNGGACAGGGAAGGGAACTAAACTGNGCAAGGGACNN 

TTTNNGGGCANGGNNCGGNCTTAAGCCTANTCCCGTTGGGAGCTCTGGACATACATNAGG 

CTTCAAAGNTNGGNNCAACTGGGNGCAAAAAAACCTGGGCTGTTTATGCCTGCACCCCTT 

TAGGCACCCTAAGCCCTACTTTGAAGTNTTTATTTTAGCTCTCAATGTGTTGAACTTATA 
AGGAAATTAAACGTTTCATGAGTTAAG
Sequence 2625 

TCCCCGCGGTGGCGGCCGAGGTACAGATAATTCAAATCTATTATCTAATGTATTAGGTTA 

ATTTCTATTTGTTCTATAC I 1 1 I I I I I I I I I I I I GAGATGGAGTCTTGCTNNGTCGCCCA 

GGCTGGAGTGCAGNGGTGCAATCTTGGCTCACTGCAAGCTCCGCCTCCTGGGTTCATGCC 

ATTCTCCTGCCTNGGCCTTCCAAGNAGCTGGGACTACAGGTGCCCGCCCCGACACCTGGC

TNATTTTTAAAAAAATTTTTTAGTAGAGACAGGGTTTACCGGGTTAACCAGGATGGNCTC 

NATCTCTGACCTCGGGATCCCGCCCNTTTGGGCCTCCAAAAGACTGGGGATTACAGGGCG 

GTGAGCCACCGACCCGGCCTATTGACATTTTTTMGGGTCMGATTTTCTTTTGTGTGTC 

TAGTMTTCGTCTTTTATTGCAMGATAATTTGCTTATTNGACTNAGAAAAATGATTTGT 

GGGCATACAATATTGTATGTGGTACCTGCCCGGGCNGCCGCTCTAGAACTAGNG 

Sequence 2626 

TTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGATAATTCAAATCTATTATCTAATGTAT 

TAGGTTAATTTCTATTTGTTCTATAC I I I I I 1 1 I I I I I I rTGAGATGGAGTCTTGCTCTG 

TCGCCCAGGCTGGAGTGCAGTGGTGCAATCTTGGCTCACTGCAAGCTCCGCCTCCTGGGT 

TCATGCCATTCTCCTGCCTNGGCCTTCCAAGTAGCTGGGACTACAGGTGCCCGCCCCGAC 

ACCTGGCTAATTTTTAAAAAMNTTTTTTNAGGAGAGACAGGGTTTCACCCGTGTTAGCC 

AGGATGGNCTCGATCTCCTGACCTCGGGATCCCNCCCNTTTGGGCCTCCCAAGGACTGGG 

GATTACANGGCGGTGAGGCCACCGCACCCGGNCTATTGACATTTTTTAAGGNTTCAGATT 

TTCTTTTGNGNGTCTAGNAATTCGTCTTTTATTGGCAMGAf AATTTGCTTATTTGA 

AA 

Sequence 2627 

GCTCCCCGCGGTGGCGGCCGNGGTACAGATAATAACATCTGATATCCACATGGGGTCTGG 

AGGTGCNAGCCACCTTCCTTTCATCCCACGGTCTCACAGCAGCCCTGGAAAGAGGCTGCT 

CTCTGTTGGAGGCTAAGGGCCAGTGTTGGAAGGAGCTCTGGTGGAAAAGTGTGGTCTGCA 

TiGANGGGCTCCCATGAATNAGAGGATAGGGGTGGCNGGTACCTGCCCG 

Sequence 2628 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCAGAAGGTCCCGGCAGCAGCAGGA 

AGAAGACGGACCCCGCGATGAGGGCGGCGGCANGGAGCACCTTCATGTTGCGTTCGGAGC 

CCCGCGTACCCTATGGACAGTTGTGTCCCCAAGGAAGGATGAGAATAGCTACTGAAGTCC 

TAAAGAGCAAGCCTAACTCAAGCCATTGGCACACAGGCATTAGACAGAAAGCTGGAAGTT 

GAAATGGTGGAGTCCAACTTGCCTGGACCAGCTTAATGGTTCTGCTCCTGGTAACGTTTT 

TATCCATGGATGACTTGCTTGGGTMGGACATGAAGACAGTTCCTGTCATACCTTTTAAA 

GGTATGGAAGAGTCGGCTTGACTACACTGTGTGGAGCANGTTTTAAAGAAGC 

Sequence 2629 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTACCTCATCCCCTCAG 

TGACTAAGAATTGCAGNATTTMGAGGTAGCAGGMTGGGCTGAGAGTGGTGTTTGCTTT 

CTCCACCAGAAGGGCACACTTTCATCTAATTTGGGGTATCACTGAGCTGAAGACAAAGAG 

AAGGGGGAGAAAACCTANCAGACCACCATGTGCTATGGGAAGTGTGCACGATGCATCGGA 

CATTCTCTGGTGGGGCTCGCCCTCCTGTGCATNGCGGCTMTATTTTGCTTTACTTTCCC 

MTGGGGAAACAAAGTATGCCTCCGAMACCACCTCAGCCGCTTCGTGTGGTTCTTTTCT 

GGCATCGTAGGAGGTGGCCTGCTGATGCTCCTGCCAGCATTTGTCTTCATTGGGCTGGAA 

CAGGATGACTGCTGTGGCTGCTGTGGCCATGAAAACTGTGGCAAACGATGTGCGATGCTT 

TCTTCTGTATTGGCTGCTCTCATTGGA 

Sequence 2630 

CCGCGGTGGCGGCCGCCGGGCAGGTACAGATAGCAAAGACTGGGACCACAGTGGAGGGAT 

GCCTAATCCAGACAAGGGCAGGAATAGGCAGGGAAGGCTTCCTAGAGGAAGTGATTTCCA 

AGCTGAMCTTGACAGATGGMCAGMGNTAGCCAGAGATGGGAMACTATTTTTGGTCA 

ATGGAAGANCAGGTGGTTGAGATAGAATCTGACACATGANAGCAAAAAAAAAAGTCCANN 

GNTGGGAGAATACNGNGTGAGAATAAGACAATNTTAAACTGGCNATATAAGTAAGNGATC 

ATCACAAGGCTTTGTAGGACATAGTAGGGAGTTTAAGACTTTTTATTCTGAGGGCAATGG 
GGAATCACAAGAGGGAGTTAGGCACTNTACACAACTTC
Sequence 2631 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTGCAGGCCTCCTACACCT 

ACCTCTCTCTGGGCTTTTATTTCGACCCGNGATGATGTGGCT 

Sequence 2632 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTGTGGATATCATACGAAAGTGT 

AGTTTCAAAGGAGATGGACMACGATACCTACAGTTTGATATTAAGGCCTTTGGNGAAAA 

TAATCCTGCCATTAAATGGTGTCCTACTCCAGGCTGTGACAGAGCAGTAAGACTAACGAA 

ACAAGGGTCAAATACATCTGGATCTGATACACTCAGCTTCCCATTGCTGAGAGCTCCTGC 

TGTTGATTGTGGAAAAGGACACCTCTTCTGCTGGGAGTGCCTTGGTGAAGCACATGAGCC 

TTGTGACTGCCAAACATGGAAGAATTGGCTGCAAAAAATAACCGAAATGAAACCAG 

Sequence 2633 

AATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACC I I I U ) M I I ! I 1 1 II I ) II I I GAGA 

CGGAGTCTCGCCCAGGCTGGAGTGCAGGGGCNCAATCTGGTCTCACTGGNNGCTNNGCCT 

CCCGGGTTCGTGCCATTCTCCTGCCTCANACTGTCGAGTAGCTGGGACTACAGGCAGCCA 

CCACCAAGCCCAGCTACTTGGGATCGCATGAGCCTGGGANTTTGAGGCNGCAGTGAGCCA 

TGATCACTCCAGCCTGGGCAACAGANNGAGACTCTNTCACAAAAATTAACTTTACTTAAT

TACTTTTATTATTGNCATAATCGCTCCATTTTATTGTTGTTTGTTGTTGTTG 

ATTGTAAAACCATTTTATNTTTTGTTGCAAACCMAMGTTTCTGAGACAGGTCTTMTT 

TATTTAGAAGTTTATTT 

Sequence 2634 

TGGAGC TCNCCC GCGGTGGCGGCCGAGGTACI I I I I M I II I I I I I I I I I I I GTGATAAG 
GCTMTATTTTTTAAAAGGAGGCACAGTGAATTTAAGACATTGACATTTATAGCCGTAGC 
TCGACGCTTACTCTTGGAGGCCCAATTATCATCCCTGTCCATCTTGTAAGTGTCATGTCT 
TCGTCATCTTCTAGACCCCAGCTAACTGTGCCATCTCCTACTCCTTTCTGGCCTTCTTCG 
AGTTCTTCCAACAGTCGGAMTTGCGAGGGACTTTTACTCCCGAGCCCGTGGTGGCTGCC 

ATCTTGCGTCGCTGTTGCTTGAAGGCCGGCCCCTTCTCCCCGCGTCCTGCCCGGGCGGCC 

GCTCTAGAACTA 

Sequence 2635 

CTCCCCCGCGGTGGCGGCCGAGGTACTCTCCGCCTAAGTCCATAGAGAGGCTAACACCCT 

TCTAGTTTATTAGTCTTTGNTAMCCTATGAAATTCTGAATNTAATGGCTANNTTATATG 

CANNGTGGAGGNCACNCAAAAATGTTTCTCTCAACAGCTGTGATAAGTATAGGCTTATTT 

TGATGTCTAAAGATCTGNTACCTGTATCTGNTTTCATCTTTCAACACAAATTCATGGGAA 

NNTTAACTATGNNCCTGTGNTCNGGACAAGTGTGCATGAGAACATCATNCACTAAGTTTA 

TCATNAAATGGGAAANGAAGCAGACNTTTTAAAAAGCACCCAAACCGGCCGCTCTAGAAC 

TAGTGGA 

Sequence 2636 

AGGTACGCGGGGGCTGACTCTCTTCTCAGATTCAGCCCAGTTGCACCCAAGTAAAATAAA 

CAGTCTTGTTGCTCACACAAAGCCTGTTGGTGGACTCTCTTCATATGGACTCATGTGACA 

TTTGGNGCCGAAGACCTGAGACAGGAGGACTCCTTTGGGAGACCGGCCCCTGTCCTCGCC 

CTNTTTCATGAGATCCACCCATGACCTTGGGGTTCCTCANCCCAGCNCGAANGAACTTCT

TCACCAANTTTTAAATCGGGGACCGCAAAGCCAACAAGAATGAAGAAGTGNCCAGGTATG 

GTGGCTCATGCCTGTAATCCCAAGCACTTAGGGAGGCCGAGGAGGAAGAATTGCTTGAGG 

CCNAGGAATCCCANACCAGCCTTGGACAACATGGCAAAATNCCANTNTTTTCNAATNANA 

TTAAAGNAAANAAATTAAAGTTACCTGCCTNGGCGGGNCGTTCTAGNAACTAGTTGGANC 
CCCCCGG 

Sequence 2637 

TTAGGGCGAATTGGACTCCACCGCGGTGGCGGCCGCCCGGGCAGGTACACGGTCAGTCCG 
GGTCCTAAGGGCGCAGGGTAGGGCATNCCACTGGGAGTTCAAAGGGGAAACNAAGATGGT 
TCCCACTGCTGCTCAGACAGTGTCGCTAAAATTCCCTCACTCATTTTCAAGTCTTGTTTT
ACATMTGGCTTTTAMGCMCTTTTGTTMTGCTNCTGATNCTTTMTNCAMCTATTA 
ATGTNCCATTTTGGTGACATCTGGGTTTTCATACANAAAAAAAAAAAGTCAAAAAGGGGA
MCAMAANATATTTACACATTTTATATATAMCTTAAAAACCTTGTACCTTN 
Sequence 2638 

CGAGGTACTATGGCAGTTTTGCCTCAGGTGCTGAACATTTCTCAGCCCTGGCTAAAAGGG 

AGCAGCACAGGGAGAGAAACAGGATAGGAAAGCAGAATGGCGAGCAGCCTATGGCCCAGG 

GCCTGTAATCCCTTCCCAAGACTAGCTGCTCAGGGTGGTGCAGGGACAGGACCAGACCCT 

GCGCCTATTTCCTGCCTTCTTTCCCTATAGGGAACTCTGTAGGCTGAGCCACTGTCCTGC 

TCTTATGACATTATATCTTGTGCCTTTCTCCTCAGCAGTGAGCAGTGAGCTACTCCTGGC 

CCAGGCCCTAGGGGAAATGGATCAGTCTTTGAGGTTTCTATTTGGGGAGGGGAGTACCTG 

CCCG 

Sequence 2639 

CCGCNTGGCGGCCGAGGTACTTG I I I I I I I I i I I I I I I GAGACAAGGTCTCGCTCTGTCA 

CCCAGGCTGGACTGCAGTGACATGATATCGGTCACTGCAACCTCTGCCTCCTGGGTTCAA 

GGGTGATTCTCGTGCCTCAGCCTCTCAGGTAACTGGGATTACAGGCATGCACCACCATGC

CTCGTA I Mill! IGCGTGTGTTTTTAGTAGAGACGTGTTTCACTATGTTGTCCCGGGCT 

GATCTCCAACTCCTGTAAGTCAAGTGATCTGCCCGCCTCAGCCTCCCAAAGTGCTGGGAT 

TATAGACATGAGCCACCACACCTGATGTCTGATGCTTATTTATTATGTGACCTTAGCGAA 

GTGTGGTAGTCATTAAGTGCTGGTCTATCTCTATACCTTCCCCAGGCAAGGTAGGATTGC 

ACTTNCCGCTCCACTTGTGATTAGGTGGAGCCATGTGACTACTTT 

Sequence 2640 

TGGGCTCCCCGCGGTGGCGGCCGAGGTACTAGAAACTTTCCAAGGAGTCTTGGGTGTGTA 

GCCAAGAGGAGCCATGAGCTATGGACTCCTCAAGCACGGGAAGAGGAGGTGTGTGCTGAG 

AACAGAGAGGCCCTGCCCTCTGTCCACTAGCGAGAATCCCTAGCTGCCCCAGCCCAGTCT 

TTCTCCCCGGCATTCACAAACTTTGCAAGCGTTGGTCCAGGGCCCTTCTCCAGATCTGTT 

NCAACTTTGNAGAGTGAAGGGCTTGAGCATACGGGGGAAGAGAGTCTGCATNANGTTAGG

GGGAMACTTTTAAAAGATACCCTCATTGTGTCAAAAGAAGTGTGCCMTCTATTTTTGT 

ATCAGCATTGGGAAGNGCACTTTCCCCTGGGGCCGTGTGGGTGNGTGAATGTGCAAGTGT 

CTGAGAGATACTGCATCAAGCCCTAGACCCTCAAGAGCCAGTCCCGCCCTTTTACAGAGC 

ANTCCCTTATCCTGGGGCCATGGGTCAGGCTGACCTTCAA 

Sequence 2641 

CCCCGCGGTGGCGGCCGANGTACGCGGGGGACTTCGGGCTTGTTCGCTGGTGGCGTNNGA 

GCCNAGCCCGGACTGGTCAGGATNGATCACGGACGTGCAACTCGCCATNTGNNGCCAACA 

TGCTGGGCGTGTCGCTCTTCTTGNTTGTNGATCTCTATCACTTACGTGGNCNGTCAAACA 

ATTCCAAGMTGCANGAATGMAGTTGGCGCTTTCTTCCGCCCCANGGTCCCAGGACATT 

AGTCTGNGGCANGATNGAGGGGTNTNGAAGGGGCCTTTCACACTTMCTTTATTCCTTTT 

ACCCTTCACMCATACAAAAGGCAACTTACACCTGGGATTTTTNCAMACAACCTTTTAT 

TTCCCTCAGANGNCTTTCCNTTAATCCCTATGGAACAAAGAANGCTNGNCCACTTGAANT 

AGGGGCCCNAGTATAGGGGGCTTTGCTTTTCTACTTCCNTC 

Sequence 2642 

GGACTCCACCGCGGTGGCGGCGAGGTACGCGGGTATCTGTCATAAGCTCAACATCTGTAG 

ATCAGAGGGCTACCAGAGGAACCAGTTTTAGAAGATGAACAACACCTNTTAGAAAAGAAA

TTGCCTGTNACGTTTTGNAAGATAAAGAAGCGGGGANAACTTCGAACGTTGCAAACCTGN 

AAACTGGGAGAAACGANGGCAAGCTCTNTTGGAACAAGCAAGCTGCAAGGGAGCAAGGNN 

CGCCTGNCCCANCATGGAGCCGGCCNNGNCNANGGAAAAGGAAGGAGCGTGAGCGCCAGG

AGCAAGAAGCGCAACAAGACAACTTGGAACTNGGGGAAGCAACTGGGAAANGCATGCTGT 

NAGCTANNAACCTGCANTATNAGAGGGAGGAGAGGGAGGAAAAGAAATTNGANAGGCGAA 

GAGGCTNGCAAAACTGGGAAACTTGAAAGGCAACGACAACTTNGAGTGGGAACCGGAANT 

CGAAGGCAAGAACNTNCCTAAATCAAAGAAACATAGGAACAAGGAGGGACATAGTTTGNA 

C 

Sequence 2643 

CAATTGGACTCCCCGCGGTGGCGGCCGAGGTACCTTATGTAGCCCAAGAAATTCAAGAGG 
AAATTGATGGGGCTCCTTCAGGAGCAGCGTGCAGATATGGACCAGTTCACTGCCTCAATC

TCAGAGACCCCTGTGGACAGTCCGGGTGAGCTCTGAGGAGAGTGAGGAGATCCCACCGTT 

CCACCCNTTCCACCCCTTCCNAGCCCTACCTGAGAACGAAGACACTCAACCCGGAGTTTG 

TACCTGCCCGGGCTGGCNCGCTTCTAGAAACTAGTTGGATCCCCCGGGCTGCAGGGAATT 

CGATATCAAGGCTTATCCGATACCGTCCGACCTCGAG 

Sequence 2644 

AATTGGAGCTCCCCGCGGTGGCGGCCGGGCAGGTACI 1 1 I M I I I 1 I I I i I I I I I I IGGA 

AACAGGGTCTTGCTATGTTGCCCAGACTGATCTCAAACTCCTGGCCTCAAGCAATCCTCC 

TGGCTTAGCCTCTCAAAGTGCTAGGATTACAGGTATAAGCCACTGCACTAAGTCCAGTGA 

ATCATTTTCCAGACAACATTTACTGAATATCCGACACTGCCTTCCACAGACGGGGGAGGT 

TTCCCAMGAGGCCGTGCTTTTTGAGAAGCTGGCAAGATAACCCTGGGGTGGAGGGAAGG 

TGGCCCGCAGCTGAGACCACCACCTTACTGTCCCCAAATCCTCACCAGAGGTTGCTGACA 

CTTTCCTNTTCCCCATCCTCATCTACTTTTCAAATTAGAACAGTAGAAAATGGGATGATT 

AGGCTCCTTNNCATTTTTAAAAAACCCCGTTGTTTCAAATTTTTNTAGACNCCAAGTCTN 

AAANGGGNGTAATCNCCTTNGTAAAATTTAAATT 

Sequence 2645 

CCGGCAGGTACCAATCATATAATNTATATAACATTGCTATCAGACTAAAAACACATTCTT 

AGCTAMGATAACTTACCATTTAGAAGTCAAAATGCAGGGAATCTTACTCCTGTTTCCAT 

TTTNTGNCCCNCTTGCTTCACTCGNGTATGNCATGCTCTATCTCTTCTCCTATGCAGACT 

TTANGNCNGTNGGCCATTAANTCTTGMGA/^TTTCCTTCNNTCTTGCTGTCACNTACCA 

NNTTANTTGGTCTGCGTGCAACAAGAAGGNGTATTATANNAAAAAAGTTCTTGCTTAACC 

ATTCANGATTAAATAAANAAAAAAATTCCTTTGTTTNAACATTTTGNTATTT^ 

TACACCAMCTTTTTTAATTGCCTTTTNCANAGNNCCTTTCCCTCCAAAAAATAAAAAAC 

AAAAAATCTTCAATCNACATAAAATCAAACACCTGTATTGATCCATGTTCATGCTAAGCT 

GGGNAA 

Sequence 2646 

actccccgcggtggcggcgaggtacaagcgctttgaatatcatgggcaccatgactgtga 

ccctacaggtaggattggatcactccatgagagtagccggcaggtttctacaatggcctg 

ggmtggactgattatttttatacattttctggcctgagagaaagccaaggtcccctgct 

gttcacagcaaccctgcctgggagcttggaatcttggtaaatctgccccgttnggatcta 

ttggaggtaggctcaccctttttngtct^ 

gacntatcatcacctccagtggaactacagaanacctggacccanctgcactattttaat 

gtaaaaataacaatatggccagggtgcagtggctcacgcctgtaatncnatcactttgag 

cagccaaggcgggcggatcacgaggtcaggagattaagaccatcctggccaatatgggtg 

aaaccctgtntttactaaaatacaaaaaaattaactgggcatngtnttgcgtgcctgtng 

tcccanctacttggganggctgngaccaggggaattgctttgaaccccgnanggcntaga 

attgcantgagccnnaatcangcntctgacttctacctnggcgacaggantgggacnttt 

tnttaaaaaa 

Sequence 2647 

AGGTACCCTATATTCTTCTTGATTTCTAGCCTTTTATTGGCTCTCAGATTGCCAGAGTTG 

GGACTCAATAGTAAGCANCCATTCTGGTGAGGCGGAAGNGATNCTACCAGGGTGNGTTNT 

CATGACMGCANMTCACTGNGTTTTTCTCTCTACTCTGTGGCATANGACTCTATGCCAT 

AGAGNGACGTGTGAAAGGCTTGAGGCT 

Sequence 2648 

TCACCGCGGTG GCGG CCGCCGGGCAGGTACTTATTTTT CI I I I I 11 I 1 I I I GGGGGTGGG 

GNCCTGGGNANTTTTNTNMGGGGCTTTTTAACNNGGGANGANANCCTGTGCCGGTTCAN 

C CCAN GCCGG TNGTN MTTTTCANCCTTTTTTC^ 

GTTTTCNANCNTTTTTCTTTNAMNCCTTCTTCACAAGTNGANGCCATTCNTCCNGGTTC 
CTTCNAAATNTGGGCCTGGGTCCCTTNTCCCCGGNTTTAAAAAANAACTTNCCAAAAACA
AAACANTTTTTTCCCCTNG
Sequence 2649 
CCGCGGTGGCGGCCGCCCGGGCGGGTACGCGGGGGTCTCCAGAGTGAGTGTTCCGGAGAG

CACCGTGTTAGGGAAGGAGGAGGCTGCGGGCTAACCTGCCGGGAGGGAGGATGCTACTGC 

CTGTTTCCTATTAGTGACAACCCACCTCCTAATCACGTCCTGCTTCAAACAAGGTAACAT 

CACAGGACAGCCTCCGAAACMTAACTGTTTGAATATCCTTAATCTTCGGCAACTTCAAT 

MCTCCCCAAAATATATGTMTCAGAGMTTAACTTTTACMCTTTTGGTTATTGTTTGC 

TTGAACCATAAAGCAGAGCTCTTTCTGGGATAAAAAAAAAAAAAAAAAAAAAAAAAAAGT 

ACCT 

Sequence 2650 

AGGTACTTCACAATACAACTCTTGCAGAAAGTATGAAGACACTCTGTGATGGTGGTGGCA 

TCCACGAAGTAGCCGGCGCATAGGCAGCAAACAATGTGTTCATTCAAGTCTTTGATCTTC 

ACTCGAACCTCCTCCTGACCCTCCTTTCCCAGGGGAGACTACACAACGTCGGCGACACAA 

CGCGCAGGCCCCCGCGTACCTGCCCG 

Sequence 2651 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATAACAAGCAATAATAAA 

AGTCCTCTGCCGAAGACAGMTAGTGTMMCCTTTCCTTCCCCTGCCCTTACAGATTTC 

TAGAGAAGATAATTTGGGAATAAMGGTATMAAGGTTTTTGAGAAAAGAGAGATCAAAG 

GACATTCAGAGTTCGGGAGGATATCTGGGAAGCAGACTTAGCTTAGATCCTAAAGGGTGT 

AGAAGATCTGCATTGCTGGGGAGGAGTGTGGGAGTGGTAAACCAGTCAAAGAATTCACAG 

ATGTGCTGGGAGGAGGAGAAGGGGAGCAGAAGCAGGAAAACAGTCTGGCTGAAACACTGG 

TTGTATTTCAAAGAGCAGTAGCAAATCAGGGTGGATAAATGTGTGGGCCTAAGCAGCTGG 

TGAGCATTAAATGTGTMGAATCAAA1TGTATCCTTAGCAMCCTCTGAAGATTTCTGAG 

TAAGATTTTTATGAGTGTTGGTAAATTTATATAGGATGCATATGGGTAGTACCTGCCCGG 

GCGGCCCGCTCTAGATCTAGTGGATCCCCC 

Sequence 2652 

CCGCGGTGGCGGCCGAGGTACATAGTGTCGCGAACTCAAATCGGCATTTAGATAGATCCA 

GTGGTTTAMCGGCACGTTTTTGCTTATMAAAAAGTGCAAAAAAGATGTGGTTTACAAG 

TTAMGCTACAGAATCCCTTTTTGCTGTMTTGCACCAGTTTTAAAGCCTCTGGACAGAG 

CAGTATTTCGTTTAAAACTTTG I I I I I CTTAAAAGCTTACAGTGTTTGGCTAATTCTCCT 

CCCCTTTTTACAAGACGGGGGCCGGAGGGTGGACACTGGTGGCAGGTTAAGGGATACTGT 

CACTTTAAGAAGCCTGCAGATTGAAGTGTAAACATGGAGAAATTAGGGGCTGA I I I I M A 

AACTGTGTGAGATATTAACCAGCCGCCCTGTTATAAAATCAGGAAATNCAAACAGCGATT 

TACACCGATTAACACCCCCTTTATATATTTTTTACAAAAATCACTGAGAAAATAATNAAC 

GTTTTCATCTCTCTTGGC I Mill I GGTTTTAAAAAGTGTCAAAAAGTCTACATTTAAAT 

NTTAAAAATTTAA 

Sequence 2653 

CACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCGCGGGCAGGTACACCAA 

ATGGATTACAAGCAGCATCCAGCAGAAGACAGACCCCCCAACCCTGCCCACCAGGGCTCA 

CACTCTACAAAACCCTGAGGGCCTAGAAATCTGTAAATGCATCGCCAAGCACTGGGGCTG 

ATTTGCAGTAATTCTCTAAGCAAGGCAAACATGATCTAGCTTTGAAGGCAGCATGAAGGC 

AGCGGGTTGGTGAGAATAATCTCTCCTTAAGAGAAGAAGAAACCTGGGGCGGAAGGAGTT 

TTCCCCGAAGTGGCTTCCCGCGTACAGAACACAGAACTTATTTCTGTCAGTTATTTAATA 

CATTGAAAATTTAGTGAMTGTTCAAAGAGAATAGATGTTTCCCAAMCAACAACTTTAT 

GTTAAAAATAGTCATTAAAAGATCTGTTGTAATTAAAA 

Sequence 2654 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGGTAAAGGTATAGT 

AGTTGGCAGCAGAATGGACCCATTGAGGATAACTATAAAATTACNGAAAATATTTAAATG 

CAACTTATTGCTATAGAAGCAAAGAGGACTAAAGGGCAAAATTCTAGAGAGTGGTAAATC 

TCAGAAAGCACAAGCATAAAATGCAGCTCTGGGGGCCCTTTCCACTTCTGGCTATAGGGA 

AGAACCTGAATACTGAACTTGATTCAGGCAGAGGACCATAACCTGGGGGTCAGGGGAAGG 

CATGGGGGGGACCAGAAACCAGAAGAACGATCAAGACTGCAATGAAAAAAATGGATACAT 

TAGGAGCTTCAAACACATATMTTTCTCAAGAAATTTCCAGATTCTCATGCTGCATAGGG 
CAGGAGCCTGMAAGCTAATTTGAGMGATMTAAGTTGGATTTTTGNTTTGTTTTGCA^

TTTGCAAGTACCTGCCCGTGCCGGC 

Sequence 2655 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGGAGGTACCACTGAATCCAAGG 
CTCTCTTGGGTAGCCTATGTGCCTCTTGGATGGTATGTGGAAGCCAAGGACTGTCTGAAC 
GTGCTGAACAAGAGCAACGAGGGGAAAGAATTACTCGTCCCACTGACGAGTTCTATGTAT 
GTCCCTGGGAAGCTGCATGATGTGGAACACGTGCTCATCGATGTGGGAACTGGGTACCT 
Sequence 2656 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCCCCGGGCAGGTACTTGAACAGTAGGAGG 

AGGTGGTTCCTCATTCGTCTCCCGGGAGCGTCCTCTTCTCAGTCAGGCTGGCACCATGAC 

CCAAGGAACCTCGGCGAGTGACGGATAAACACCAGTCGGCCCGCGACTAAGAGCTGCGCC 

CCCGCGTACGCGGGTCACCAGGGTCAGTTTCTTTAATGATGGTTTCCAACTGGCCTAATA 

CATTAAGTAAGACTGGCTGATAACATGACCAGACAGACATAAAGACCCTGTTGGGAATGA 

CATTGAACTCTCAAAGTCAAGATTTCTTACACAAATCTATCAGCTGGAGAAAATGAAGGC 

AGTGTGGTATATGTGTGCAAATAAGGACATTATGAAGCTTAAATATGGAATGTCTCTTGG 

ACCCCCGATGTCATCTGNATTCT C I I I I 1 CTTCTTGTACCCT 

Sequence 2657 

CGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACCATCTTGGCTCACTGCAA 

CCTCAACCTCCTGGGTTCAAGCGATCCTGCTGCCCCAGCACCACCCCCTCTCCAAGTAGC 

TGGGACTACAGAAGTGCATCACCATGCCCAGCTAGCTAACTTGMTTTTTAGTAGAGACA 

GGGTTTCACCAGGTTGCCCAGGCTGGTCTTCTGAGCTCAAGCAATCCACCTNCCTTGGCC 

TTCCAMGTGCTGGGATTACAGGCGTGAGCCACCGCACCCGGCCCATTTTATATATTATT 

TTTGCATAGCCTTCATTGTCTTAGCAAAGTCAGAAGAAGACCANTAACATAATGTNATTT 

AATTAACAACAACGNCAACAACAACAACAAAACAACCCNGGATGGGGGCAGTGGCTCAAG 

CTANTAATGGCCGNCCTTATGAAAGNCCAAGGGGNGNGGATTGGTTGATTCCATTTTAAA

ACCMACTGMNCCCTNTTTTTTANAAAAAATMACCGTTAAAAAAATCCTTGGTTTTTG 

TTTAMNGNGCCGGTCCCCCCCCCMGGAGAAAATTTTTNGGGGGGGGGGNGTTAAAAAA 
AA 

Sequence 2658 

GGGCG AATCGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTN I I I I I I I 1 1 H I I I 1 1 I I 

inn ICTCCAKIAGGCTAGTTTTTCTTCAGTCCTTMNMCTCTGCTCCTTATATGGGCT 

TCGGTGGCAGTCATGGGGCAGCACCGCAGGTCTACAGTGGGGTGGAGGTGTTCGGTCCTT 

GCGGGGCTTCATGACCTTGATTTNTGGCGGTCGGGGGGCANCACCCNCAGGTCTACAGTG 

GGGGGAAGGNGTTCGATCCTTGTGGGCTTAATGACCTTTGATTCCTGACTACCGGGCTGT 

GAATNGGCACAACTTCACACAAGTATTGTGNTTTCNCATCNACCCTGGGAAAGAACCCTG 

CCCCGGGGGGGCGGGGTTT 

Sequence 2659 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGCCCGGGCAGGTACTTTNTTTTTTT

MMM) MAJN I I III M M II 11 I MIllM II IN I IMMMMMMI IM1M 

I n I M u I I I M I M I I i I NAAAAA'I M I M M M I I I NAAAAAAAAAANCCNCCNCCC 

CCCNAANNNCCCCCNAAANAAAAAMAMNNNNTTNAAAAAAATTTCNCGGGNNNTTCCC 

CCCNNAANNTTTNT NTNGGC NAAAAAAANNNNNNN M M M M NGGGGCCCNNCCNTAAA 

AAMAMNTTTAAAATTTTTTGGGGGGGNCCNGGGNANCCCCAATTTTNANNTTNGGGGG 

GTTNAAAAAAANAAANGGGGNAANNAAAAAAANGGNCCNNNNAATTNGNNTTNTNAAAAT

TNCNCCCNNNAMANTTTTTCTNAANCCCCMMNGGGANGGGNCNNTNGGTTTTTTNAN 

GGTAANGGGGGGCCAGGGGGGGNTTTTCCCAANTTTTNCCCGGCCCAAAAAAGGAAAAAT

I I I 1 1 1 I I I GTTGGGGGGAGGGAGTTTTNAAAAA 

Sequence 2660 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGGAACAGGGATAAG 

TTCTTGGATMGGTGCCAACATACCTATAAAAGCTGATTTTTGAGTAAATTATTGATTCT 

AACATATGTMTGGATTTGGTGTGATAATTTTGTGATCTTTMCTATAAGTGACTTTTTA 
TTCTCCACCAGAAAAGATAAATGACTGAGAATGTAAGTCTGCGCTCTGATTAACACAATG

GAGAAACGGAAAAACTATCTCTGTTAAAAACTGATTCCTGTCATTCTTCTGATATCAAAT

MGAGGMGGMMTAMCTTTTTGTGTGTAGATAGAAAAACATACCTGAGGCCAGGTGC 

AGTGGATCACGCCTGTAATCCCAGCACTTTGGGAGGCCAAGGCGGGCAGATCAGCTGAGG 

TCAGGAGTTCGAGACCAGCCTGGCCAACATGGTGAAATCACGTCTCTACTAAAAATACAA 

AAATTATCTGGGTGTAGTGGTGCGTGCCTGTAATCCCAGNTACTCGGGAGGCTGAGGCAG 

GAGAATCACTTTAATTC 

Sequence 2661 

TCGGAGCTCNCCGCGGTGGCGGCCGGGCAGGTACTCTGGCTGCAGACTGACCTTGCTCAG 

GTCCGAGAAGGATGGGGCAGCCACTGGAGTGGATGCCATCTGCACCCACCGTCTTGACCC 

CAAAAGCCCTGGAGTGGACAGGGAGCAGCTATACTGGGAGCTGAGCCAGCTGACCAATGG 

CATCAAAGAGCTGGGCCCCTACACCCTGGACAGGAACAGTCTCTATGTCAATGGTTTCAC 

CCATCAGACCTCTGCGCCCAACACCAGCACTCCTGGGACCTCACAGTGGACCTTGGGACC 

TCAGGGACTCCATCCTTCCTTCCCAAGCCCTACATTTGNTTGGCCCTTTTCTGGTGCCAT 

ACACCCTTA 

Sequence 2662 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGAGGGAGGGACATACAAT 

ATTTMTAGGATATTTCTACAGAACAATAACTTATATTATGTCCTTGTAAAAATCTGTAC 

CTCTTTAAAACATTTAACTG AAACATCCA I I I I I I I I I AGCTTTGCTAATCAAAATTGTT 

TTAAGAATTAAAACTAGGTTGTAACTAATGTCAGTACCTGCCCGGGCGGCCGCCCGGGCA 

GGTACATAGGCATCCTATTCACTGCACCCTGTCACACCCGGCACCCCCCGCCCCGCACAT 

TATTrGAAAGACTGGGAATTTAATGGTTAGGGACAGTAAATCTACTTCTTTTTCCA 

Sequence 2663 

AGTTCGGTGTAGGTCGTTTCGCTCCAAGCTGGGCTTGTGGTGCACNGAACCCCCCGTTTT 
CAGCCCGACCCGGNTGCGCCTTATTCCGGTAACTATTCGTTCTTTGAGTCCCMCCC 
Sequence 2664 

AGGTACTGCAACTGCCAGAACTTGGTATTGTAGCTGCTGCCCGCTGACTAGCAGCTGGAC 

TGATTTTGAATAAAAATGAMGCATTMAGGGTTTCCCTACAAMCATTTTTCTT^ 

TACTTTTGAAAATGGCTATMNCAGTTGACTTTCACCCTTGGAGAGCATCACACTGTGTG 

AGGTTCAGTGATTGTTGACCCTNCCCAGCCCCTNCTGCTTCTTTAAGTTATCTGTGTGCG 

TGCNCTTCCTCTCMTCTTNTTTTGCACCGCTCATTTNTTTTTCTCTGACCCATGAAGAA 

AAGGAAAAACTTTACTGATTGATAAANTTTTTAAAANAA 

Sequence 2665 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTAAI » I M I 1 1 I I I I 

TTTTTTGGTAGMGTGGTGTCTCACTATATTGCCTTGGCTGGTATTGMTTCTTGGGCTC 

AAGCAATCCTCCCCTCTTGGCCTCCCAAAGTGCTGGGATTACAGGCATGAGCCACCTCAC

CCAGCCCACTTATTCATCTTTTTGCCTGCAAGCTACACCACCAAAGCCCCAGGTCAAACA 

TCTTTCTCCACAGACTGTGAGAAAAGAGCCTTTCTTCCTCCTMTTTGTAATGGCTTTCA 

GCTCTATNTGTCTAGCTTCAATCCTGACATCTGCAGCTTACA 

Sequence 2666 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAATGAGGCTTGGCTCTGTGTGAT 

GCACTCTAACCAGCCCACACTTCAGCAGCGGCATCATTTGGATTCAGGGAGAGTTTCTGC 

AGCATGAAGGAGTAGGAAGAACACAGGGTGAGTTCACAGGAATCCCTACTTTTCCATGTG 

ACCTTGTTTGGCCTGMGTGATTTTCTTGACCTCTCTGGGCCTGACTTTCCTCATCTGTA 

AAGTGGAGGGTTTGAAAATATCACAGAAAACAAAATACCTTAAAGGATGCTCTGGCACAC 

AGCAGGTATTTCCATATAATGATACCTCCCATTCCTTTTTATGTGAGCTATATCCCCTGA 

AACCCAGGTTTGACTAAATTGAGACCAACTTTCATAATATACACAATGACTGNTAGATAT

GAATTTTGGTGTGGTGAAGATGGGGAGTGAAAAAAGTAGAAAAAGTCAAATCTCATTGAA 

TAAAAAAGGG 

Sequence 2667 

NNGGCGGCCGCCCGGGCAGGTACTTTCTTTNCTTTTTTTN 1 1 I I I I I I GAGGCAGAGTCT 
NGCTCTGTTGCCCAGGCTGGAGTGCAGCGGTGCGATCTTGGCTCACTGCAAGCTCCGCCT
CCCGGGTTCACGCCATTCTCCTGCCTNAGCCTCCCAAGGAGCTGGGACTACAGGCTCCCG
CCACCACGCCTGGCTMTTT7TTTGTATTTTTAGTAM 

AGGATGGTCTCGATCTCCTGACCTNATGATCCGCCCGCTCTGTCTTCCCAAAGNGCTGGN 

ATNACAGGGCCTGANCCATTGTGCCCAGCCAAANTGTNCCTTTGNAAAGTTNGCGAAATC

AGATTTTGTTTCCCMTAGMCCAAMTTTTTATGAGGGATGCTAGCATTTTTCC 

ATANTMTTAGTTTACMCCTGMNAAATATTATGTTTTGTANTAGATAAATATTAAGGT 

GNGCATTTTAA 

Sequence 2668 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCTTATTACAT 
ATGATTTTTATTAGTTTCTGGAGGCAMTTTAATTTTTATTTTAM 

TAAAAGAAATAGTTCTCAAAAAGACAACGATGACTGGGTGTGGTGGTGTGTGCCTGTAGT 

TCAAGGCTGCTCGGGAGACAGAGGCAGGAGAACCACTTGAGGCCAGTTCAGTCTAGCCTG 

GGTAACATAGCAGGACCCTGTCCCTAAAATAATAAAAAATTTAAAAACCACAATAATGTG 

AGTTACAAAAAAGTGTAACTTATGAAAAGTCCGTATTTATATTGAC 

Sequence 2669 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGCAGGGGT 

GGGAGCATTTATGGTGATCAGATGTGCCTGGCAAGCCCCTGTTCAGACATTGCTCACATT 

CCAAATGTTTTCGTGTAGAATATTGCACAGGTCTGGGGACGCTCTACCTGTGCCCTGTGA 

GTGTTAATAATGGTGGAGAAAGAGTGTAGCTGTGCCCTTGAGAGAGAAGGTGAGGGAAAG 

AGTGCACCAGTCAGCTGACCGTCAGCTGGCTAGGCTCTTCACTGAGTCCTATGTCGCAGT 

GCACAAATCACTGCCCATCANGCCTCAGTTTCCTCATCTGGTAAATGGTGATAACATCAA 

TCTGCCCCCCCGCCAGGGTGCTGTTATGAGGGTCAAAAGTGGTAGTGGAGGGTAATACTG 

GNTGAGTCCATTTGTGTGTGGGAGGAAGAAAGGCTTTACATTNACCTGGTACCTTGGGCC 

GCTTTAAGAACTAGGTGGATNCCCCCGGGCTGCANGAAATTTNATATTAAAGCTTATTCG 

ATTCCCGTCCACCTTNGAGGGGGGGG 

Sequence 2670 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCTTATTACATAT

GAl l 1 1 lATTAGTTTCTGGAGGCAAATTTAATTTTTATTTTAMATCAMAT 

AAAGAAATAGTTCTCAAAAAGACAACCGATGACTGGGTGTGGTGGTGTGTGCCTGTAGTT 

CAGGCTGCTCGGGAGACAGAGGCAGGAGAACCACTTGAGGCCAGTTCAGTCTAGCCTGGG 

TAACATAGCAGGACCCTGTCCCTAAAATAATAAAAAATTTAAAAACCACAATAATGTGAG 

TTACAAAAAAGTGGTMCTTATGAAAMGGTCGGMTTTAATATTGACCTTTGGTAAGAG 

CTCAATTANTTTCAGGGAAAGGCAAGGGAGTATCACCATTCTGAGTAATACAATTTCAAC 

TMTCTTTATTTCTTCTACTTGAAGTCAGTGCCTATCTACCACAAACATTCCTATATCAG 

TGTGCAAATTMTTTNGAGACMTAGGCTTTTTACGACMTGMTTGGTTACTTTNAA 

Sequence 2671 

ACGGCGA A TTGGAG CTCCCCGCGGTGGCGGCCGAGGTACTTNN ) I M I I I ) I I I I ) I 1 1 1 
TTTTTTTTCATTTTTATMGAATATATAMAAATGATATAMNGGACATTTACGGTAGTG 
GGGGAAGGCATATATNTACGTTAAAAGGCAGGACATTTTTAA 
Sequence 2672 

TACGAGCCATTTACAAAAAATCGACCGCTTCAAGTCAGAGGGTGGCNGAAAACCCGACAG 

GGACTATTAAAGAATACCAAGGNCGTTTCCCCCCTGGGNAAGCTCCCCTCGTGCCGCTCT 

TTCTNGTTTCCCGAACCCCTGTCCGCTTTANNCGGGAATACCCTGGTCCCNGCCTTTTTC 

TTCCCTTTTC 

Sequence 2673 

CCGCGGTGGCGGCCGAGGTACGCGGGATGTGGTCTAATCAAAGCCATCTCAATTTGTAGA 

TGAAGAAGGCAAGGACTAATGACAAGAAATGAATTGTTGGCCGGGCATGGTGGCTCACGC 

CTGTAATACCAACACTTTGGGAGGCCAAGGCTGGTGGATCACCTGGGCTTGGGAGTTCGA 

GACCAGCCTAACCAACATGGAGAAACCCCGCCTCTACAAAAAAAAAAATACAAAAATTAG 

CTGGGCATGATGGCGGGCGCGCCCCCTAATTCCAGCTACTCCTGAGGCTGAGGCAGGAGA 
ATTGCTTGAACCCAGGAGGCGGAGGTTGCGGTGAGCTAAGATCGCACCATTGCACTCCGC

ACTCCGGCCTGGACMCAAGAGCAMMCTTTTGTCTTAAAAAAAAAAAAAAAAAGTACC 
TGC 

Sequence 2674 

CCGCGGTGGCGGCCGACGNACCNGGTNCTCTGTGGGATACNCATTAGAGTTGCTCGNTGG 
ANATGGMTGATGGTGGGGTGCATTTNANCATGGCTGANTGCTTTCTGCTTAAGGACCTG 
ATGTATTAATGCTCTCCANGTCATTCATATTTGGGGGAAGGAACAAAGAAGGGTACCTN 
Sequence 2675 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGTAGCATGATCTTGGCTTA 

CTGCAACCTCCGCCTCCCTGGTTCAAGCAATTCTCCTGCCTCAGCCTNCCGAATAGCTGG 

GATTACAGGCATGCACCACCATACCCGGCTAATTTTTGTATTTTTAGGAAGAGATGGGGT 

TTCACCATGTTGGCCAGGCTGGTCTCGAACTCCTGACCTCAGGTGATTCACCCGTCTTGG 

CCTCCCAAAGTGCTGGGATTACAGGTGTGAGCCACTGTGCCCGGCCTCCATTACACCTCT 

TTATTCTAGTTCAACTCAGACCGTGAAGTTAGCATACAGGTCCTCAGGAGTTTGAGGCCA 

CTTTCC MGG ATAAGGGCCACCTTCMGGGCACATCTTGCCCTTAAACAATTAAATTTNT 

GMAGCTTTTGGGAAAAGGGGNNGAACATGCCTTTT 

Sequence 2676 

ACTTAGGGCGAA TTGGAGCTCCCCGCGGTGGCCGGCCGCCCGGGCAGGTAC I 1 1 1 I I I I I 

I II I I I 1 I I M I IGAGATATAAATTGGTATGTTCACATGTTGGTAAGAAAGATCTAATAG 

CTAGAAAAGGAACTGCTGAAGTTGCAGGAAAGAGAAGAGAAATNTAAGGAATGAAACCTG 

AAAAATAGAGGGCAGAGTCAGAGGGGTGGAGAAGGAAGGGGCCCTGACGGGAGGNGGACC 

ACTTTTTNCAAGGCACATGCAGGCAGGCACCTNTNTGGAGGGTCCCATGGCTGGCTATGT 

GTGGCAGTGCATGAAATAGTCACTGAATATGCTCCCTTGCTGTCCTCTGCTTGTGCTCAC 

CGAGGAGTGACTGTGTCTACTGGAAACAAGTTTGCCTTTTCACTGGTCTAAATAAANGGA 

TCAATGACCTCAAATCTAAAGAGCATTAGGAAACCTTNTNAAGCCANAACTGTACCCTTG 

GCCGNTNTAAAACTAAGGGGGATCCCCCCGGCTGCAG 

Sequence 2677 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGGCTGCGTGCGGCGGG 

AATCATGGCTGCTCGCAGAGCTCTGCACTTCGTATTCAAAGTGGGAAACCGCTTCCAGAC 

GGCGCGTTTCTATCGGGACGTCCTG(3GGATGAAGGTGGAGTCTTGCTCTGTCGCCAGGCT 

GGAGTGCAGTGGCGCGATCTCGGCTCACTGCTCTGACTACACCCGGATAACAGAAGACAG 

TTTCTCAAAGCCTTATGATGGGAAATGGAGTAAAACAATGGTGGGATTTGGGCCTGAGGA

TGATCATTTTGTCGCAGAACTGACTTACAATTATGGCGTCGGAGACTACAAGCTTGGCAA 

TGACTTTATGGGAATCACGCTCGCTTCTAGCCAGGCTGTCANGCAAACGCCAGGAAGCTG 

GMGTGGCCACTGACCGGAAAGTTGCANGAAAGGTGTTTTTTTAAACCCAGG 

Sequence 2678 

NCCCGGGCAGGNAC I I I 1 1 I I I Ml I 1 1 1 1 I I I I 1 I I I I I I i I AATGGGGATTTGTCTAT 

TAAAATAGGTTGTNGGAAGNACCCCANAGGACCCTTNTTGGATCAGGAATCAATAATCCC 

ACCNAGACGGGGNCCCAGCATTTATTGGGNCGCAGCATAAAACTTGGCCACCTCCTTATT 

GGGGNTGCCCTGGGCTGAGTCAAACCACATTTGGATGCANCnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnhCTGCAGGAATTTCCATTATTNAAAGCCTTATCGGATNCCCGTTCGAC 

CTTCNAGGGGGGGGGGCCC 

Sequence 2679 

TAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACTAACANGGGTGTCTGGTCTT 

AGAAGCCTCCCTTCAGATCCCAGCTGACCCTGGTGACTGCCTGGCCTTGATGTTGGCTGC 

AGCCTTCTGATAGAACCACATGGATTCCACCCACAGCTGGCCAGGCTTGTTACATGGGTC 

AAGGGAATACAAATGGCCCCCCCCCAGGGAGCAGGTGTTGGCCTCAGTTTTCAGGGACCC 

TTGGTGTTGCTCCTTACCTAGAGCCCATTAATCTACCCCATNAACTCTCTTGCCATGAAA 

AGCCATCTTCCAGGAGCCCTGTTTT 

Sequence 2680 

AGGTACTCAAAGACGAATCATGAAAAAGAAAAAAACTTTATTTCAAACAGGTTCAGTGAT 
ATATGTGGGTGCTNCAGCAMGGCTGGTTGTGGCAMGTTTCATTTCAAACTGTATGATG

TGGGCTGGGCAAGGTGGCTTCACGCCTGTAATCCCAGCACTTTGTGCCCGCGTACTCAGC 

TGTGTTCATGTGGNGGTCTGTGGAAAGAAAAGAAGACTCGTTTGGAAATGAAGCTGTCCC 

TTTCCAAGCAAGTCTTCTGGTGGCTTTTCTTCTCTCAAAAATGGGATCCCGATAAAATAT 

TTGAATAGGAGCNGAATTGGTAGAAATGTTCGTGCCTGTCACCCCAGAAAAGCCTTGCCT 

GGTTTT 

Sequence 2681 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTGGTTTAAT 

TTCACAGTGAGGGAGTCGGTTGGTGATCTCTAAGAAATCCCCAAGCACCTGGTGTTGGGA 

AA GTCCCT CAAATAAAGAAGTGTTCTTTTC I 1 1 I I I I I I CCTCTCTCTCTTTCTATTTAT 

TTATTTTTTGAGATGGAGTCTTGCTCTGTCACCCAGACTGGAGTGCAGTGGCAGGATCTC 

AGCTCACTGCAACCTCCACCTCCCGGGTTCAAGCGATTCTCCTGCCTCAGCCTCCCGAAT 

AGCTGGGATTACAGACACCCACCACCACGCCCAGCTMTTTTTGTATTTTTAGTAGAGAC 

GGGGTTTCACTATGTTTTGTCAGGCTGGTCTCGAACTCCCAACCTCAGGTGATCCACCAC 

TTAACCT 

Sequence 2682 

CCTCAGATTTTGGGCCTAGGAAGGTAGGTGATTTAAACTCACTGAAAGCATGTACACCTT 
GCTGTTGCTGCTTGCTGCCACTGCTGCTGTTTCATCTGTTCCTGCTGCCGCTGGAATCGT 
GGAGGTAAAGACTTCTGAAACTGTTTGAAATAGCCAGATAATACAGC 
Sequence 2683 

GCGAATTGG AGCTCCCCGCGGTGGCGGCCGAGGTACTTTC' I I I I I I I I CTTTTTCTTTTT 

CI I M I I 1 1 I AATGTGAGACAGGATCTCATTCTGTTGCCTAGGCTGGAGTGCAGTGGCGC 

AATCTCGGCTCACTGCAACCTCTGCCTCCTGGGCTCAAGCAATTCTCCCACCTCAGCCTC 

CCAAATAGCTGGGATCACTGGCACAAACCACCATGCCCAGCTMTTTTGTATTTTTTGTA 

GAGACAGGGTTTCACCATGTTGCCCAGGCTGGTCTCAACCTCCTGGGCTCAAGCAATCCT 

CCTGCCTCGGCCTCCAAAGTGCTGGGATTACAGATGTGAGCCACCGCATTCAGCCCCACA 

CCCTTATTTATACCMTTACCTGCCCAGTMCTGNGGACTTTTGCTTCCTCACCCTGTTT 

TGATCGCTTTMAACTAAGTGGGATCCCCCCGGGCTGCAGGAATTTCGATATTCAAGCTT 

ATTCGAATACCCGCCGCCTTCGANGGGGGGGG 

Sequence 2684 

GACTACTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACTTTTC

TTTTCTTTTCTTTTC I I I I I I I I CI I I I I I I I GAGACGG AGTTTTGCTCTTGTTGCCCAG 

GCTGGAGTGCAATGACATGATCTCAGCTCACCACAACGTCCACCTCCCAGGTTCAAGTGA 

TTCTCCTGCCTCAGCCTTCCCAAGTAGCTGGGATTATAGGCATGTGCCACCAAGCCTGGC

TMTTTTCTATTTTTMGTAGAGATGGGGTTTTCTCCATGTGGGTCAGGCTGGTCTTNGA 

ACTCCTGACCTCAGGTGATCCACCCACCTCGGGNCTCCCAAAAGTGCTGGGATTACAAGG 

CCGTGGAGCCCACCGNACCCAGCCAAAGGCCAAAATNCTNAAGTCTTTTGN i I I I I I I I I 

TCAMTTGAGGGGTNGTTATTTMCAACAAGCTTGGATCATTTGAGNCCAATTNTTTTGG 

TTCCCCGTTTCCCCACCCGCCAATTCN I ill I I I I II C 

Sequence 2685 

AGGTACMCATTTAGAGMCCTTAGCTGCCCAGAAAAACTCAGATTTTCTGCTTTACAAA 

AGAATAAAAATCATCGMTTTATTACCCTGGACTTTATTGGAAATCAGTGAAGAATATTC 

ATACCAAATACCAGGTTTCACGAACNTTNCCTCTCTCTCTTTCNTTCAAGGTAAGGGGGT 

TTGGNCMAAGGNTTGTTTNTTTTTNGAAAANGCCGCCCNGNTTGGNNCNAGGTGGNNCC 

TGTTTMGGGTTGGGCNNACGGGGGGTGGTTMGGAAAAMTTCAAATTTAMNCNTTTT 

TATATTTTCCCGGGGGCTGNGTTTTAANCTTTAAAACCCCCTNGGTTTTNCCAAAGGGGN 

AATTCCCAAGGTTTTTTAAAAAAAGGCCNN I I I I I I I I I I I GGGGACCCTNGCCCNGGNC 

GGGGCCCTTTTANMAAAMGGGGGGGNTNCCCCCGGGGGGTGGGANGGAANTTTTGGAN 

NTTTAAGGC I I 1 1 I I GGGTANCCGGGGAACCCTTTNAGGGGGGGGGGG 

Sequence 2686 

CCGCGGTGGCGGCCGAGGTACTCCAACCCAAGCAACAGAGCAAGACCCTGTCTCAAAACA 
AAACAAAACAAAACAAACAAAAAAAATGAGGTAGGCATGTTTTATTCCCATCTTACAGAT
GAGGAGACTGAGGCAATAATAATTCAATGGCTTATCTAAAGTCACAAAGCTAGTAAGGAG 
CAAAATCCAGTTNTGTCTGCTTCCAGCCCACCTTGTCCCACTTGCTTCTTTATT 
Sequence 2687 

TACTATAGGGCGAAT TGGAGCTCACCGCGGTGGCGGCCGCCCGGGCMGGTACTTTTTTT 

n I 1 1 1 I I I I I I I I J I GGGTATATAAACTATTTATTAACAGACAAGGCCTACAGACTTAT 

TTCTTCTTGGACACACCCACGGNGCGGGCACGGGCGGNNMGTNGGTTTTGGTGTGCNTN 

GCNTTNGNACACTAAA GGGCC CAAAMGTTGACGCANTCCCTTTTNTGGGCCCNMTTTT 

TTTAANTCNTTCNAAGGTTTTTNACGGGAGNTTGTTGTCCAGACCATTGGCTAGGACCTG

GCTTGTATTTTCCATCNTTTANATCCTTNTTGTTCTGTTCAAGAACCANTCTGGGGATCT 

TGTACCTNGGNCCGCTNTAGAAACTAGTGnnnnnnnnnnnnnnnnnnnnnnnTGGATATT 

AAAGCTTATTCGANNACCCGTCCNNCCNTTCAANGGGGGGGG 

Sequence 2688 

AACACTACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACAGTG 
GAGAAAGAAGGGGGACCCCACCAGGGCTATGGAGAGACAGTAGAGGCAGGACTGAACCGT 
CAGCAAAGATTAAAAGGATGACCTGAGGCTGGCAACCACAAAACC 
Sequence 2689 

CACTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAGAGTCT 

TTTGCTTCCTCCCACCCCTAGGGGGAAAMCTGCTTTGTGCTTTGGGAAGTTGTCTCTGA 

AACCCGGGGACAGAGGACGCAGGACAGACTANGAGGGAGCCCGGGAGGATGGGCTGCANC 

TGTGGAGGAGGGTTTTAAAGGAGAGAGGGTCGGAGAGCAAAAGGCCTNAANAAGCCCANA 

AGCAAGTTGAGAGAGGGTGGAAAAGTGAACCACCGGCTTGGGCTTGAACCCGACACGCTT 

TTCTTCCAATGGTTAAMTAGCCACCTTTTAGAAAAAMTTTNACAAAGGNTCCCATTCC 

NCNAAAAAAAAAAAAAAAAGGAAAAA 

Sequence 2690 

TATACGACTACTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGACG 

GGAAGGATTCCTGCCAGGGTGATTCTGGGGGTCCGCTGGTATGTGGAGACCACCTCCGAG 

GCCTTGTGTCATGGGGTAACATCCCCTGTGGATCAAAGGAGAAAGCCAGGAGTCTACACC 

AACGTCTNGAGATCACGAACTGGGATCCAAAAAACCATTCAGGCCAAGTGACCCTTNACA 

TGTGACATTTTACCTTCCGACCTACCACCCCACTTGACTGGTTTCAGMCGTTTTTNACC 

TAMCCTTGGCTTTCCCTTNTNTNNTGGCCAGNTTTTNACCCTGATGCGTAAAAAACGCA 

ACCGACGTGAGGGGTCCTGANThfTCCCTGGGNTTTACCCCAACTCCATNCTTNGATTAAT 

GGGGG 

Sequence 2691 

CACTACTTAG GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGGGCACGGTACI I I I I M M 

t I l l l l l l I I l GGTAGGAGATGAGGTCTCATTATATTGCCCAGGCTGGTCTCAAATTCCT 

GGCCTCAAGTAATCCTCTCGCCTAGGCCCATAGTATTGGGATTACGGGCGTGAGCCACTG

CGCCCAGGCTTTACTAGTTTTCCATCTATCTTTAGGCCTNCTCANATTTCTTGTGGGGCT 

CCACTTCCACCAGTCACACCTTTAA 

Sequence 2692 

TACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGCACTCAAA 

AAAATTATTTCTCAATTTGTTTGCTCATACTATTGA1 I I I I I 1 1 CTAGAATCTAAATAAT 

TGTGAATTAACTATGATAGCAGTATGCACAAACAGTAAGTGAAATCAGACCTATTAATTC 

TGAGAGGAAGGAGGTGTCAGGATTTTCACAGAAGAAGAGCACTNAGGCCAGGCGCAGTGG 

CTTATGCCTGTAATNCCAGCACTTTGGGAGGCTGAGGCAGGCNGATNAAGGNGGNCAGGN

GTTTTANACCAGCCANCCAAACNTGGGGAMCCCCGNTTTTTANTAMANTCCMAATTT 

CCCCG CCTGTANTCCC AGTT ACTN AG N GGGG NTTGGG NC AAAAAA I M I NI I ACNCCCG 

GNGGGNGNGGGTTTTNCAGAGNNTATAMTCGCGCGNTTNGCTTTNAAATTTTNGGGNAA 

A A AAG A A AC G AGTTC N CTTTTG AA AAA A AAA A AAAAAA AAA AAN G G 

Sequence 2693 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATAAGCCTAA 
ACMTTTCACCTANGTAAAATATTGATGTCATAACCAAACTATATGGCCCCGTTTCATAA

AGGTTACTATATTCTATAGAGAGTGMGAGGTGGCCTTTCTATCCCAGCTTACCCTATTC 

TTGTTATTGTTCAAATTCTCCTGAAGCTTGCATAACTAGCTGCCATCAGGTAAATGCTAT 

TGGCTAGCAGAAGACTGCAGTTCTGTTAATATTAGAACCAGCAGGGGGAACTTGGGAACT 

TGACATTAAAAATCTAGAAACGGAATTTTA 

Sequence 2694 

CCGCGGTGGCGGCCGCCCGGGGAGGTA C I 11 I 1 I I 1 I I I I I 1 1 1 1 I I 1 GGGCGGGGGTCT 
TTATTTGAGTTTAGGCATGATTCGAATGAAGAGGATCATGCTAATGAAGATGAAGCAGAC 
GATAATGAGCGTGGCCCAGAGCAGCCAGTTGACTGACT 
Sequence 2695 

GAGCTCCCCGCGGTGGCGGCCGAGGTACACCTGTGGTCCCAGTTACTCCAGAGGCTGAAG 

TGAGAGAGTCTCGTGAGCCCAGAAAGTTGAGGCTGCAGTAAGCTGAGCCATGATTGCACC 

ACTGCACTGTAGCCTGTCTACAAACAAATAAACGAAAAAACAAAAAGACTTGTGAAAAGT

GCTGATTTTTAATTAGGAAMGATTAMCATTGGATAGTCATGGMTTGTTTACTGAACA 

TTAGAAATTGGTTGCAAGGGTCTATGCTTCTGTAAAATAAA 

Sequence 2696 

ACTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATATATGAATCACTA 

AGAGTTCAGAAATTAAGCTCACTTTAAGAAAACTCAGCCAGGCACAGTGGCTCACGCCTG 

TAATCCCAGCACTTTGGGAGGCCAAGGCGGGTGGATCACGACGTCAGGAGTCCAAGACCA 

GCTTGACATGGTGAAACCCTGTCTCTACTAAAAATACAAAAATTAGCCGAGCGTGGTGAC 

ACACGCCTGTAATCCCAGCTACTCAGGAGGCAAGGCAGGGGAATTGCTTGAACCAGGGAG 

GCGGAGGTTTGCAGTGAGCCCAGATCGCGCCATTGCACTCCAGCCTGGACAAGAGAGCGA 

GGACTCTGACTTCCAAAAAAAAAAAGGAAGAAAAAAAAAAGTTCCCTTGCCCGGGCCGGN 

CGNTTCTAAAAACTAGGNGGGNTNCCCCCGGGGCTTNNANGGAATTTNGATTATCAAAGC 

TTTNTCGANNNCCCGTCGACCCTCGAGGGGGGGGC 

Sequence 2697 

CTACTTAGGGCGAA TTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTAC I I I I I I I I 

n I 1 1 I I I I I I I I I I IGGTANGGGACCGGGGTTTTACCATGTTGGCCAGGATGATCTCGA 

TCTCCTGACCTCATGANCTGCCTGCCTCGGCCTTCCAAAGTGCTGGGATTACAGGCATGA 

GCCACTGNGCCAGGCCTTTTGTCCATTTTTTATTGAACTGCCTATTNCTTCTTACTGATT 

TGTAGAAAGTCTTTATTCTAGTCTGGCAGGTATTTTNTTTCMCACTTCCMGATTATTN 

ATTGGCTTTGGNAGCTAATGCTTCATTTGANAAAATGCTACCTGTATTATTGATGCCCAT 



Sequence 2698 

CACTACTTAGGGCGATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACAATTTCT

TTTTTC1 I I I I I I I I I 1 I I I I 1 1 I I 1 1 CTTTTTCTTTGAGACTGGGTCTCGCTTTGTTGC 

CCAGGCTGGAGTGGAGTGGCGTGATCTTGGC7TACTGCAGCCTTTGCCTCCCCGGCTCGA 

GCAGTCCTGCCTCAGCCTCCGGAGTAGCTGGGACCACAGGTTCATGCCACCATGGCCAGC

CNACTTTTGCATGTTTTGTANAGATGGGGTNNNACAGNGTTGCCCAGGCTGGNCTTAAAC 

TCCTGGGCTCAGGCGATCCACCTNTTTTANCCTCCCAAAGTGNTGGGATACAATTGNGAG 

CCACCACGTCCAGCTGGAAGGGTCAANAATTTTTTACATTTTTNGNAGCACAATNTGGAT

TTTTCACCCANCCTTTCCCCTTCTTTTTC^ 

NCNNTTTTNTTTTANAAAAAMMANCNTTTTTTNNCCNAAAAAAAA 
Sequence 2699 

CTACTATAGGGCGM TTGGAGCTCNCCGCGGTGGCCGGCCGCCCGGGCAGGTACTTTTTT 
n M I I I I II I M I I IGI I I I I I I I I I I I I 11 1 I I I ICCGACCAATATGGTTTATTTNTG 
CCCCAGCCAAGCTTNTTTGGACCCTGGCTGGGGGGAAAGGCACCCCAGGCACCGGCAAGT 
TCCAGTCATTGCAGATCCTCCAGGTNTAGGNGTGACTGGAAGTANCCTGGGCACTGNTGC 
TGGACCGTNGGATTCTCCTTCCTTNTTCCGCCGGCGGGTGGTCACCAGGACACCGCANAT 

CAGGCATGTGATGAGTCCCAGGAGTCCTGCCAAGCCGATGAGGATGACAGCCCAAAAGGG 
AAGGT 



GTGA 




Sequence 2700 

ACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTCAGTGGATGCAAATGA

MTTTTATCTTTTGAAAGTAAGAGGAACTATGGCTGGGCGGGGTGGCTCACACCTGTAAT 

CTCAGCACTTTGGGAGGCCAAGGTNGGGCGGATCACCTGAGGTCAGGAGTTCAAGACCAG 

CCTGGCCAACACAGTGAAACCCTGCCTCTACTAAAAATACAAAAATTAGCCAGGCGTGGT 

GGCACGCTCCTGTAATCCCAGCTACTTGGGATGCTGAAGCAGGAGAATCTTTTGAACCTG 

GGAGGCAGAGGTTGCAGGGAGTTTGAGATTGTCGAGTATAGTGGATTGAGTGCTGTGGAG 

CCGAGATTGCCAAGTGCAGTGGATCGAGTCCACTGCACTCCAGTCTGGGCAATAGAGCTA 

AACTCAGTCTNAAAAAAAAAAAAAAAAAAAGTACCTGCCCGGGCGGCCGCTCTAGAACTA 

GGTGGATCCC 

Sequence 2701 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTTTGCTCACCAA 

GCTCTTGAGGCCMGCTTGATACGGCTGTTGTTTTTTTCTACATCGTAGCCAGCAGCCGC 

AAGCGCTTTCTTAAGCGCGGCCAGAGAAAACGCCGCTGCGCTCCTTAGAAGCTGCCACTG 

CCTTGGTGATAAGCTCAGATACTGGGGGTCCGGATGCTTTGCGTTTCCCAGCAGTTGCGC

CTGCCTTCTTCGCCTTTTTCTTCACAAGGTGTTTTTTCTGCGGGTGCAGGAATGGTAGGA 

GCMGTGGAGCAGTCTCCGACATGTTTTTGTCTTCCCAGAAAAGACAATAAGTAATCTCA 

AACTGTCAGAACAGCATGTCCCCCGCGTACCTGCCCGGGCGGCCGCTCTAGAACTAGGTG 

GATCCCCC 

Sequence 2702 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGAGGAAGGGGAGG 

GCTTGGTGTTCCCTACGCGAAGGTCTGACTCAATGCTTATTCCCTCTTCTTCCTCTCTCT 

GCAGACTGAAGAATTGAACCGGGAGGTCGCTGGCCACACGGAGCAGCTCCAGATGAGCAG 

GTCCGAGGTTACTGACCTGCGGCGCACCCTTCAGGGTCTTGAGATTGAGCTGCGGTCACA 

GCTGAGCATGAAAGCTGCCTTGGAAGACACACTGGCAGAAACGGAGGCGCGCTTTGGAGC 

CCAGCTGGCGCATATCCAGGCGCTGATCAGCGGTATTGAAGCCCAGCTGGGCCGATGTGC 

GAGCTGATAGTGAGCGGCAGAATCAGGAGTACCTGCCCGGCCGCTCTAGAACTAG 

Sequence 2703 

CCGCGGTGGCGGCCGAGGTACGCGGGTGTGAGGAGGTGGGGAGACCACCCCACCCCCATG 

TCCACCATGACCCTCTTCCCACGCTGACCTGTGCTCCCTCCCCAANCATNTTTCCTGTTC 

CANANAGNTGGAGCTGAGGTGTCTCCATCTATGNCTCAACTTCATGGTGCACTGAGCTGT 

MCTTC7TCCTTCCCTATTAAAATTAGMCCTGAGTATAMTTTACTTTCTCAAATTCTT 

GCCATGAGAGGTTGATGAGTTMTTAAAGGAGAAGATTCCTAAAATTTGAGAGACAAAAT 

AAATGGAACACATCAAAAAAAAATTAAAAAAAAAGTA

Sequence 2704 

TNAGGG CGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC I I I i I I I 1 1 I I I I I I I I l I 

nrrn ictttci 1 1 1 1 1 ) 1 1 n n 1 1 1 1 n 1 1 1 » 1 1 1 1 1 1 1 1 1 1 ingcatcaaaaagct 

TTATTTCCATTTGGNCCAAGGCTTGTTAGGGATAGTTAM 
GGGAGAGGCTTAGGCANAANCCCTNTTACTTTGNANGGGGCCCTT 
Sequence 2705 

CCGCGGTGGCGGCCGAGGTAC 1 1 I I I I I I I I I 1 1 I I I II 1 1 1 1 GTTTG AGACAGAGTCTC 
ACTCTGTCACTGAGGCTGGANTGCAGNGGCGCGATCTGGGCTCACTGCAACTTCCACCCC 
TTCGNTNTAAGTGATTCTNCTGCCTCANCCTCCNAANTAGCTTGGATCACAGGCGCCCGC 
CACCACACCCGGCTMTTTTTGTATT^ 

GAAAAATAAAGGTGCATGATCAACAATCAAACTTNTAGGACCGTCCCCTANCAGGAGAGC 

AGCAGCAGNAGCAGCACACANACCTGCCC 

Sequence 2706 

CACTACTTAGGGCNATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGAGCTNACCACTT 
CTMGAAACTCCAANAAAGGAANCATGTGTNTTNTATTCTGACTTMCTTNATTTGTCAT 
AAGGTTTGGATATTAATTTCAAGGGGAGTTGAAATANTGNNAGATGGAGAANAGTGAATG 
AGNTTCTACCACTCTNTACTAATCTCACTATTTGNATTGAGCCCAAAATAACTATGAAAG 
GAGACCGAAAATTTGNGACAAAGGATTGNGAAGAGCTNTCCATNTTCATGATGTT
Sequence 2707 

NCACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGGGGAGTGGGGG 

TGAAGCGTGTTCTCTACATAGGCAACACAGCCGCCTAAGTCACAAAGTCAGTGGTCGGCC 

GAGGTACTTTGGTAAGGAGATAGGGAAGGAATTAAGGCTATTACTCTGAAGAAAGTTGGG 

GGGCCAGGGCTCCTATTTTTTTGCTGAGGAGATGGAAGATCAGGGCTTGTATTCAATAAG 

AATGGGAGGGGCCAGGGGATGCCTGGCAAAAGCCTTGCACTGTGAGGTGCAGGTAGAGGC 

TTTTATTCTGGTGAGAGGACATGGACTCTCTCTCTCCCCTCAGGTAACTGTGCCCTGTAC 

CTGCCCG 

Sequence 2708 

GM TTGGAG CTCCCCGCGGTGGCGGCCGAGGTCTNMTTNTTTTNCCCTTNNNNGAGACG 
ACCT7TTTTGGANATTTTTTTATTTTTTGGCAMTTTGAT 

TAATTTGGNTAAAAGCTGATTATGTCGTACAATGNCAAAGTCAGCTAACTGNCGTCTACT 

TAAGACTTNTGGNCATTTCCAACTTATAGAGGAAGGGNGNCTCTAAAATCTCTTCTTCAG 

AAGGCACCTCACTTNTCANACTTAAAANNCCACATCAAGTGTTCCATTAAAAGAAGATAN 

GGCATTCTGAGTGCAAACAAATGGGGGCTTNTTAAACTA 

Sequence 2709 

AGGTACATGTCTCTTCTCCTCTGACTCTGGGAATGMGATTTTTCTACTCCCACAGGCTT 

GAACTCTCCTTATAGGGAGTGGTCTTNCACATGGCCCAAAAATCAAGAAGGAAGGTCAAG 

AATAAAAACCTCCCAAGGCTTGAAAACTAAGAGCTGGCACGTAAGTACCTGCCCGGGCGG 

CCCGCTTCTAGAACTAGTGGGATCCCCCGGGCTGCAGnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnCGTCGACCTCGAGGGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnAGTGAG 

GGGTTMTTGGCGCCGCTTTGGCGTAATTCATGGGTCATTAGCTGTTTTTCCTGTGTGGA 

MTTGTTATTCCGCTTCACAATTCCACCACCAACATACCGAGCCCGGGGAGCATTAAAGT 

GTAAAAGCCTGGG 

Sequence 2710 

CCGCGGTGGCGGCCGAGGTACACGAGAANAGGAGACCCCAACACGCTTCAAGCTTTGAGT 

GGAGAGGACACAGCCTCTGCTGNGACAGGGAACANAGGGATGCGGAGACCCTGAAGATGC 

TTTTGGACAGTGGTCTGAGGTTGGGACAGTGGCAGGAGATACCATTCACCCAGGATCTCC 

AGG ACAAG AGATCAGCCTGGCAGTTACATGTG I I I I I I I I CAAACTGGTTGCCAGGTTGG 

CATGAGCGATGACATCAGGAGATTCCGACCTTTTCTTATTGGANGGGACCGGACTNTGTN 

GGAC CTGGGGAGTTNAGTTGGACAAANAAGAACCTTTTAAMGGGGNTTTCNNTTTTGNT 

TTTTCCCCCCCNCNAMTTm"AAAANACCTTTTTTNTTTNCC 

CCCNGCCCCCCCCNNNNTNTTTTTTGNGGGGGANNGGGNNANTTTTTGGANAAAAATTTT 

TNTCCCCCCCCCCCCCCCC 

Sequence 2711

CCGGG CAGGTACCTTAT TACAT ATGATTTTTATTAGTTTCTGGAG 

ATTTTAAMTCAAMTCTATTTTAAAAGAAATAGTTNTCAAAAAGACAACNGATGACTGG 

GTGNGGNGGTGNGTGCCTGTAGNTCAGGCTGCTCGGGAGACAGAGGCAGGAGAACCACTT 

GAGGCCAGTTCAGTCTAGCCTGGGTAACATAGCAGGACCCTGTCCCTAAAATAATAAAAA 

AATTTAAAAACCACAATAAATGTGAGTTACAAAAAAAGTGTAACTTATGAAAAGGTCCGT

AATTTTATTATTGGACCCTTTGTTMGGAGCTCMNTMTTTTCAGGGAAGCAAGGGAGG 

TATCACCCATTTCTTGAGTTNAATACCAMTTTTCAMCCTAMTCTTTTAATTT^ 

Sequence 2712

CCGGGCAGGTACCCCCTGACACTCCAGAGCTACTGAGAAATACTCCTTTGTCTTTCTGCA 
CTTGTAGCCAGGCTGAGCTTTGTGAAAGMGATTGTGTATGTGTGTACTTTCTTTTTTT 
Sequence 2713 

CCGGGCAGGTGCGCGGGTGGGTGACGCTGGTGAANTGGCCCANGGAAGTGCACATGTCTC 
TNCCTGCTCTTCCAGGGTGATTTTTTTGATGTTTGGCTCTTGGTCCTTGTTTCCACTGGC 
TTTCCATCCCCATGGGGGAGAAACAGTGGCTCCTGGGAGCAGAAAAGGAATTGAGGTGGG 
CAGGCAGAAGAGCCTGGATTGCTCACTGTTTTGGGAAACTTACTTNNAGAWNTANANAAG 
ATCCNGGAAMCCCAAAANCCTTCTTGACCMACCATTTTTTGGGG I 1 1 I I 1 1 I I GNGGG
CCNAAAAAI I I I I! I I I I I I AAAAAAAAAAAGNNNGTTGNGGANAAAAACCCCGGGGGGG 
GGGGGGGGTTTC I I I 1 I I I 1 1 1 NGTNGGGGGGGGCCCNNAAAAAAANGGGGTTNAAAACN 
CCCTTTTTTNCCCCCCNNNNNNNNNNN I I I 1 1 1 1 I I I I NAAAAAAAAAAAAAATTTTTCC 
CCNNCAAAAANNANNNNNTTTTTTNTTNNNCNNN I I I I I I I I I I I M NGGGGGGGGGGGN 
NNAAAAAAAAAAATN 
Sequence 2714 

CCNGGCAGGTACNCGGGGGCGCGTGAGCAGCTGCAGCGGCAGAGGCAGCATCCAGCGGCG 

GCGCCAGCAGTTCCAGTCCGTTGCTTTACT^TTTTGCTTCACCCGACANTTAGTC^^TTA 

ATGCCCCGAAAGNAGGAAAAGGTCCTTCCCAGGAGGAAATAACNAGTAGGGGGNCAAAAA 

NGAATNGGGATTCCCCAAAAAAGGGTAAAACCTTAAAAAANCAANNNGAANGCNCCCACC 

MAGGGAACGGGTTCTTGCCCCMGGAATTTGTTCAGNCGAAAAACCCTTNNCTTTTCCN 

ACCCCAAAAAAAACCCCTTGTNNANCCCCCCAANAAACCCCAAAAGGGAAAAAANAACAT

TTTTGCTTTTAGGAAAAGGGMCCCTNNTGTTNGGNNAACCCC^ 

GNNGGGGTTTGGCCCTTTAAAAAAGGGGGGAAAAAGNAAAAAGGGGAGGGGNNAAAAAAA 

AAGNNCCAAGGGGNAAAAAGGCCTTTTNGNAAAAAANGG 

Sequence 2715 

CAGTAAATGAACTAATCTACAAGCGTGGTTATGGCAAAATCAATAAGAAGCGAATTGCTT 

TGACAGATAACGCTTTGATTGCTCGATCTCTTGGTAAAATACCGGCAATCCCTCCNGCAT 

TGGGAGGGM TTTTGG ATTTTCCAATGGAGGMTTCCTMNTAACCTGGTTTTGGGNAAA 

AAMACCGNCCTTTTTCAAAAAAGGNANGGGNCCAAAAAATTAAAACCTTTTTTCCCCTT 

NGGTNGGGGGCCCCCCCTTTTTCCAAAAAMTTTTTGGGTTCCTTTTNCTTTCCCCM^ 

CGNAAAGGGGGTTNGGGGGAAAAATTGGGAAAAGGGAAAAAAAAAAAAAGGGAACCCCCA 

NACCCCCCCCAAAI I I I II 1 1 1 I I I GGGTTTAAAAGGAAAAAAAGGGGGGTTTGGGGGNG 

AAAGGAAANTTGNCCCTTGGGGCCAAACCAAGGGGGGGGAGGGGGGAACCCCCAANGNGA 

ATTCCAAAAACCCAAAGGGGGGCCCTTTTTAAATTTTTAANGGAAAAAAG^ 

NGGAAAAACCTTTAAAAAGGGGGNGGNGG7TCCTTTAACCCCCCATTGGGAAAANTTAAA 

I I I I I I I I I I 
Sequence 2716 

AC I I I I I I I I I I I I I I I I I I I GTCAGAAAAACTCCTTTATTACCATCTCCCTATTACATT 

TCTATTCTAGGGTTAGTGTTMTCTCAGGGTCTTATTTCTTTAAGCACNGCNAAAAAGGG 

CCTTTGGTAGGAAAGTTAGGTGGTAMGGTT1TTCMTCCTTTCNATTTTTNTAATCC 

ANGGGGAATTGGAAAAAATTCCANAAAGGNCCATAGGGGNAGGCCNTNGTTGGGACCTTA 

GGGGAN CMATT AACCAGGTTTAGGGTCCTTAGGGGGGCAATAAMCTTTTCCCCAAAAC 

CCMCCATTTTTMGGGGGGGAAAAAAAGGCCCCCCAAGTTTTTGGGGAAAACCCACCNA 

GGAGGCCTAATTAGGTTTTGGGGCCTTNCAAMCCCCCCTTTTMTTTTCCAAATTGG^ 

NCCAAATTTCCGGGTNGGTTTTTTTCNM 

I I 1 1 1 I I II I IGGGTNGGGGGNNAMAAGGTTTCCTTTCCTTTGGTTAAAGGNTTGGGAA 
NCCCCCAANCCCCCTTGGGAATTNMCCCTTNGGCCCCCTTGGGGGGGGGGAAGGGTTTT 
TTAAl I I I I I I I ICCC 

Sequence 2717 

AGGTAC I I I I I I I I I I I I C 1 1 I I I 1 1 1 1 I II I GGTGGGGGTGTATAGGTCTGGGGAGNGC 

CTTTCAGGTGCTGCTCCATAGACATGTGTGTGCCCTGTATAGAAACCGCCCCCCATTGTN 

TAGAAAAAACNNCCAGACNCTTNTCCCAAANAGNCNNGNTTCTTNACCCATGTAAANNGG 

AAGGNTGNGCGTAATTAGGAAAACCAAGGGCTCAACCANANAGGGCCAAACCAATTTAAT 

TAAMATTCCGGTTCTTGAAGACCCTTTGNCCAAMTGNATTACCTTGTCNCCCAATTTT 

ANAGGGGAAGGCCCCTATNTANNTTTNTNTCTCTTTGTCTGGTGGGCCCCCCCCAACTTT 

CAAGGTTTGCCTTNGGNCTNCTTACCTTAAAANTNAGNNAAACCAANGGCGGNCCAATNA 

AAAAANGGCCAAAAGGGGGNCTTTGGTTNAAAMMTTAAAAAAAAAAGGTTTTTANTTT 

TTAAN CC A ATTAAAAATT A 

Sequence 2718 
TCGCGGTGGCGGCCGAGGTACANAGTCTTTTGCTTCCTCCCACCCCTAGGGGGAAAAACT

GCTTTGTGCTTTGGGAAGTTGTCTCTGAAACCCGGTNGNACAGATTGTACCGTCATTGTA 

CCAGGACCTTACNGCANCGGNNAANGCNCCGGGNGCAGGGCATTNGGGGGNCCTTGGCCA 

AGTCATTGNTTNGNGAAATGNGGAAGGNGGGGATCTTTTACCAAGGAAAGGGGTATGGAA 

NGTTANGNAGGNTTCCCNGGTGTAAAGAAAAGGTCCCAATGTNATNAGGACCCCCCTTGG 

GAAGTTAAAANGTCCCCCCATGGTAAGCGGTCCCAAGAGGGTTNTTGTANTNTAANGGGA 

AGNGGNGATNTGNGGGNAAAAAAAGGGATTGNNAAGANCCCAAATTTACCGNNGGANATC 

CCTTTGGGGNGGNGCCCCTTTGGNGTGGAAAGAGCCCNCCCGATTNTAAANCCAAACCCG 

TGCCTTA CTTTCCCNTTCNCCNNMTTGGTTTTTAAAAAAATTANGNACNCAACCCCCCT 

i I I M I IMNGGGNAAAAAAAAMMAATTTTTTNCAAACCNAAAANGGGTTTTCCCCCC 

CCCCAAANTTNCCCAAACCAAAAAAAAAAAAAAAAAAAAATG 

Sequence 2719 

GCCGAGGTACAAACGGGTTTCCCACCTTGCCAGGGATCCTGGGGACAACAGATGTAAAAC 

TCCTGAGTCTCTGTGTGTGTGCCTGAGTGGCCAGTCTGCCAGAACTCCACACAGCTCTGT 

GTATTGAACCCAAGGCCTTGGTGGCCTGGGCTCATGAGCAGGTCTCCTGATCCATGGATT 

GCAGAGATCCATGGGAGMGCATGATTTCCCAGGCAGGGTCGCACATTCACTCATTGCTT 

CCCTTGGCTGGGGATTGGGGTTCCTTTGGCTCTGTGCCACTCCTGGGTGGGCCATTGCCT 

CACCCTTCTTTCCTCAGTTCTTCGTGGGTTGTTTGCCTAGTCAGTCACAATGTGAGAACC 

TGGCGATTTCAGTTAA 

Sequence 2720 

AGGTACACTCGCCAGCGGTTTTGCCACAGGAGTGTACGGGAACAAAGGAGACAGGCTCAT 

TTATAATCTGACGCGGNCACCCTNCTGCTGCGTTCGGTTTCCATTGGCTGGGACNGNACC 

TCACCTTCTGTATTTGTCCCGACTGGCTAGCACTTAGAACTTnTAAAAGAGGCAAAGGC 

ATACAGAGANCAAAGGAAGGAGGAAGTNACTTGTGGAATATTGAGAAAGGTAAAAACACC 

TTTAAATAAGGAAGAGGAACAGGCTATGACCTAATGCTTGTTNGGATCAGTATAAGCATG 

TTAGGGCAAATATTTANGCTAAATTGTGGGAGCTMGAACATAAAGTATATTGATTTTTT 

ATTATGGCTAGCA 

Sequence 2721 

AGGTACAATTTAA I ! ! I I CTGCTTGCCCNGGAAACAAAGCTTCTGTGGAACCATGGAAGA 

AGATGAAAATGAGACTGGCAAAGAACAAATGCTGAATCTGAAGAAGAGGACAACTTTGGG

CMAT AATCTGCATACTTTTAATTGGGAATAAGATGGAAAATATGAATGCTAAATCAAAT 

mm i AANNNATACACCACACGATACGACTCCCCGCGTACATCTTTGCTGTGGCTCACAG 

ATTGTTCCTCCCATTCCCCTTGCCGCTTTTTGCCTATCGATGGGTAGCAAGAGTCTTTGA 

AATAAGCCCATTTGAGCCCTGGATAACAAGGGATAAAGTGGAGCGGATGCACATCACAGA 

CATGAAATTGCCTCACC 

Sequence 2722 

CCGGGCAGGT AC I I I I I I I I I I I U U M CI I I I I I I I i II I 1 1 I I M I I H M I I I t M 

Tl I < I I I I M l < I I I I I If I i II I 1 1 I I II 1 1 I II I I I NTTTN I I I I I U I I I II I GGG 

GGGNAAAAAANNTTTTNNNAANANNNGANAAAAAANNGGGGGAACCNNNNGGGGGNNCNC 

NCCCCCNNA AAAAAA NGNNCCCCCCCCNNAANNNAAAAAANNANTTTTCAAAAANNAAAA 

AANG GGAAN I MM I ANCNAAAAAAANNTTTNTNTAAANNNNNNTNTNNGGGGGNGGNGN 

TTTTNAAAAAAAAAAAAAAAAAAAAAA 

Sequence 2723 

GAGAANGACACCATGTGCCTCAGAAACTGCTCGGTCAGACGGTGATAGCGAGCCACGCAT 

TCACAGGGCCACTGCTGCTCACAGAAGCAGTGAGGATGATGCCAGGATGATGTCTGCCTC 

GCGCCTGGCTGGGACTCTGATCCCAGCCATGGCCTTCCTCTCCTGCGTGAGACCAGAAAG 

CTGGGAGCCCTGCGTGGAGGTATGTGGCTGGAGTCAGCTCCTCTGAACTTTCCCTCACTT 

CTGCCCAGAACTTCTCACTGTGTGCCCTGGTTTGTTTATTTTTGCAAAAAAAAAAAAAAA 

AAAAGTACCTGCCCG 

Sequence 2724 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTTCCAATGA 
ATGGTAACTGATCCAGGCACGTTATCACACTTCCTAGTCATCTCCACCTTTCCTGTATTG
CCTGTGGCTTGTTGTTTAAGATTAAGAATCAAAGAGATTAAGAAGTATCACTTCAAGTCT 
TGCTCTGCTCACTTCTATGTTTGCAGTCAAATTATTCCTTATGTTGGTGACCTAAAGAGA 
ATTACTTTCATTCATTTCATTTCCCCCGTAGCAGATGGAAGTGAGAAACCTCTGAGAAAA 
TGAAMCATCCTTMCCACTATCTTTC^ 

CAAAAGI I II I I ICTCCTCCTTCTCTTCCTTGTTGCTTMCTTTTTAATTCATGCCATAT 
GCAGATATCCAATTATGTGCATCCTGTGAATAAACCACGTCTTGGTCACTGTCATATTTT 
GAACCATCTCATCAGAGATGAATAATA 
Sequence 2725 

CCGCGGTGGCGGCCGAGGTACGTATTACTGTTTCCCATTATATGTTACAAATGAAATGAA 
CACAT TCTCATMGTTAAAAAAATATAGAATATATATTTTTCTTTTTCCTTTTC I I I I 1 1 
TTTTTCTGAGACAGAGTCTCGTTCTGTCACCCAGGCTGGAGGCTAAGGTGGGAGGATCAC 
TTGAGGCCAGGAGTTTGAGATCAGCCTGAGCAACATAGTGAGACCCCCATCTCTAAAAAA 
AAAATAAGAAATAAAAATCAGCGACGAGGCATAGCGGCTCATGCCTGTAATCCCAGCACT 
TTGGGAGGCCAAGGCAGGCAGGTCGCTTGAGGTCAGGAGTTTAAGACCAGTCTGGCCAAA 
TGGGGGAMCCCCTTTTTTCTTTAAAAAANTNCCI I I I 1 I 1 rCCCCCCCATNGGGGGAAN 
NCCCCNTTTTTCCCCNCCCNCCNNNGGGGGGGGGGGGGGGGGAACCCCCCCCCCCCC 
Sequence 2726 

CCGCGGTGGCGGCCGCCCGGGCAGGTAC I I 1 I I I I I I I I 1 I I I I 11 I 1 I 1 I CCAAAACAA 

MCATGCTTAGCATGCACACTTTTACCACTTTTTTCGAGNGGAMGTTTATTGGCAATAT 

TAAATTTCACCCTAG ATAGG ATATGAGMTGTTTTGATAAATCACAATTTATAGTATATT 

AATGCCATGTGAGAATTTTGTTTCCCAAGTAAGAGCTCACATGGAACTTGGTCATTAAAC 

CTTAAAGAAACCTTTCTCACATATCTATAGGCCTCAAATTGAAATAATCTATAAATGAAT

TTGTAG ATTTC I I I I I AGmMTTCCTGAGTATACAGGGCAAAAGCTTATNTCCTTTAT 

ATAMCTTCTGCTTTGGTCTAAMCTGATATATCTTCACGTTGAGGTTTCATCTGAAATG 

CACCACGTTTGCTGACTTGCTTCAATATGAATTTGTATGGCTATAAAATTGNGC 

Sequence 2727 

CCGCGGTGGCGGCCGAGGTA C I I M I I II I I I 1 1 11 I I I I I M II I CGGGACTTTAGATA 

GCTGGATTATTCTTCTTTTCTCTTTTTGTCTTTCATGTAGTTATTAATGAGATGGATAT^ 

AAACCAACTACTAGGTNCATATCCCCAAAAAATGAAGTCAATATGTTGAAGAGATATCTG 

AACTCTCATGATTATTGCAAGACTATTCACAATAGCCAAGATNGGGAATCAACCTAAGTA 

TCCATCAACAGATGAGGNGATAAAGATAATGTAGCATATNTATACAACAGAATNTATTCA 

GCCTTAACANCAACAAAATAAATNTTGTCATTTAAGACAACACAGATAAACCT 

Sequence 2728 

GCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACAACTCCACTAACCTGGGAACCTT 

GCCCCCATCCCCCATAGCANCCACAGCGAGACCTGCCCAAGGAGAGTCTGAGCTCAGACA 

TGCCTAGCCCTGCCCCAACTTGATGGGCCTTCCTATCTACCCTGGTAGCTGAAGGCAAAG 

GACATATACCCTTGGGAGTTCTAGGGCCCCGCCCATCGCCAGTTCCTCTCCATACTACCA 

CAGCTGATGCTCTCTGGGAAGTGCCACCTCCCAGCAGCAGGCCAATCAGCACAAAAATAG 

AACATTAAACCACCAAAGCTAANAACCCTCAGAGAATCCATTTACCCCCCT 

Sequence 2729 

CCGCGGTGGCGGCCGAGGTAC I I I I I II I I I I I I I I I I I CCCTGTATTTTTCTTTTTCCC 

GAGATGAAGTCATGCTCTGTTGCTCAGGCTGGAGTGCAGTAGTGTGATCTCGGCTCACTG 

CAACCTCCGCTTCCCAGGTTCAAGCGATTATCCTGCCTCAGCCTCCCAAGTAGCTGTGAT 

TACAGGCATGCATCACCATGTCTGGCTAGTTTTTGTATTTTTAGTAGAGACAGAGCTTCA 

CCATGTTTGCCAGGCTGGTCTCAAACCCCTGACTTCAGGTGATCCACCTGCCTCAGCCTC 

CTAGAGTGCTGGGAtTACAGGMTGAGCGACCACACCTGGTTTGTTTCTTAAAAAACATC 

TTATATTTCTCTGCTTAACGNGCTCMTGTTGAACATGTGAAATATAATAACTTTTCATA 

ATCTCTTCTGATTCTGGCTCTATATCATCTTTGGGGNCTGGTTTTAANGGMTTCTAATT 

TTCTTGGCAAT 

Sequence 2730 
CTATAGGGCGMTTGGAGCTCCCCGNGGTGGCGGCCGAGGTACACGAGTCTTTTGCTTNC

TCCCACCCCTAGGGGGAAAAACTGCTTTGTGCTTTGGGAAGTTGTCTGTGAAACCCGGGG 

ACAGAGGACNCANGACAGACTAGGANNGGAGCCGNGAGGATGGGCTGCANCTGTNGAGGA 

GGGTTTCANAGGANAGNGGTCNGATAGCACCAGGCCTGAGAAGCCATAGCCTAGGTGGAN 

AGAGGTTTGAAAGNGACANAGCGGGCTGATTANCTGCCGTANACNCTTTNATNCCATGTT 

ANATAAACATNNTTNAAAAACCCTTCNTNTTGTTCCCAATNNTANAAAAATTAANCCCGG 

GCNATGGGGGGGCNTGCCCCCTNGTNATTCNCTNTTTTAATNAANAGAAAGGGNTTGNCC 

CCGGGTNANTTTTANTTTTNTNACTTTNNGGGNGGTTTNTNTTTTA^ 

ANAATTGGGGNGNTNTTCAMNNNATTTNNTTGTTNNMTNTTTTTATATTN I ITTTTn 

TTNANCTNNCCNTNNNAACAAAA I I 1 1 II I 1 1 NTNTNNCATATAAAAAAA 

Sequence 2731 

GGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACTAGTTATTTTAAATTCCACTCATA 

ACTTATCGGCCAAAAGTAGTCACATGGGTCCACNTAATNACAAGNGGAGCGGGAAGTGCA 

ATCCTACCTTGCCTGGGGAAGGTATAGAGATAGACCAGCNCTAATGACTACCACACTTNG 

CTAAGGTNACATAATAAAf AAGCATCAGGACATTATGTGTGGNGGCTCATGTCTATAATC 

CCAGCNCTT 

Sequence 2732 

CTATAGGGCGNATTGNAGCTCCCCGCGGTGGCGGCCGTGNGCNCGGAGNTGGTATTGACA 

TAGCCTTTGTAGAAACAGTGCTTGAGTTCCGCTTCNTCTTCGGAATAAAACTTGGTCTGA 

TTCACCCCGGGCGTCCCGAGGAGGGTGACAGTGAACAGTGGAGCGATAAATCCGGCATTG 

GCGGTGAGATTAAA 

Sequence 2733 

AGGCGGGCGGCNGCCNCNCNCCNGGTACCTGATAAAAATTTANTNCTCCTTGGCCAGGCA 

TGGNNNCTCACACCNGTAATCCCAGCACTTTGGGAGGCCAAGGCCAGCANGTTGCTTGAG 

CTCAGGAGTTTGAGACCAGCCTGAGCAATATGGCGAAACCCCATCTCTACAAAATATACA 

AACATTANCCAGGTGTGGTGGCNAACGCTTTCAGTCCGAGCTACTGATGAGGCTGAAGTG 

GGAGGATGGCTTGAGCCTGGGAAGTGGAGGTTTCAGTTGAGCTTGAGTTTNATGCCATTG 

MCTCCAGCCTTGGGCGACGGGNAGACNCTNTTTTNAAAGAATTTGGAAAAAAAAAAAGG 

GAAAAAAAAAANNNNANGNCCCTTGGGCCGNNATTTNAAAAAANATGCNCCNCCCCCCCC 

GGGGGNTTTNNGAMTTTTNANTTTTANAAGTTTTTT 

GGGGGGGNGCCCCNNAANNNC I I I 1 1 I I I I I I TATTTAAGGGGGANGGNCCCCCCCCNAA 
AAAAAMNATNTTTNTTTTNTTTNTNNNCCCNTTNNNNANNNTANAAAAAAAAAAAAAGN 
GGGGGGGGGGGGGG 
Sequence 2734 

CCGCGGTGGCGGCCGAGGTAC I I I I I I I I I I II I I I I II I I 1 1 I I ATGAGATGGAGTCTC 

GCTCTGTTGCCCAGGCTGGNGTGCAGTGGTGTGATCTCAGCTCACTGCAAGCTCCACCTC 

CTGGGTTCACGCCATTCTCCTGCCTCAGCTTCCCAAGTAGCTGGGACTACAGGCACCTGC 

CACCACACCCGGCTAATTTTGGTTTTGTATTTTTAGTAGAGACAGGGTTTCACTGTGTTA 

GCCAGGATGGTCTCGATCTCCCGACCTNGTGATCCGCCCGCCTCAGCCTCCCAAAGTGCT 

GGGATTACAAGTATGAGCCACTGAGCCCGGCCTCTCTGTAGCTTTTAAGATGTTCTTAGG 

TGACTTATGAGAATGAAMAATGGAGAATTTTCCGTCTTCCTGCCATGAAATCAATTAAT 

GGCAATTGCTACTGAAAGCTGTTCTGTTTTT 

Sequence 2735 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC I rTTTTTTTTTTTTl ri 'GG 

GGGGTTGCTATTATTGTTATTGTTTCTAGATGAATGTGNGAAGCCAAATTATTCAGTGTC 

TTAAAGACCATATTAAAAACTCCTGCCAGGCGCAGCAGCACGTGCCCATGATCCCAACTA 

CTTGGGAGGCTGAGGCAGAAGGATCGCCTGAGTCCAGAAATGCTGGGCTATAGTGCACTA 

AGTCAATTGGGTATCTGCACTAAGTTCGGCATAAACGTGAGGGACCACAAGGTTGCCTAA 

GGAGAGGTGAACCAGACCAGCCTGGAAACAGAGCAAGTCAAAACTCCTGTCCTGATGAAG 

TAGTGGGACTGCACCTGTGCATAACCACTGTACCTGCCCG 
Sequence 2736 
GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGACAGACGAGATCTCGAT
CGAAGGCGAGATGGCGGACGTGCTAGATCTTCACGAGGCTGGGGGCGAAGATTTCGCCAT 
GGATGAGGATGGGGACNAGAGCATTCACAAACTGAAAGA 
Sequence 2737 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTATTAAAAAAACAGTTTAT 

GTGCAAGGTGTATAAGAAAAGTAAAACATACCTTTGGTAAAAAGATTATAAAGGGGCATA 

AGMTGTGGATTTTTACCTACATTAMGGGTTMAMCMTTATTGTTTTAAM 

GCMGTTTTAAAACGTTMTTATAAAGMAATTCTGTGTGTAAACATATTAGCTAAAGTT 

AAMAGGTATCATCCAGTTTTTCTGTGAACTGGACATTAAAGTAAAAAATGCCACAGGTT 

TTTCTTAMGCATCAACCTGCTCTTTAACAAAAATTATAAAAGGTTAAAAAGAGTCTATA 

MATCTTACCTTATGGTCAAACATGAAAMTTGGATAMTATGTCTCMGGGTTTATTAA 

MTTTCAGTTTAAMTTMTMCACACTMTATAMGGTAAAATTTAGCTTATCTGGTAT 

AAAAATCATACNAGAAACATTATTAAATATNAAATGGGGT 

Sequence 2738 

TCTAG GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACI I I I t M 1 1 I I I I 1 II 1 1 

I II I II M IGAGACAGNCTCCCTCTGTCACCTAGGCTGGAGNGCAGTGGNACGATCTTGG 

GTCACTGAAACCTCCGCCTACTGGGTTCAAGCAGGTCTCCTGCCTCAGCCTCCCCAGTCG 

CTGGGATTACAGGCACATGCCACCACACCTGGCTMTTTTTGTATTTTTAGTAGAGACGG 

GGGTCTNACCATGTTGGCCAGGCTGGTCTTGAACTCCTGACCTTAGCTGATCCACCTGCC 

CTGGGCTCCCAAAGNGCTGGAATTACAGGCGTGAACCACTGNACCCAGCCCNTTGACCTG 

TTlvnTTATTATTTTGNGGTTMTGCCMATNTAAMTMTGTTTATGTATAAAGCCCCAT 

NTCACAGGGGGGAANTTTTTTTAAAACAANAATTCC I I I I I I 1 1 I AAGG AAAAAAAGGTT 

TGTNTTTTTGTTGCANCCTTNTTTATTCN^ 

ATTCCI 1 1 I I I ICC 

Sequence 2739 

TNTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTACGAAGCCATCTTGGCT 

CTGTGGAACCAGCTCTACATCAACATGAAGAGCCTGGTGTNCTGGCACTACTGCATGATT 

GACATAGAGAAGATCAGGGCCATGACAATCGCCAAGGTATGTCCTCAGGGCCACTTAGGC 

TGCCTGGAGGGAGGGCAGCGCTGCCCCCCGCAGTGCCTGTGTCCAACAGTTCAACCTTCT 

TGCTGTGTAGCAGTGCTTTTGTGTCTCGTNAAGCAAGTCAGCTCACCCCTCCTTAGAGGT 

TCTGGTCTGTCCAATAGAGAACGGGNGGGATTAGCATATGGCTGATTATGAGAGAAAGAA 

GCMTNCTMTTTAGGGTGGCCTGACAAGCAAGCCAGAATTGCCTGTGGAAAAGTTATTG 

CACTCCTGTAAGAATTCTGGACCCTATTGCTCCTTTGATGTAATGGAAAGTTAG 

Sequence 2740 

TNTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGAAGTGA 

ACTGAGGGCCACCCTGGGAGGAAGCCGACTAGGCGAATTCACTTACTGACCGGCCTGGGC 

TGCTCTGAGACATGGAGGAAGCCAGTGAAGGTGGAGGAAATGATCGTGTGCGGAACCTGC 

AAAGTGGGGGTGGGGGGAGTTAANANTATTATCCCCANATTGNGGGGCGGGATCCTGCCC 

CGGGGGGAAAACTTTGGAACATCTCCGCAATAAGACAGA 

Sequence 2741 

GGGCANGGTACCCACTTGGATGACTGGGGAGAAGGGCTGGGCTGCTGCTGGGGAGTAACA 

CAGGCCTTGGGGCAGGGTTCAGGAGTTCATTAGTCTGGAGTCCAGATCGCCACCCAGGGC 

CCAGCCTGATGTAGTGTTCGCGTTCTCTCAGCGCTGCAGTTTTCCGATAAAGGAGAGGAC 

TCCTGTGTGCCAGAGCTCTGAATGGGAGCCTCTTCTCAGTCCAGCCAGGCAGAGGGTGAG 

GCTGCCACCTTATGGCCACTGGGGGAATTGGCTCTGGGCTTGGACTCCAATAAGGGGCCG 

GGAGCTGCAGAGACCTCCAAAAGGTCTCTTAACTAGTCCTCCAAA 

Sequence 2742 

CCGCGGTGGCGGCCGAGGTACTGCCATACCTGGCTAATTTATTTTTTGTGGAGATGGGAT 
CTCACTTTGTTGTCCAATCTGTTCTCAAACTCTTGGCTTCAAGTGAGCCTCCTGCCTCTG 
CCTCCCAAAATATTGGAATTATTGGCATGAGTCACCATGCCAGATCAAGAAAATATTTAT 
GTATAATTTTATCATACCTCATTGGTCCTAATGTTTTTTTGCTTGTTTAGGTCCCTTCT^ 
GAGATAGGAGAAGAGAGAGATCCCTTTCTCGGGGAGAGAAATNCAAGCCGTCCCGATCCT
CTTAGGGCTNGGA GGTAA MTCNTTTGATMCTTTGTATTNAAAACTTTGCATCCATAGT 
ATGCTMGGCNTTNTTTTANCCCAMAMTTTTNCCTTAAAAGTTrm 
AAAAAGGGCCCCNAANACCCAAGNNAAAGGGNNGGNNC I HUM I T 
Sequence 2743 

CCGGGCAGGTAC I 11 1 I If < I I 1 M I I I 1 I TTTC 1 1 I 1 1 1 I H I 1 1 11 I H II 1 1 I 1 1 11 

in M I I I I 1 1 M I 11 I M I M 1 1 1 1 I 1 1 I I I I I I I I I 1 I I M I 1 I 1 I 1 I I I 1 1 1 

mn iCCCCNGGNNNNAAAAATTTTAANNGNNNGGAAAAAAAAAAANCNGGGGAAAANG 

NNCCNTTNNTNNNAANCCNTNAANNAAAAAAANNCCNNNAAAANNAAAAAAAAACCCCAA 

AANNNNNNTNNNAAAAAAANCNTNGGNNNAAAAAAAAAGGGGGGNGNNANTTTNNNNCCC 

CAAAAAAANGGTNNGGNNGGGAAAAAATTNNTTNNCAANCCANAANTTNANTNCAAAAAN 

CCNTNNTTCCCNTGNCCNTNNAAAAAAANGGGNACCCCCNGGGGGGANANAATANNAAAA 

AAATTTANTTTAACCCCCCCCCCAA 

Sequence 2744 

GGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC I 1 1 I I II I 1 1 1 1 1 1 1GAGACGGA 

GTCTTGCTCTGTCGCCCAGGCTGGAGTGCAGTGGTGCGATCTCGGCCCACTGCAAGCTTC 

GCCTCCTGGGTTCACACCATTCTCCTGCCTCAGCCTACTGAGTAGCTGAGACTACAGGCA 

CCCGCCACCACGCCCGGCTMTTTTTTGTATTTTTAGNAGAGAGGGCGTTTCACTGTGTT 

AGCCAGGATCGTCTTGATCTCCTGACCTCGNGATCTGCTCGCCTCGGCCTCCCAAAGTGC 

TGGGCTTACAGGCGTGAGCCACCACACCCGGCCCAGTCTTGTGTCTTAAATACCATTTCC 

ACATTGACACCTCCAGAATATTTATCTCTAAACCTGACCTCCAGAAGGTGCAAGCANATT 

GTGTAAACCATGTCTCCACTACACGCCTAAAGGCA 

Sequence 2745 

CCGGGAGGTGTCCGAACAGGCAGGTTGGTGGGTTAAAGGTCTTAATCTTGACTCGAGATC 

TCTCCCCGGAGTTCACAGAGTAGGCGACGAAGCCGAAGCAGCTGGAGCGCGACCCGGAGG 

AGTCTGACTTCTCGTTGTCTTCATAATTTTCATTCGTTGCTTTCTTCATGGACTTGCGGC 

TGGGGGAGGATCCCCGCTGGTCGCCGAGCANGCGGGCGGGTAAAGGTAGGCCGCGAGAGC 

CAGGTTATNGAGAGGAGAGGAGGC 

Sequence 2746 

A GGTAC CTGTGACTAACAAGGGGTCTGGGAGGATCTGCTGCTCCCATGCCCTCCTTTGTG 

TGTTrfAAATCtGTTTGAGCCTTCTGGGCTCCTGCGAATTAGGGAGTGGCAGCTCCTCAG 

TCTMCTCCTATTGNGACCAGGTTGCCTMTTGGCCCTTTGGTTTGGGCACCCACTGTCC 

TTCTGCGTGGTTGGATAGATGCTGCTCCCAATGTCCCTGATCTCTTACAGACCCCTCTGA 

TTCTTCACTCTTGGCTTTGAGAGCCCCTGATGCCCTGCAGTCTTGACTGAGCTTCTAATG 

GTTGATCAGACCCTTGAATGTTGAGCTCTTTCCATACTAGACTTGAATATTCTCCTG 

Sequence 2747 

GGCGGCCGCCCGGGCANfGGTACANGACATTTTCAMGTTGCCAGTGTTACTTTAATTGGA 

CTGCCTTCGTAATTCATTGCCTCTGCTTCAACAATGTGCAACTCATCCTTTGCACCAGCC 

CCTAMCTGACCGTTCTTAMGATAGCTGGTGCTCATTTTCATCATTATCCACCTTAAAG 

NGATAATCTTTGTCGGCCTTTAGTTCACAACCGAAAAGATAGTTCTGGGGCCTNAGGGGG 

CTCATGTCCATGTCCATCGAATCTTCCATNGGGNGGCGGCACNCNNTTTTTNTAGNAAAG 

AAGGCGGNCGGAANATAAAAAAAACTNTNCTCCAAAAAAACACCCGGNGCANGGAGGGNA 

NTCANACCAGGGGCCCCCGTTTNCCTNGGATCNTTTNATAAAAAAATNGGGGGNCCCCCN 

CNGNGNNNGGNNGNAMTTCTAATNNTAMNTTTTTANCCCCCCCCCCCCCCCCTNGGGG 
GGG 

Sequence 2748 

CCGGGCAGGACTTGGGAAGCTGAGGCATAAGAATCACTTGAACCCGGGAGGTGGAGGTTG 

CAGAGTGAGCCAAGATCGCAGCACTGCACTCTAGTCTGGGTGACAGAGCAAGACTCTGTC 

TCAAAAAAAAAAAGACTAGAGAATGTCAGGGAACACATGTGTATATTTAAACAACTTCAC 

TTTGCATTTAAAAACTCGAAGGACAGCGAAGGTGAAATCAATTCACGGGCCACCTAAACT 

TTTCAGTCTAGGACTNAGAGCTTGTANCCATNGATCTGTNGGGNTTGNACCTCGGGCCGN 
TTCTANAACTAGTNGNATCCCCCGGCT
Sequence 2749 

CCGGGCAGGTACAACATGAGACATGACGCCCTTCGGGACACATGCCTGAGGTAGTGACAA 

TCCMCTTTG GAAGAG TGGMGCCCTAGTTTCAMTTCMGCATGCTTTGAGTATAAATT 

AAGTTTACCTCTTTTTGCACAGCAACATGGCCMTCTTTCCTAAGCTGCTCAGCTTACAA 

GAAA AGGAATCAT ACTGC TAAGAATTCAAACTTCAGCAGTCATAGGTAAAGTAAGGGAAG 

TTTTN TA AACCNT ATTTTTAGCCCCCNTACCCNGMCCCTNGNAMTTTTNGCNAGGGTT 

TTTTTCAATTmCNAGGGACAGGTTGGGGTTTCNCTTTAMTCCANAGGGCCm 

NACCNTGGAANAAACCAGACCCTTTTAAAAAAGG 

Sequence 2750 

CCGCGGTGGCGGGGTTTCGCCATGTTGGCCAGTCTGGTCTTGAACTCCTGACCTCAAGTG 

ATCCACCCACCTTAGCCTCCCACAGTGCTGGGATTACAGGCATGAGTCACCACACCCGGC 

CAGTAGAACTCTTTAAACCTGAAAAATCAGTCAACTTTGCAGACTAGAGGAGGATGTTGA 

ACACCTATGTGTGTA I MINI I CTTTACCAACTATGCACCTATTTTTCAGACACCTAAA 

AGTMTGTCTGTGAAACAGTGGGTTTTCTTTTT 

Sequence 2751 

CCGGGCGAATTGGAGCTCCCCCCNGTGGCGGNCGCCCGGGCAGGTACAATGCTTATAAAA 
TTCMTAATTTGTATTAAAATACAAAATCCNATAACAACCAGGAGTTCTTCGGAAGAAAA 
AAAAAATCACAAAACAACCCCAACAGTGGTGAAGAACTA 
Sequence 2752 

TA GGGC GAATTGGAGCTCCCCGCGGTGGCGGCCAAAAGCTCCCAGATTATTTGGAAACCA 

TGTTTTCTTCCTAGTCCATGGTAACAAGAAAAAGCCANTGGAAGCATCATTCCAAACAAT 

AATCTCCAAAGATGGTGGCAACCAAGTGTCAAATGGGGACTGCAGGCACAGAAGAGACCA 

CCCCAAACCCTGCCTGGGTGGACGAAGCAGGTATGCTAGAATAGTCCTGTCCTGCAGAAT 

AGGGAACGGCAGCTTGGTCGATCTGTGCCCTGGAAAAAGAAAATGAGTTGCAATAGAAGT 

GACTNTAAGACAGACAATGAACCTACTNTTAAGAGAGACAGGGCCAGGCACGGTGGCTCA 

CGCCTGTATCCCAGCACTTTGGGAGGCTTNAGGCGG 

Sequence 2753 

GGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGCAGAGACAATGGAATAAACAGCCAAG 

AATGTTAAAATAGCTATTATAAATATGATCCTTATGCTCATAGAGAAAAGGAAAAATATA 

AGTATAATGTGAAAAGAAATGGAAGATATAAAAAGAAATGCAACTTCTAGAGGTGAAAAA 

TATGTCTGAAATGAAAACACCATATAGATGGAATTACCAGGAAATTAGACACTGCAGAAG 

AAAAAAATCCATGATGTTGAAGTATATTGCAATTAAAACTATCCAAAATGAAACTGAGAG 

GGGGAAAAAAGCCCCTGAAATGTATGAAGAAAGCCTCAGTGACCTGTGAGACAATATCAC 

AATGGCCTAACATAAGTGTAATTTGGAATCCCCAGAAGGGGTAGCAGGCCCGAAAAAAAT 

AATTGTTTG AAT AAAG G G NCAAA AG GTTTTCCC 

Sequence 2754 

CTATAGG GCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTA I i l l I I I I I I I i 1 

i l I"! I I iGAGANAGTGNGTGACTNTGNCACCAAGGCTGGAGGGNNGNGGNAAGANCATAG 

N TCACTGAAGCCTCCATNTN TGGAGTTNAAGGGATCCTCCCGCCTCANTCTCCNAAGCTA 

AAI I I I I II I I I n I I I I I I 1 1 AGNAANANACATGGTTTCACTATNTTTGCCCAGGCCAG 

GTCT AMCCTTNCTNACTTNAANGCAANTNNCTTCTNCCTTTAANCCTTCCCAAAAGGGN 

TTTTGGNATANNNT NANACN CAGNCCCTTNTNCCCTGGGCTCTTTCNTTNTTAAAGAAAG 

GACAC TTTAM TCANTTTTTNCNCNCNCAGAGATNAATTGNGCAACAAACANTATTANGT 

TGGGGTTTTATTTNAGGGTTCCTNNCCCGGNNNGGTCCCTTTNANAAAANNTNNGGGAAN 

CCCCCCCGGNGTNTGGGGGGANTTNTNANNNTNCAANTTTATTTTATTNCCCCCCCCCCC 

NTCG 

Sequence 2755 

CGCGGGGGCGGCCGGGAANGGNCCNGGAANGGNAGTCANGCAGGGAGCGTCTGTCCGAAC 

GGAGGCTAGGTAAGAATATTTCACCATGAAAATGTTAAAAGACATAAAGGAAGGAGCTAA 

ACAATATGGACCCANCTCTCCTTATATGAGAACGTTATTAGATTCCATTGCTCGTGGAAA 
TAGACTTATTCCTTATGATTGGGAMTTTTACCTAMTCTTCCCTTTCACCCTCTCAGTA

TCTACAGTTTAAAACCTGGTGGATTGATGGAGTACCTCGGCCGCTCTAGAACTAGGGGGA 

TCCC 

Sequence 2756 

CTACTTAGGGCGAATT GGAGCTCCCCGCGGTGGCGGCCGAGGTACI 1 1 I 1 1 I I I 1 1 1 1 I I 
n i i i i i i I I I I I II 1 1 IGGAGNGAAACAGGAGTGCTTTATGGTCTGAGTGGAGTGTTTG 
GGAGGAGTNCCTCCCGGNTCCTGCCTTGGGGCTCACCTCCCG 
Sequence 2757 

CCGCGGTGGCGGCCGAGGTACATCTTCTCCTAAAAACAAGGGTAGAGCCAATGGAAAGTA 

ATGGTTCTG7TACATAGAATGAGTTGTCGCGTTGATC7TAAATGATGTATTGGTAGATAT 

ACTTCCCAAGTGGATTAA AAAG TTAAAACTTACAGCATAACAAAGTATTAGACTTACTGA 

GGTG ACTTGM TATCTCCTTTTGATTTTCACTCTATTTTTCTTTTCACCCATGGGA 

GATMTTTTTTAATAMCCMGGCTCTTACCATAGCTGMCTTTAAAACTTAGACTGTCT 

TTTCTGTAAACGATTCTGAGGCAAAGGGAAATGACTAGAAGAGGATGAGTAAACAATAAC 

CTGAAATGGGAAACTCGAGGGAAGCACAGGC I MUM \ GGTTTGTTTTGGTTGATNCGT 

TTTTTGGTCTTTG 

Sequence 2758 

CTAC7TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGAGGCCGGTGGGCT 

CCCTGACATCCCTTCCAGGCAACCTGAAAGCACTGAAATAGCTTATGGCCCTGTGCCAGG 

GACCTTGGCCCAAGCTGCTGACCTCCAGGGTGGGGAGGGAGCTACCCCCAGGAGAAGAGT 

CACTCAGACAGCAGTATGAGCAAGCCAGCCAGCAGCTCCGTGCCTGCACCCAGCTCAGGG 

GAATCCCAGGGGGTTCAGATGCCCAGGAAGGAAAAGGGGACAGCGCTACTGCTATGGAAT 

GAGACCACCACTTCTCCTGTTGTCCTTCCCAGCTTCTCCCCAACCTCCCCTTTTCCTAGT 

TTATAAGACAGGAGAAAAGGGGAGAAAGCAAAAAGCTGGAAAGAAACAGAAGTAAGATAA 

ATAGCTAGACGACCTTGGCGCCCCACCTGGCCTGGGNGGG 

Sequence 2759 

CCGCGGTGGCGGCCGCCCGGGCAGGTACAGMAATTAGCAAGAGACATTTTCTGCATTGT 
GA GMA TCMCATAGACACCTTAAAGACCCCTTTGAGAGTGTGGCTTTTTGAACTTTTCA 
GATTTTGCTCAGTGACCTGCTAACACTTACGTGAGAGGCTCCAGGTGTAAATAGAATCTA 
ATGGCAGAATCTGTAAGTGTAAACAAGCATCTTAGGAGTGAGAGATCAAGACCACAAAAT 
GTCCA GAGCTAT GACCACAGCTATACCTACCCATAAAATACGATACTGGAGTAGGGTATT 
tttgtci rrm iCTTACCTAAGAGCTAGCTAATCAGGACAGGTGATGCAGGTTCTGGAG 
CTCTACCAGGGCAAGTTCTATTTTCTTTTTTTGAGACAGAGTCTCACTGTCGCCCTG 
Sequence 2760 

CCGCGGTGGCGGCCGGGCAGGTA C I I M I M I I I I M I M I M M I 1 CGAATATTNCTTT 

CAGTGTTCTCAGATTGACTTGACCAGCCTAAGACAGATGCCAGGGACATCCTCTTNTNTG 

CCTNTNAACACTTCAGTCAGATGGGAATATGGAAGGATCATATNCAAGAGGATCATATTT 

TNTGAAGCCAATCCATTANATGTCAGGAAA 

Sequence 2761 

AATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTGTAGATGTTCTTTGGTATCTGGGCA 

TTGMGMTTAGGTATTTATTATMTCTTCACAGTCTGAGCTTGTTCGTAGTGGTCCTTC 

TTGGAAAGGCTTTCTAGATATTCAGAAGGACCGTAGGTGTTGTGATCTAAGCTGTATCTG 

CTTTAGAGGGCACCCGAAGCCCCGTAATGCTCTGGTTCTTGCAGACTTCTGGAGATACTG 

CCTTGATGGTCTT GGATA AGATTTGGAAGAATCCTCTGGATTATCAGGCAGAGACTCTTA 

TTCTCTTTCCTCACTTTTTCCCAGAGTCTTTCTATGCTGAGCTCTCTGGAGCTGGGGGAG 
G 

Sequence 2762 

TNNCCGCGGTGGCGGCCGAGGTACTGTCCAACCAAAACTTTCCACNGNGAAAATTTTCCT 
GG GTGAGC CTCCCAGAAAAGCCCAGCTTAGTGTAAGCCAAAGACCTCCCAAGTCTGTCAC 
CAAl I I n l CCCTATTACTCACCTGATCATGTGGGCAATATCCCAGTTGGTCTCTGTAGA 
CMTGGTCCCTCTATTTCMCACCTTTTTCGGTGACAGTGGCGATTTGAAACTGGAAGAA 
AGCAGAGCATTCAATAATGCCCACGCCTTAAGTCCTTAAATGAAAGGTCAGGTGGAGGTC

TTCCCCAATGTGAAAATAGGAGTCACACAAGTAAGGCGNATCTGTTCTTCAAAGCATAGG 

CTC 

Sequence 2763 

AGGTACTATAATCTCCTTCTCCACTGGCAAACAGATTATACCAGAAAACAGCAGTTCCCT 
ATGGAG AACMTTTCACA ATTATCCCCMGTATACMTGCCAGGTMTTGCTTTCGGTAG 
TTATTAC I 1 1 1 1 1 I I I I I IGAGACAGAGTCTTTGCTCTATCGCCCAGGCTTGGAGTGCAA 
TGGCTCGATCTTCAGCTCACTGCAACCTCC 
Sequence 2764 

GCGGNCGAGGNACAANCNACTTGGGGGGGCANAAAACCNGCCCCCCCCACGANGAGAAGG 

GGACNANGAGAANNNTTTACACACAAGNGGGGGANNNCCCCCNAAAAAACCGGGCGCAGN 

GGCNNACACNNNGNAANNCCAGNACNNNNGGGAGGCCGAGGCTNTTTNAAAAAAATTNNT 

TTNGAGGGGGGGGGGGCCCCCGN 

Sequence 2765 

CCGGGCAGGTACGCGGGNTTCCGCGGGGCTTGCTGGGAAGAGAGGCGAAGCCAGGTCACC 
TTTCAAGGACCCAGAAGTAGGGTTTTGGCCTAGGTAACCGGGGCAGAGATGTGGTTCGAG 
ATTCTCCCCGGACTCTCCGTCATGGGCGTGTGCTTGTTGATTCCAGGACTGGCTACTGTT 
TNCCT 

Sequence 2766 

ATAGGGCNMTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTACCCTCTGCTTTTGCAAG 

GACTCTACTGTGTCATCTTAAGTGAGACAGGTGCATATGTATACTAGTAAAATTCTCTGC 

TTTCCTAGCATTGGACAAAAACAAAAATCAACAAAAGAATTGCCTCAGTGTCTTAAACTG 

GGATCCTTACTAGTTGACTAGGCACTTAGTTACTGAAGGATATGTGTGGAATTCAAGTTC 

TTTT CAACCTATAAGAAATATCCGGCCAGGTGCAGTGGCTCACGCCTGTAATCCCAGCAC 

TTTTGGGAGGCCGAGGCAGGAGGATCACAAGGTCAGGATTNTCAAGACCNGCCTGGCCAA 

CATAGTGAAATTCCTGNCTTCTACTAAAAATN 

Sequence 2767 

CGGCCGCCCGGGCAGGTACTATTACTAGGTTCATTGTTTCCAGAGGGGTGAAACGGGGCT 

TTGGAGAGGTTAAATAACTTGCCCAGGGTCACACAGCTATTAAGTGGTAAAGCTGGGATT 

TACATGAGCCCAGACAAAGAACCCAAGAAGCTAAGCTATTNTCTTGTAATACCTNCAACA 

TAGGAGGCAAGAAGTGAGGTATTATACAGGTTGAGGAGATA 

Sequence 2768 

CCGGGCAGGTACGCGGGCATCAGCTCCGTGGGAACTCACGAGCCTGGGGAGAGTTCGTCA 
TCCCCACATGGAACCTCAGTATGGCCAACAGGCAGCCCTCTGGTGGAAAMTCAACAATC 
AGTCCTTGCANCAACTTGATGGAACNCTAGTTGACCACCATAAAGATGTCAAGCCAGGCA 
GCA 

Sequence 2769 

AGGTACTACTGCTGAGGTCTCCAGGACAGAAGTCACCTCCTCTGGTAGAACATCCATCCC 

TGGCCCTTCTCAGTCCACAGTTTTGCCAGAAATATCCACAAGAACAATGACAAGGCTCTT 

TGCCTCGCCCACCATGACAGAATCAGCAGAAATGACCATCCCCACTCAAACAGGTCCTTC 

TGGGTCTACCTCACAGGATACCCTTACCTTGGACACACCCACCACAAAGTCCCAGGCAAA 

GACTCATTCAACTTTGGCTCAGAGATTTCCACACTCAGAGATGACCACTCTCATGAGCAG 

AGGTCCTGGAGATATGTCATGGCAAAGCTCTCCCTCTTCTGGAAAAT 

Sequence 2770 

CCGGGCAGGTACAGTTGGACCTGCTGGCATTCGAGGCCCTCAGGGTTCACCAAGGCCCTG 
CTGGCCCCCCTGGTCCCCCTGGCCCTNCTGGATCTCCAGGTGTAAGCNGTGGTGGTTAT 
Sequence 2771 

CCGGGCAGGTACTGTAGATTGAGATATAACAAAAAGATGATTTCTGAAATAATATTGGGA 
MAGMTTTACCGGATGGTATTTTGTATCTTTGCACACCTCTGTTGTTATGATGCTATAA 
GGCAATCTTGTCCAACCAGCAGCTCACAGGCCACATGCCACCCAAGATGGCTTTGAATAG 
GGCCCTATACAAATTCGTAAACTTTCTTAAAACATTATGAAA \ I I I I I M I I I I I I I I I I 
ACAA TAAGG CTCTTGAAMTTGTCATTACTTGTGTTTTCCTATACATTCATCTGTGTGAA

AGCCTTTTTCTTCTTTGATTTAAAAAMTTMCTATACAGTTMTGGTTTAGMCTTAGA 

ACTAC 

Sequence 2772 

AGGTACACTCAAAGGCTATGTTCTCTCTCAGGACTCAAGATGAATTACTGGCAGAATTCC 

TCACTACATCCTTTAATGGAAACCTCTGTCGGTCCTCATCATCAGTGCTTTCATA7TCTG 

ATTCTTCACTAGATAATTCCTCATCCTCGTCTGGCAAAGGAGGTTCCTCAGGTGGCTGAG 

GAGATGTGGGTGGAAGAGGTGCATGCAATGGCATATAGTCTTCATACATGGGAGGTCGCG 

CAGTMTTGGTCCAMAGGTGTGGGCAMTTCATTTTATTCATAAGATGAAGGACCTGTN 

CCTGCCCGG 

Sequence 2773 

CCGCGGTGGCGGCCGAGGTACTCAATGTTTAGCTCCCACTTAGAAGTAAGAACATGCCCA 

GCACTTTGGGAGGCCGAGGTGGGTGGATCATGAGGTCAAGAGATCNAGACCATCTTGGCT 

AACAANGTGAAACCCTGTCTCTACTAAAAACATAAAAAATTGGCCCGGGTGTGGTGGTGG 

GCACCTGTAGTCCTAGCTACTCGGGAGGCTGAGACAGGAGAATCGCCTGAACCGGGGAGG 

TGGAGGTTGCAGTGAGCTGAGATGGTGCCCAACAAGAGGGAAACTNCTTNTNAAAAAAAA 

AAAMAAGAAAACACAGGTATMTTTCCTACCATGTTCTTACNGGATCATAACTTTAAAT 

GGTCACCCGCGTNCCTTGCCCGGGGCGG 

Sequence 2774 

CCGGG CAGGTAC I M ) I I I 1 1 1 1 1 1 1 1 1 I 1 I 1 1 1 I AG AATAAAAGACCATTTTTATTCTA 

ATTTTMCTCAGAMTTATTATGCTTATTCAATTTAACACTTTCGCTGAAAGGTTAAAG^ 

GATMGMGGACAGATTATAATTGCTTAATATTGCTATGGTAACTTCCATTCAAATACCT 

GTGAGTCACCAGAAGTCAAAAAGGTAGCCAGCATTGCAACACAGGATGGATCATGCAACA 

GAMCTAGCACCAGGTTACCTTATCTTATAATATTATTTGCTGTTAAAATGAMTTTTAA 

AACAGCACCAAMGTTAAGTTGGGGCTAAAACTGTTGTGCAGGAAAGATTTCATATAGCA 

G 

Sequence 2775 

TGGCGGCCGCCCGGGTNGGTACCAAAGCCAGATCCTCCTGTTTTGTAGCAGGAAGCCCTT 

TATTAGTTNNTCTCTATCAATCCATCTTTNATAATNNCCAAAATAGGATAAG 

Sequence 2776 

CCCACGCGGGCGAAATTTTGGGGAGGGCAAACACCCCCCGCGGGNGGGCCGGGGCCCGCC 

TCCNAGTAAACCTAAGAGGGGGAGGNTNTTCCAANNCCCCGCAGNTNNNTTAAAANCCCC 

ATCAAGGCCCNAANCGGAANAACCCGGGGGGGACCCATCTNAAGGGGGGGGGGGGCCCCC 

CGNGGNACNCCCAAGCCAANNANGGNNNCCCCCCGNAAAGNNGGAGGGGGGCGNAAAANA 

GGCAGCCGCCCNCNNGCCGGNAAAANCAAAGGGGGNCCAAAAAGNCAGGGGGGGACCCNG 

GGCGGGGGGAAAAAAAGGGAAAANCCCCGGCNCCAACCAAAANNNCNANCAACCAAACCA 

ATAACCAAAAGCCCCGGGGGAAGGCCAAATAAAAANNGAGGGGNNAAAANGCCCCCGGGG 

GGGGNGGCC 

Sequence 2777 

ATGGGGGAAAACCCCGGGGGCGGCCGCNCNAGAACTAGTGGAGGNNTCCAGCCCGCAGGN 

AANNAAACCACnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnNCAA 

CNGGNCCCCCCAGNGAGGGGGAAAAAGCGCGCCGGGCGGAANCAAGGNCAAAGCAGGNGC 

CCGCGGGAAAAAAGGNANCCGCCCACAACAACCACACAACAAACGAGCCGGGAGCAGAAA 

GAGGAAAGCCCGGG 

Sequence 2778 

ATAGGGCGA ATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTA C1 iTTTTTTTTTI 

I I i I i i i n i C CNCCCN MCCAGTTCTMCCGCANGNCAATTAGTTTTTNAAAAGTCTCT 

CTCTTCTCATGTTTTTTCCTGCTTTGAAACTCMGNGNTCTCTTTTACTTAATANATCAA 

GATCCAACACTAATTTGACATTAAATCAAGACTCTNTACAAGTNGACTATCNCTAACNGG 

CTGGGGGACTTCTGCGGAA 

Sequence 2779 
ATTGNAGCTCCCCGCGGTGGCGGCCGAGGTACATTGCTGGCCTTTGCCCAAATTATGCTT

NCCCCATTTGGTATGACCTGACACCATTGTGATCAGTCTGATGACCTGGCAGCATCCCAT 

C TGCCTA CCCAGTTCACTNTTGCTCCATTTCAGGGCCTCTTACAGGCAACTNCTNACATA 

TATTTTTGACACTGACTCATGCCTTCTNAGGCTNAGCTAACATCAGCCCATTCATTGATN 

CAGCAAGACAATTTGAGAACCACTATGTGCTCTGCCAGAGTGAATAACAAAACCGCAAGT 

ACCTNCCNGGGCGGNCGGTCTAGAACTAGT 

Sequence 2780 

CTATAGGGCGA ATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTA C i I 1 1 1 1 I 1 1 1 
I I 1 1 1 I 1 1 1 1 1 I NTNCCT MCCAGTTCTAACNCAAGTCAATTAGTTnTCANAAGTCTCT 
CTC7TCTCATGTTTTTTCCTGCTTTGAAACTCMGNGTTNTCTTTTACTTA 
Sequence 2781 

GGGGGGGGNTTTGGGGGGGGAAAAAGCCCCGGGCGGGAGNGGACAGAAAAAGANNTNGGG 

GNNGNGAGGGGGTTAGGAGNAANCCCNNGNGGGTTTTCCNAAACCCCCAGNGANGCCGGG 

NCGGCNGNGCGCGCANNGGAGGGCAGNGCTATGNANCCGNCNNGGGNACNCCCAGGCCCN 

GGGNNCCACGNGGCCCNNCCGGCACGNACNGNGCCGCNAAACCNCGGNAGGNCAGGCGAN 

GNANCCCNCNGCANAANNGAAGGAAAAAAAGGGGAAAAGCGGGGGCMAGAA<XAACAAC 

NCNGGGGGAACCACAAGCNAGGGAACCCCGNGAGAANAANANANCNGGGGANAANNNNGG 

GGNGGANAGNAANAACAAAGGGAAAANGGCCCCCGAGGCGNGGAAAACCCAAAAAGAGAG 

GNNAACGGGGGGCGGNNAANNGNACCANNGGCGNAAAAAAAAAAGANAAAAAGAGGGCNN 

GGNGGGGGGGNCGNAAAAAANAACACACGGGACCCCNNNNAAGGGGGANACCCNCNCGGG 

GNGCCGGGGGGNCCCCAAGNAAAACCNNAAAGGGAGGGGNANNCCCCCCCCCGNGGGACC 

CGNANNAANGGGAAAAANCCCCGGAANGAAAGNCAAAAGGCCCNNGNAAACCGGNAAAAA 

ANCNCGGGGNCAGAACCCCCCNCANAANGGGGGGGGGGGGGCCCNAGGGGNAAACCCCCA 

AGGCCCGGGGGGGGGNGNNCCCCCCNNNNCAANNNNNNNNNNAAAGGGGNNNCNAAAAAA 

AAGCCAGCCGGCCNGCGGGNNCGGGAAAAAACNAACGGG 

Sequence 2782 

TCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCATGCCGATCTCTGAGAAGTTNTGTTGCA 

CCACTGTGAAGGTCTAGATGCAAGCTTGGCTCCCTCAGAMGGCGCTTCCCTTTTGCATG 

GCTGAGGATCCTTGAAGGAACCTGGTCAGTCTCCGGTTCAGCTTCCGACACCAGAGTGGA 

ACCCAGTAAGCACCATCAGGAATGGATTTCACTACAAGTGTGGATAACTCTGATTTTCAA 

AGGAGTAGTTACTTGCAAATTACATCCTTGCTGMTTCAGGAGGTATGAMCCCTATTTT 

ACCATGTTAG AAAACA GCCCAGGATTTTCTCATTGCTCTGCCATCATATATGTCTATGAC 

TTGAGCCCTTATTTTTCCATCTGCAAAACMTMTGCCTATATGTCTTTGCATATAGATT 

TGGAAATCTTCATTCAAGGTTTAGTAGGATCA 

Sequence 2783 

CCGCGGTGGTCGAGCGGCCGCCCGGGCAGGTACTTTTCTTCTTGAAGTGAGTTTAGATCA 

CGTTTCAGCAAACGTCCTCTGGAGCTCCTTCCTCTGGGGAAGGAGCAGCGGAGGGCTGTC 

ATCACCACTGCACTCTTCCAACCTTCTCTTCTGCTCATTCACCAGTTCCATCCTCATCTG 

ACCATATTACTCCAGGMCTTCCCTTTTCCCTTGTAAATTCTGTTCCATTnTGGCCAAA 

CCATGGTTTGCCTCTGCTGCATTAGTTTGAAGTC A I i I U H I CCAGACATTGTGGCGGT 

AGTTCCCAGATGTCCGAAMGATGAAGTTATCAGTGCACGAGCAGCCAGGGCTTTTCTTT 

ATGTTGATTCACCTTAGGACACAGCCCAACAATGCCTGCAATTGTCATTAGCCCGTGAAG 

AGGTTCGCTTAGGGCTGCCACAGCACAGAGTCCCAGCTGCGCCCGTTTCTTCCCCCCCC 

Sequence 2784 

CCGCGGTGGCGGCCGAGGTACTA A I I I U I I I I I I I I I Ml I M M GTATTTTTAGTAGA 
GACGGGTTTCACCTTGTTAGCCAGGATGGTCTCTATCTCCTGACCTTGTCATCCGCCCAC 
CTCAGCCTCC CAAAGTGCTGGGATTATAGGTGTCAGCCACCGCGCCCGGCCTGAATATCT 
TTTTTATTTTTAAGCTTTCAATAMTCTTACTGACATCTA 

TTTMTGTATACATTTTGATGAGGTTGGACATATGCATACACTNGTGGTACCTGCCCG 
Sequence 2785 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC I I I I I I I I N I U I I I I I I 
GAGACGGAGTTTCACTCTTGTCACCCAGGCTGGAGTGCAATGGCGCAATTAGGGTTCACT

GCAACCTCTGCCTCCCGGGTTCAAGCAGTTCTCCTGCCTCAGCCTCCTGAGTAGCTGGGA 

TTACAGGCATCCACCACCGTGCCCAGCTMTTTTTGTATTTTTAGTAGAGACGGGGTTT^ 

GCCATGTTGGACAGGTTGATCTCAAACTCCTGACCTCAGGTGATCTACCCTCCTCGGCCT 

CCCAGAGTGTTGGGATTACAGGCATGAGCCACCATGCCAGGCTGCTMTTCTCCTTTTTA 

GTGAGTTAGGGMCTGAGCCTCAGAAAACTTAAACGATTTCTCAGAAAACACTCAAGTGA 

TAMGTGGCCCCATTGGAMGGAGGTTTTTATCTTCTCATTGGCAGGCCCAGNGTTCATT 

GCACAATATCATGCTACCTCTTGGAATCTTTTAAAA 

Sequence 2786 

TCTTAGGG C GAATTG GAGCTCCCCGCGGTGGCGGCCGAGGTACTAA 1 1 H 1 I I I I M I I 1 

rmrn igtai i i » i agtagagacgggtttcaccttgttagccaggatggtctctatct 

CCTGACCTTGTCATCCGCCCACCTCAGCCTCCCAAAGTGCTGGGATTATAGGTGTCAGCC 

ACCGCGCCCGGCCTGAATATCTTTTTTATTTTTMGCTTTCMTAAATCTTACTGACATC 

TAATTGATAMAGTTGCACATATTTMTGTATACATTTTGATGAGGTTGGACATATGCAT 

ACACTCGTGGTACCTGCCCG 

Sequence 2787 

CCGCGGTGGCGGCCGAGGTACAATACAATCTAGATGACGGTGCAGACTAAGTCAAGAACT 

AAAGTTGTGCAGTAACCCGAGTTAAGGCATGAATGCGGACACACACATGCACACACACAG 

CACCCATGCTATCAAGACACAGGATTTTTTCAGTTGCCTCATGAGAGGCAACCTGGGCTT 

GGCAGTTAATCA GAAC TGCTGAGCATTCCAGAAAATGCCCCCCACGACTTTATGCTAACA 

GCTGTGTGTATGTTTTAATCAAAAAATTAAAGAAGAAAAAAAACCTAAAAAACAAAGAAA 

AAAACAAACAAAAAATCACCAAAAACCTAGAAACCCCTTAATCTCTTACAATGGCTCTTG 

AGCATGGAACTCATGTAGCAGCATCAATGGCTGGCTCTTTAACAATTTGGAAATAAAAGG 

TTGGTTTACTATGTATTTCTTTGGTAGTCATCACTACCAAAGTT 

Sequence 2788 

CCGCGGT GGCGGCCG AGGTA C I M I 1 1 I I I 1 1 1 1 I I I ) 1 1 1 1 1 1 I GCTTTNATGTCTTTA 

TGTAT7TA 1 1 1 1 1 1 1 I I GAGACAGGGTCTTGCTGTGTTGCCCANGCTGGAGTGCAGTGGC 

CTGGTCATGGCATNAAGGCTCACTGCAGCCTGGACCTCCTGGTTCAAGNGATCCTCTTGT 

NTGAGTCCCCTGANANAAAACCCCACCCCCNNCTACANAAATTTTNTGGAAACANGGGCN 

NNAANCTGTTNCCTANGCNTGTNTGGAACNCCTGGGCTCAAGGGANCTTGTANCCTTANC 

CNCCTAAAANAGCTGGGANTTATAAGGCATGANNNAATTGTANCCTGNCCCGGCGGGCCN 

CTTTANAAAAAGTNGGAANCCCCCCGNCCTTGCAGGAAATTTCATTATNCAACCNTNTNN 

NAATCCCCCCCCCCCNCCAN 

Sequence 2789 

CCGCGGTGGCGGCCGCCCGGGCAGGTAC I 1 1 M M ( I M I I < 1 I I I M M 1 1 1 I I M 1 AG 
NAGANATGGGGTTTCACCATATTGGTCAGGCTGGTCTNGAACTCCTGACCTTGNGACCCA 
CCCGCCT TGGT CTCCCAAAGNGCTGGGATTACAGGCATAAGCCCCCGTGCCCGGCCCACA 
TGGTATTATTTTATATAAAACGCAAGTTMGTNTTTATGTC 

CTTTNTCMAMGMNTTNACTTATTNGTTAAMCCNTTTTTTGNCCTAAGGGCCAAATA 
NGNGAMAA AAANCCC NNATTNNNCTTTTTANATTTTTNTTGGCNAAAANTNNAAAAAAN 
ATTTNNCNNN I 1 1 I II I GGNGNNANTAAAAAAAMGGGTTTTTTTNNTTTTTTAAAA 
Sequence 2790 

TACTATAGGG CGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTA C I 1 I 1 I I 1 i 
I I I I I i M I I iGAGACGGAGTTTCACTCTTGTCACCCAGGCTGGAGTGCAATGGCGCAAT 
TAGGGTTCACTGCAACCTCTGCCTCCCGGGTTCAAGCAGTTNTCCTGCCTCAGCCTCCTG 
AGTAGCTGGGATTACAGGCATCCACCACCGTGCCCAGCTMTTTTTGTATTTTTAGTAGA 
GACGGGGTTTTGCCATGTTGGACAGGTTGATNTCAAACTTCTGACCTCAGGGGGAATTAA 
CCTTCTTNGGCTTCCAAAAAGGGTGNGGGATNAACNGGGTTGNGCCCCCCCCCCCCNGGG 
GGGGGA AAAAATTCN NTTTTTTTNGGGGGNNGGGGNNNCAANNNCCCCCCNAAAAAAAAA 
NNNNAAAi i i u i I I I ANAAAAAANNANNCCTTTNTNAAAAAATGCCCCCCNCCCCAANG 
NNGNNGGGNI MINI 
Sequence 2791

GGTCTTCTATGTGGGTGTCAAGGATATGCTCCCTCACGGCTTCCGAAGGCCCCCAGCAAA 

AGATCTAACATTCTTGCTCAAAGTTGCGAGAGAAAGTAGCACATGGAGTAGCTGGGTTGG 

GGCGGCGGCCTCTTCTCTTCAGCTCCCTTAGCTTGGCTCCGTAAGTGGATCACTTGCCAA 

ATGCTTTAGATGATTGCCTCTCAATAATTGAAAGGTGGTGGTAAGTTGTATTGCTGCACT 

GTCGGTGTTAAGAGAMTTACTCTCACAAGAGCAGAGGCCTGAAGATTCTTTCTTCTGAA 

GGGATGATGAGCCTGGACTCTCTGGACTCCTAGATTATGAACTCCTGCAGTGGACCATGT 

CCTAI I I I I I GGAGGGCGTTGGG 

Sequence 2792 

CCGCGGTGGCGGCCGAGGTACCTTCAATACCTTTAGTTGTCTCCCACACACGCGTGTGTG 

TGTGMATCTTCTACMTATCTTCCCTTTTTTAGACCATGTTCACTGTCAAAAAGGTGCT 

TTAAGAGCAGTCTTTGGCTGGGCACGGTGGCTCACACCCGNAATCCCAGNACCCTGGGAG 

GCCGAGGCAGGCGGATCACCGAGGTCAGGAGATCAAGACCATCTTGGCTAACAGGATGGT 

CTTAAGGGACAGNGAAACCCTGTCTTNAACTTAAAATACAAAAAATTANNTGGGCGTGGT 

GGGNACCGCNCCTGTAATCCCANCTACTCAANGAGGCTNAAGGCAGGANTAATCACTTTG 

AACCTGGGAGGTAGAGGCTTGCAGTTGAGCCAAGATTGCACCCACTGCACTTTCANTCCT 

GGGCCGAANAANAGCCANGGACTTTCATTTTTAAAAAAAAAAAAAAAA 

Sequence 2793 

CCGGGCAGGGACCNCGGGATGGTGNCAACTTATGACAGGACCCATGGGCCCTCCCNATGC 
ACACAGNACTNTTGGAATCTNATCCTTTTCCATGGCTCTGGCTCACACTTNCACAGNATT 
TACTCCTAAATATGCCCCCTGNGTTCA 
Sequence 2794 

T ATAG GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACANATTTGAATGGCTTTGA 

CTTTTGGCAGCTGCACAGNGCTAGGACTGGACCATGAAATATCTNTGGGCTNTGNCAATN 

ACATTTGGGTTAANCTAANCCTGATCCCATGTGTTCCTGGAAGAGAAGCCCCATGACATT 

CAAAGTCCTTGACAATNTGACACCAGCTTTTNTAACCNTATAAGGCCC 

Sequence 2795 

ACTACTATAGGGCGTCGAGCGGCCGCGCGGGCAGGTACGCGGGCACACTGAGGAATTATG 

ACTACTATGCAAGCCGAGTTCCAGAATCTATTAAAAATGCACTTGTTCCTTAAGGGAAAG 

TTTCATTTGGCCGGGCGCGGTGGCTCATGCCTGTGGTCCCAGCACTTTGGGAGGCCGAGG 

CAGGTGGATCACTTGAGCTCAGGAGTTTGAGACCAGCCTGGGCAATATCGTGAGACCCCA 

TCTCTACAAAAATACAAATTAACTGGGCATCCTGTCATGCGCCTGTCGTCCCAGCTACTT 

GAGAGGCTGAGGCAGAGGAATCTCTTGGGCCCGGAAGGCGGAGGTTGCAGTGGGCTGGGA 

TCGTGCCACTGCACTCCAGCCTGAGTGACAGGAGTTAAGCCCTGTCTCAGAAAAAAAAGA 

CAAAAACCCAAAAAGTACCT 

Sequence 2796 

TACAGAAGCCGGGAGCATAAAGACGTATAAGCCTNGGGGGTTGCTCTAATGTAGGTGAGG 

NTAACATNACATTANATNTGNAGTTGNCGCCTNACTTGCCCCGCTTTTTCCAGTTCGNGG 

TAAACCTTGTCNTTGCTCAGCNTGCATTTTAATTGTAATTNGGTCCAACNGCGCTGGANG 

GAGNAGGCTGGTTTTTGCCGTTATTTGGNGCCGCTTC 

Sequence 2797 

ACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGATGATAAT 

AAGA TGCM TTTGAATCTTCATCATACGAAGGATACTTTCTAGCTTGTGAAAAAGAGAGA 

GACCTTTTTAMCTCATTTTGAAAAAAGAGGATGAATTGGGGGATAGATCTATAATGTTC 

ACTGTTCAAAACGAAGACTAGCTATTAAAATTTCATGCCGGGCGCAGTGGCTCACGCCTG 

TAATCCCAGCCCTTTGGGAGGCTGAGGCGGGCAGATCACCAGAGGTCAGGTGTTCAAGAC 

CAGCCTGACCAACATGGTGAAACCTCATCTCTACCAAAAAAAAAAAAAAAAAAANGTACC 

TGCCCC 

Sequence 2798 

CCGCGGTGGCGGCCGAGGTACTGAGCCCTTCATTCCCCAAACTCAGACTCTTAGCTCTTT 
TGTCMCTCTGGGCAAAGGTCAGCATTGGAATCGAGCGGCCGCCGGGCAGGTACATTTTC 
TCTCATCCACTTCCGATTCTCTTCATTGGCTGCMTATCTTTTTCTTCAAATCCTATTTT

GTTGGCTTCTAGGAAACCMGCACATCTTGTTGTTTCTTCTTAATCGCTGTAGAGCCAGA 

GGAAGATGCAATATATACACCGGATCACCATCCTGGGAACAGCGGCTGCGGTTGTTTGGG 

TCCTGAAAAGGGCTGTGGAGCAGCTGCAGCAATGGCTGGAATCCAGCTAGGGGCTGAAAC 

AACGGTTGGCAGAGAAGGGTGGGGGAAGTGGGAAAAAGGAAGAACTCGCCAGAAGCCCCG 

CCTTCGNCTTTAGCAAGCTTCCACCGAGCTCTTCTAAGCGCTTGAGTTNCAGCCAAAAAC 

CCCCCCCCG 

Sequence 2799 

AGGTACTAACCTCTTACCTTCCAAGGTGGTAGAACATGCTTGAAAAGATAGTAAGTGAAA 

AGGGGTAGCCAGTGCTTTACTCACAAGACTGCTTGAACATGAGACTCAAGGAGGGACCTC 

AGCAGGCCTGGGGTGTTCAGCAACTATTCCTGGCCGGGGCATCTTGCAAAGGAGTTGCTG 

TGACAGTAAGCTCTTCCACTTTGAGACCGTCACCTCAGCCACGGCTCCCACCTGGGCTCA 

GGAATGGTCAGGCAACACGTGGGGCCCAGGATGGCGGTGGATGACTAAACTGCCGAAGAC 

CGCGCCGCTGCGACTACCGAAGTACCTGCCCG 

Sequence 2800 

CCGCGGTGGCGGCCGCCCGGGCAGGTACMGTATGCAAGTTTTCTGTAAACAGATTACTT 

TCAAACAGTTGGGAGCCCCAGGGAGATAATAGAAGGTATCCTACTCAACAGCAGTATATA 

GAATGCTGTAGAAGAGGAATATTGTAAGAACAAGAACTATCTCTTGAAAACTAAAAATAT 

G ATMCTG AAATATAAATTTAAATAGGATTGGAAGCTAAAGCTAGGAAATTACCCTAGAA 

TGTTm iGAGAAATGGAAAATACCGAAGACCAAAAAAATAAAATAAAACTATGCTGTGA 

MGAGAAAGATTAGCATTTGAGAAGGCGGGMGTTGGTCCTGCTCAGATGCCGGTTTTCA 

GATGCCTTATTCTAAAAAATAGNGATAGAAAACAAATTTCTACAAAGCAAAATTACAATA 

AATTTA 

Sequence 2801 

CTACTTAGGGCNATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGACGGGTGCCTGTAG 

TCCCAGCTACTTGGGAGGCTGAGGCAGGAGAACTTGTTTGAACCCGGAGGTGAAAGTTGC 

AGGGAGCCGAGGTTGTGCCACTGCACTCCAGCCTGGGAGACAGAGCAAGACTCTGTCTCC 

MAACAMCAAACAAACAMAAAACCCTGTAGCTTGGGATCAGCCTTCTCTTCTGTTGTT 

mCTTTAAAAAATAAAAATTAAAAATAGGCTTCAAGTGATCCTCCCGCCATGACCTCCA 

MACTGCTGGGATTGTAGGTGTGAGCACTGCACCCAGCCGTATGTTTTTTTCTACATAAA 

AAACAGCACAGGATTATCTTCCAAAGCTAACAAATATGTTCAAATAACCACAACCCCAAN 

TNNAAAAAAAAAAATNAAGTACCTGCCCGGGGCGGCCCCTCTAAAAACTAANTGGGATCC 

CCCGGGGC 

Sequence 2802 

NNANGNCCCNNNAGNGAGGGGAAAANGCGCGCCGGGCGNAANCAAGGACAGAGCNGNNNC 

CCGGGGGAAAAGGANANNCGCNCACAANACCACACAACANACGAGCCGGGAGCANAAAGN 

GGAAAGCCNGGGGGGCCNAANGAGGGAAGCCAACCCCACAAAAAANGCGGGGCGCNCACN 

GCCCGCNGNCCAGGCGGGAAACCCGGCGGGCCAGCNGCAANAAAGAAAAnnnnnnnnnnn 

nnnnnnnnnnnnnGNNNGCGAAAAGGGCGCNCNNCCGCANCCCCGGACACCNAAACGCNG 

NGCNCGGNCGGNCGGNNGNGGNGAGCGGGAANAGCNCACCCNAAGGCGGGGAA 

Sequence 2803 

NC CACCGC GGTGGCGGCCGTNCGGGCAGGTACTCCTTTCAGAGGGTCATCTCCTCCACAA 

GTAi MM I GT TTCTT TGGCTGGTCTGGGTCCAATGCTGTTGCCATCCCCAGCTTCAGAC 

TGTTCTCCTTGTTTTGGAGMCTTTCTTTGGACTGTATCTTCAGAGACACTCCTGGTCAA 

GGGGCCTCAGAGGACCCAAACGCTCTGAAACAGCGTCTTAGCTCATCGCCGAGTGTCAGC 

TCTAGCTCTTCGGAGCGCTTTCTTCTCCCCCCGCGTACCT 

Sequence 2804 

CCGCGGTGGCGGCCGAGGTACAGAGAMATATTTTTTAAAAATCTCATCAGGCTAGGTGA 
GGTGGCTCGTGTCTGTAATCCCAGCACTTTGGGAGGCCACGCTGGGTAGGTTGCTTGAGT 
CCAGGAGTTCAAGACCAGCCTGGCCAACATGGCAAAACACCGTCTCTACAAAAATAATAC 
AAAAATTAGTGAGGCATGGTGGCACACACTTGTAGTCTCAGCTATATTACTTGAGAGGCT 



WO 01/070979 



PCT/US01/09126 



TABLE 1 

457/467 

GAGGTGAGAGGATCACCGTMGCATGGGAGGCAGAAGTTGCAGTGAGCCTAGAATGCGCC 

ACTGCACTCCAGCCTGGGCGATAGAGCAAGACTGTNTCAAAAAAAAAANATTTGGGGAAT 

AAAGGAAAGTTCCTGCCCCGGGGCGGGCCCGNTTNTAGAAACTTAGNNGGATCCCCCCCG 

GGGCCTTGCMGGMTTTNCGATATCNAAGCTTANTTCGANTCCCCGTCCACCCTNGAAG 

GGGGGGGGCCCCGGNTACCCCAAATTTT 

Sequence 2805 

TAGGGCGAT TTGGAGCTNCCCCCGGTGGCGGCCGCCCGGGCATGGTAC 1 1 I 1 I I 1 1 I I I 1 

rrmrm igccagcttttctmgcaaatagattgtctgmttagtcacagaataattt 

TGTGAAMTTCATGTTTMGTAGCAACTACCCTTTCTTTTTTT 

TAGTTTATCTTCTTCTAACTGGNGCAGTCACTTMTGTTTTCATTMTCTTCGACC 

GAGNGAMTACTGATATTTCTAGAAAAAAATTCTACTCCTCTGATTATTTGAAATGCTGA 

GGAAAATGTCCCTCCCATAGTAAAACTTGTAAATAAGGAACTATATCATATTCAGTAGCT 

GNGTTCTGTTCCATCTTTTT 

Sequence 2806 

CCGGGCAGGTACATTCTCTGTTCATTACTTAGTTCTTAAGGATATGTGTTCATCCATTCT 

GTCGCTGGCTCAGAGTTTGCTTCACTCTCTAGACCAGAGTATAATTTCATTTGGCAGTCT 

CCTATACAAATATGCATTTAAGTTTTTTGACACGTACGCGGGGGACTCAACAGAAATGGG 

TTTCCAGMGAATAATGAAMGTTGTGGGTAGGAAAATGAATCATTTGGACTCTTCAATG 

AAATGGAGTGAGCCCAGGAGAGCTCAGCCAACAGAGGCACTCTGGGAACCTGTTAGTAAA 

GCCAGGCTGGCCAAATGCCATTTGATTTTGAACCTCGTAGGTCCCCACTCACCCTCTGCC 

AGGAGCT 

Sequence 2807 

AGCCTCACCCGC GGTGG CGGCCGAGGTACCCCGGGTGTTCCTTTTTGTTCAAAGTCTATT 
TTTATTCCTTGATATTTTTC I I ! I I I I I I I 1 1 I GTGGATGGGGACTTGTGAATTTTTCTA 
AGGTGCTATTTAAACATGGGAGGGAGAGCGTGTGCGGGCTCCAGCCCCAGCCCCGCTGCT 
CACTTTTCCCACCCCTCTNCTCCCACCTGCCTCTTGGCTTCTCAGGNCCTCTGNCTCTTC 
CCGACCTTCTCTTCTTCTTGAAAACCCCTTCCTNCCACAGCTTGCAGCCCCATCCTCCCC 
GGGCTTCCCTNCCTAAGTCTGGTCCTGGCGGTCCTCTNGTCCCCCGGGNTTTCNAGAGGA 
CAACTTNCCCCAAAGGCCCAAAGCAGTTTTTTCCCCCCCTAGGGGGGGGGGA 
Sequence 2808 

CCGCGGTGGCGGCCGCTCGGGCAGGGTACATGCCACCACGCCTGGCTTATTATTATTGTT 

TTGTTTTGGAGACAAGAGTCTCGTTCTGTCGCCCAGGCTGGAGTGCAGTGGCACAATCTC 

AGCTCACTGCAGCCTCCGCCTCCTGGATTCAAGCAATTCTCCTGCCAGAGCAGCTGGGAT 

TACGGGCACGTGCCACCATGCCCGGCTMTTTTTGTATTTTTAGTAGAGATAGGGTTTCA 

CCATGTTGGCCAGCCTGGTCTCGAACTCCTGACCTCAAGTGATCTGCCACCTCGGCCTCC 

CAAAGTGCCCGAGGTTACAGGCATGAGCCACTGCACCCGGCCTATTATTATTAACTTAGT 

GTTTGCTMGTGCTTTATAGATACGGACTTGCTTAAATCTTATAATAAGTCCTGAAAGAT 

GGGGTGATAACCTCATTTAAGAAATT 

Sequence 2809 

AGGTACTGTAMTATTACCATTATTTAMATGTTGACATTTCTGCATTAAGTAGAAACTT 

TCTAAATGCCTAAATACCACTCAAACATGACTTAAAAGAAATTGAATGACTCACCACTAT 

GACCTTCMGAGTCTGATTCATAGAMGGTTACTGGGGGCTGCMGGCCCCTCMTTTTG 

CATCATCTGTCTGCGTCTCTAATTTCAAAACTTTCAGTAATCCATCTTCACCACCGCATG 

CT ATGM CCCTTGTTCCTTGTTCCAGGATACACACTGCAGCTTCACGTTATTGGGAATGG 

AAATTTTCTTGCTCAGGTAGAAGAACATCGTGGGATCCCCGAGAGGGTCACGGCGGCCGC 

T 

Sequence 2810 

AGGTACGCGGGGGGCTCTGAGAGGAGTCTACCTTGCCTTCTTATGGGAAGGGAGACCCTA 
AAMACTTTCTCCTCTTTGTCCTCCTTTTTCTCCCCCACTCTGAGGTTTCCCCAAGAGAA 
CCAGATTGGCAGGGAGAAGCATTGCGGGGCAATTGTTCCTCCTTGACAATGTAGCAATAA 
ATAGATGCTGCCAAGGGCAGAAAATGGGGAGGTTAGCTCAGAGCAGAGTAGTCTCTAGAG 



AAAGGAAGAATCCTCMCGGCACCCTGGGGTGCTAGCTCCTTTTTAGAATGTCAGCAGAG
CTGAGATTMTATCTGGGCTTTTCCTGMCTATTCTGGTTATTGAGCCCTTCCTGTTAGA 
CCTACC 
Sequence 2811 

GGCAGGTACI I I I I 1 1 1 I 1 1 1 1 1 1 I 1 1 1 1 1 GCAACATTGCCCTTTGATGTCCCCATGAGG 

GCCAGGCCCAGGCAGMCCCATCCCATTTTATCCTTAAACTCANAAGGAAATTTGTCTAA 

ATATTAAAGGATTAATATGGGAATAAAAAATGAACCTTAAACCCTGCCACTGATACACAA 

GCTGTCTNTCTTAGAGTTCAATGMCACTTCAGGAGAGTATTTCCAACAATATTTAGATA 

TTGGMTATCTAAATATTGTTGATTTAGATAACCACCCTAGATTTCTCACCACCCTAGAA 

CATTTAGTGGGGAGACATTCTTTTCTCCTTT7TCTGATAACTTGGTCAGMGTGATTGAC 

TGTGCAAATGGTATTT 

Sequence 2812 

NNCGGCGGCCGGGCGGGNACGCGGGNGAGCNNTACGAGGGNCAGGAANCCCAGANGCCGC 

NCGACCTGGAGNCAGCCTACAACGCNTTCAACCGCTGCCGGCANGCCCGAGCCCGGGGCA 

CTAGCCCNNGCACAGAAGGGCAGAGNCNGAGGCGANGGCNCCNGGNCCCCNGNCCGCCAC 

ACAGGCCTTTGGTNTTNTCACACAACNCACGGGGCGGCAGCCGCCNGAAAGNAGACNGNC 

CCCGGGGGCAGAACANNGGGGGCCGGGCCCCNCCCCACAANAAAGANGCNCNCCGACAAA 

AAAAAAAAAAAAAAAAAGNACCNCGGCCGCNCNAGAACNAGGGGANCCC 

Sequence 2813 

CGGTACCCMGCTTTTGGTTCCTTTMGTGAGGGNTNAATTGNCGNCGNCTTTGGCCGTA 
ATTCAATGGGTCMTAGCTTGGTTTCCTGTGGTGNAAATTGGTAATCCCGCTTCAACAAA 
TTCCC ACNCAACATACG N AGCCCG 
Sequence 2814 

CCGGGCAGGTACGTTCTTTTNGCTTTTCCTTTCNGTAAGATGGTCTTCAGAGCTNCTTAA 

ACACATTTAGAAAAAAGTTAMCCCCMGACNCTTTGGGGATAGGTTAATTTTAAGANGC 

CCAACTTTGGACTNGGATTAAAGGMNTACCTTAAAAANCCCNGGNAAAACAATTATTTT 

TTGGG 

Sequence 2815 

GGCGGCGGCCGAGGTACAGAGAAGCCATCANTTTAGAGGGCAGCANAAAACCAGAAGCCN 
GNTTTGATCCCTNAACACCAAGANGCCTNTAACAACANGNCACCAGCACCCCCAGGAAGG 
CCAAGGAGTCCCACAGAAAAACCTAGGGNNAGACCAA 
Sequence 2816 

GCGGCGGCCGAGGNACMGNMNCNCCTTTTTTGGGGGGGGGAAAAACCCCCCCNCCCCN 

NNNNCNACCCAGANGAGGAGGGTTTCNGCCCCCAGGGGAACANCNCCNAAAANCAGCNCG 

G CCNNNGNC GGGACCAGNGCCAGGNGGACAGCCAGNGNCCNGGCCAGANGAAAGGCNGCC 

GTTTTTTTT1 NTTNGGGGGGGNNCGGGGGGGGGCCCCCCCAAAAAAAANCAGCCACCACC 

AGGNGGGGNGGGGAGGA 

Sequence 2817

AGGTACCCTGAGGTGCTCCGCTGGGGACTCTGCTCATTCTGGGGGTGCAGTTGACGGCTG 

GTCGTGATCTTTCCCCGTAATCTGTCCCCTCTTACGGAACCTAGTCTCCCGTTCTGGCCA 

TGGCCTTTCTTCTTGGACACTGCTTAGGANCCCAGAMGMGTATTGTTATCAAATTCTT 

AAAGCCTTAGGMGAAAGTCNAGGGAGTGGGAAAACCAGGCTTCTGANAAAGAATACCTG 

NTTGGCCCACCTGNATCTTCCNAGGNCANCCACGGAANTCCCGGGCCCCTTCCAATCAGG 

NAAGGTCGGNAATCTCTGATGGTCNATCGGTTCNATGGCCAACCTGGCCAACCAGTTTGA 

AAAAAAA 

Sequence 2818 

CCGGGCAGGTACTGTTCCTGTTGGCCGAGTGGAGACTGGTGTTCTCAAACCCGGTATGGT 
GGTCACC TTTGC TCCAAGTCAACGTTACAACGGAAGTAAAATCTGTCGAAATGCCACCCA 
TTGAAAGCTTTTGAAGTGAAAGCTTCTTTCCTTGGGGACCAATGGTGGGCT 
Sequence 2819 

AGGTACI I I M I I II I I I I I I I I I I I I GGGTTAGGATGGTTCTAACCTGATGGGTTGTGT 




TTACAGTGGGGTTTCCCCCAMGGTTATCTTCCTGCTTTCCTCTGTAAATAGGGCAGTTG 

CTGCAACAGATTAAATACACTCGGGCCACCTGTGGGTTAGTGGGTCMGMTTTTTGACA 

GMAGGCTATAGGCTGCCGGTGGCCCCCGTGTGTTTGGGTGAGCACCCCTAGAGCTATCC 

CTTTATTCACATT GACGA AGAGGTGGAATGGCTGTTCTAGGGAAGGTAGAACTAAGGCAG 

TTATGAGCAGGTGTTTTAATTCCTCTACTTGTTGGACCTCCCGTGGT 

Sequence 2820 

AGCTTATCGATACCCGTCGACCTTCGAGGGGGGGCCCCGGTACCCAGCTTTTTGGTCCCT 

TTAANTGNGGGNTAAATTGGCGCCGCTTTGGGCGNAANTCANTGGNCANTAGCTTGGTTN 

CNTGGTGGTGNAANTTGGTAATCCCGNCTTCNACANTTCCACACANANNACGAAGNCCGG 

GGAANCNTAAAGTGGNAAANCCTGG 

Sequence 2821 

C GGCC GCCCGGGCAGGTACCATCTCTTGGGAAAACCATGCTACCTCTTCTCTCTGTTCTC 

TATTTTGCCACTAGAGAAATAGAAAATAAGGCTGGGAGCAGTGGCTTATACCTGTAATCC 

CAGCACTTTGGGAGGCTGAGGCGGGAAGAATCACCTGAGGTCAAGAGTTTNAAGACCAGC 

CTTGACTAACANTGGTNGAAMCCCNCGTNTTTTTTNTTTAAAAAATACC 

Sequence 2822 

CCGCGGTGGCGGCCGAGGTACTCCAATCCGGGTGACAGAGGGAAGACTCTGTCTTAAAAA 

GAAAAAAAAATC MTAGA AATCGGTTTTTATTTATTTATAGTATGTGCTTTGAGTGGTTC 

TCAATCAAATTGATTTTTCCCCCTTCCAGGGGATATTTGAAATATCTGGAGGCATTTTTG 

TTTGCCCCATCTTTGGGTATCATATTGGCATCTAGTAGGTAGAGGCCAGGGTTGTTGCTA 

AGCACCCTATAATCCACAGGACAGCACCCAAAAACAGTTACCCAGCCCAAAATGTCAGTA 

TTGTCAAGGTTGAGAAGCCTTAATTTAGATGTAATGTTAAGAGTTCAGTAAATTGGCCAG 

GCACGGTGGCTCACGCCTGTAATCTCCTAGCACTTTGGGAGGCAAAGGCAGGCAGATTGC 

CTGAGCTCANGAGTTCGAGATCACCCCGGGCAACAAGGTGAAACGCTCTGTCTACTAAAA 

TACAAATCC 

Sequence 2823 

CCGCGGTGGCGGCCGCCCGGGCNNGTACTCCTCTTGCTACCACCTTTGTTGCAGAAGATG 

AAGGGGAGAGGGAGCTTCTCCCTATGGCCTCATGGCTTCTTGTGAGACAGATCAGTCCAG 

CCAGATACAGAGCMAGCAGCTTTGCATCACCGCGGGCCAGTTGCTGATGCCCAGCTTTA 

TGTCTAAAAAAAAAAAGTGGAAGCCTCAAGGGGGATGGAGGATAGCAAGAAGAATGGGTG 

CCTTGGCCCCAGAGGCATTGTAGGGAGAGGAAGACAATGTATCTCATCAGGGTTCTCAAC 

ATTATGAGATTATCACACATCAACtATCTTTGGAGGGGCTGAGTGATTGAGTTATGGCTC 

TGACTCCTCTCTGGGGGTGGAGAGTGAAAGATGACAAANAAGGCCATCTGTCCCCTAGGA 

GACACAGTTTGCAGTATAAGACAGGACANAAGAGAACAGAAAAACAAATNCAACTGGAAA 

AAAGGGGTGG 

Sequence 2824 

AGGTACCTTAGAGCAACATGCAAAGCTTCCCTNCTCAGCAATCCCAGGTTGGGGCCCCCT 

GTCTTCCTATCGTCCTACCCGTCACAACCACCACTGCAGGCTTCTGATGCTCTGTTTTCC 

TCCTCTGTCTCAGTTCAGTTGCTCTGAGTTAGAGAGGAGCTCTCTGGGACTGGAGCAAAT 

GACT GCACCGGCCCTCATGGGACTTCCATTTCAGTGGATTCAGGGGAAAGCAGCACGTGT 

TTTTGAGAGACCTTGGCMCGCAGGCGACCTTGACGTCGACCAAAANAAGAGAGATTTGG 

AATGNTTGTTTACAGCCTCCGGTCAAAAAAAAA 

Sequence 2825 

TTAGGGCGTTTGGAGCTCCCCGCGGTGGCGGCCGCCCNGTCAGGTACCANNNCTTAGCAN 

GGAANNTGGACAACANAAGCTNTAAATCCTCTTGCATCGNCACGNTNAATTTGCACTGAC 

CAATCTGTTGG CACA GTAACTGGTTATAAGCTAAATTTCTACATTTTGGCTACAAGTATN 

CCAAATNCACCTTTTAAAAAATCCTATGTNAGATGCCATCTGGTGTTAATGATTTGCACA 

CCCCTTAAATTGAAANTATTNCAAATAAATCTNACGGATTTATATANNATNATTAATGNN 

TNTATTTTAAAMGACAATCTGANMTAACACTTCCCCTAATTGTTGTCTTAATAATGAC 

CAAGAGCTGNNGAAAAATNATTCACACTGNTACGTCGTTNTGTTGGTTTGCTCACGGGGG 

AAGGGGGGTTG 
Sequence 2826

GCG AATTT GGAGCNAAACCNCGGNGGCGGCCGAGGTACTTCGGGGAGAGNNTNTCCTNCC 

NTCATTTTGMCNCCAGCGGCCTCTTCCCCTTCCNGGGGCTGCTTGCCCTGGGAACNCTG 

GCACCTTGGGCTGNGGAAGGCTCTGGAAAGTCCTTCAAAGCTGGAGTCTGTCCTCCTAAG 

AAATCTGCCCAGTGCCTTAGANACAAGAAACCTGAGTGCCANAGTGACTGGCAGGGGCCA 

AGGGAAAAAAAAAAAAGTTGCCCTNANCCNTNGGGGGAAAAAAAAGNGCCCNGGAAACCC 

NGGGGGGNCCCCC 

Sequence 2827 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCGGAACACTTGAGGTCATGAGTCTGA 

GACCAGCCTGGCCAATACGGAAAAGCCCCGTCTCTAACAAAAATACAAAAATACAAAAAT 

TGGGTGTGGTGGTGTGCACCTGTAGTCCCAGCTACTCAGAAGGCTGAAGTAGGAGAATCG 

CTTGAACCTGGGAGGCAGAGATTGCACCACTGCACTCCAGCCTGGGTGACAGAGAGACAC 

TCCATCTCAAAAAAAGAAAAGAAAGGAAAAAAAGAAAAGAAAAGAAATGTTGAGGCAAAT 

GMTATACMACACATTTTAGATTAGCATTTGAATTAGTAAACTGCATAAAAAAGATCCA 

ACATGAGCTGACATCATCCAATCCATTGAGGGCCCAAATAGAGCAAAAAGGCAGAGGAAG 

AGCAAATTC 

Sequence 2828 

CCGCGGTGGCGGCCGAGGTACAGAAAGAAGAGTATCCATTTCAACTACTAGGTTAACTGC 
CT7TGATGAGCTTGATTTTACCTGGGTCACTAATTCCACAGAACCAATGTAGGTGTCTGG 
GCGGAGCMAATAT GCTCCA ATTGTGTTTTCTTTTGATAGATTCTTTCAACAGACAGTCT 
TTTCTTAGCATCTTCATTTTTCTTTATTTTC 

CAATGGTGACACTTCCATGGTGACGGTCGTGAAGGGGCTCAAGAACCCTGAAAGCGACTA 
AACAGGCAGGACCCCACGAGACCACCCCCGACCAAGCCGNTTCTCCACAGACGCGCGTCC 
CCCGCGTACCCGCCCG 
Sequence 2829 

CCGCGGTGGCGGCCGAGGTAC TTTAAN 1 1 1 1 I I I H H I I I I I I 1 HA I II 1 1 I Mi 1 1 I 

M 1 M 1 M M M 1 1 I N 1 M t 1 I I I I I I 1 1 I 1 I 1 1 M I I M I m l | 1 1 m i | i m | i | 

I I I 1 1 II I I 1 1 I I I I I I I 1 1 I NGGGNNGNNNANNNTTNTTTTAAAAANGGNAAAAAAAAA 

ANNGGGGGGNGGNNNNNGNNNAAAANNANNNGNNNNGGGGNNAANCCCCCNNCCNNNNNG 

GGNMNCCCCNNNNNAANNNNNNriTTTTNNGNGGNNNNNCCCNTTNTNAAANAANCGGGG 

TNAAAAAAAMAMAAATTTTNGGGGNNATTTTTTNNNGNNNCCCCCTNANCNNGGGGGG 

GNNTNTTTNCTNTTTTNANACCCCCGGGGGGG 

Sequence 2830 

CTACT TAGGGCGAATT GGAGCTCCCCGCGGTGGCGGCCGAGGTAC ) ) I I I N r GTTTTGT 

TTTGGT MINIUM IGGCTTGACTCAGGATTTAAAAACTGGAACGGTGAAGGTGACAG 

CAGTCGGTTGGAGCGAGCATCCCCCAAAGTTCACAATGTGGCCGAGGACTTTGATTGCAC 

ATTGTTGTTTTTTTMTAGTCATTCCAAATATGAGATGCATTGTTACAGGAAGTCCCTTG 

CCATCCTAAAAGCCACCCCACTTNTNTNTAAGGAGAATGGCCCATTCNTTTCCAAGTTNC 

CNNANGGGGANAMANANCNNTTNTTTTCNGGN 

CCCC CCCCCAAAMCCTTTT TNNTTTNGNNMNAAAMANAAMAMAATTTTCCCCCCC 
CCNCI I I I I I I I I I I I I II I ICACNCCAAAAAAAAAAAAA 
Sequence 2831 

CCGCGGTGGCGGCCGA TGTAC I I I I II I I I I I I I I I I II I I I I I I I I 1 1 I I M I I I f I I I 

M I M I I M 1 1 I I I II I I I I I IMCCCTTTGACTNTTTTTTAANGCCNCGGGNGGATNTC 

NTTGCCCANCTCCCANNAATGTTTGCCCTTNTAANTCTGTTCCACTTTTAGGNGGNAGCC 

ACCAGGCCTTANCCATCCCGGGTACCTNGGCCGTTTTAAAANNAGGGGNATCCCCCGGGN 

TNNANGAATTTNNAAANTNAAAGCTTNNNNGAAACCNNCCCNCNNTNGNGGGGGGGGCCC 

CGGGCNNNAAAI I I I ITNTTTNNTTNAAANANGGGNAAAAAANGGNCCCCCTTGGGGNNA 

AAAAAAANGNAAAAAAN I I I I M I I i NTNGGNAAAAAAA I I I II M I I CCNNAAAAAAAT 

NNCAAAAAAAAAAAANAAANGGGNGGGAAAAAA 

Sequence 2832 

CCGGGCAGGTACAAAGATTCCTCACTGCGTGCTAAGAAAACAGATCCAGGCCGGGCACGG 

GGGCTCACACCTATNANCCCAGCACTTTGGAAGGCTGAGGCGGGTGAATCACCTGAGATC 

AGGAGTGCGAGACCAGCCTGGCCAACATGGCAAAACCCTGTCTCTACTAAAAACACAAAA 

ATTTGCCGGGCATGGTGGCAGATGCCTGTAATCCCAGCTACTTGAGAGGCCAAGGCAAGG 

AGAMNTTGCTTTGAACCTGGGGAAGGCCGAANGNTTGCAANTGAGCTTGAAGATTCGGC 

AACNAACTTGCACTTTCCAANNCCNTNGGGNTGACNAGGANGATAAGGACCTTCCNTTCT 

NCAAAMMAAAAAAMMGGGANAAMAAAAAAAMGGGNTCCCTTTTNGGCCCGGTTT 

TTT ANAMC TTMGTGGGAATTCCCCCCNGGNCCTTGNGNGGGAATTTCGNNTNTNNNAA 

AGCTTTTTTCGAATTCCCCGCCCCANCCNTNNGNGGGGGGGGG 

Sequence 2833 

CTAATTGGAGCTCCCCGCGGTGGCGGCCGGGCAAGGTACTTTC I 1 1 i I I I I 1 1 1 1 I 1 M G 

AGATGMGTTTTGCTCTTGTTGCCCAGGATGGAGTGCAATGGTGCAATCTCAGCTCACTG 

CAACCTCCGCCTCCTGGGTTCAAGTGATTCTCCTGCCTCATCCTTCCTAGTAGCTGGGAT 

TACAGGTGCCCACCACCACACCCAGCTAATTTTTGTATTTTTAAGTAGAGAATGGGGTTC 

ACCATGTTNGGCCAGGCTGGTTTNAMCTCCTGACCTTMGGNGMNCCCCCTTTGCCTT 

NGGCCNCCAAAAAGGGNTNGGNAATNANNAGGGGGGNNACNCNCCCCCTTNNNCNNANAA 

AAAGGGGGN 1 1 1 1 I I I J NTNTGGGGGGGGGGAAANATTTTNNAAGGGGGGGNGCCCCCCC 

NNC CCCTNA NAAAAAAAAANAAANCCCCCCCCCCGNGGGGANNAAAANNNTTTNTATAAA 

ANANTTTTTCCCCCCCCCCCCCNGGGGGGGGGGGCCCCCCCCCANC I I H I I I I I I II I N 

TAGNAAGNGNNNCAANCCCCCCNAAAA 

Sequence 2834 

GGGGNMACCCCGGNGGCGGCCGCCCGGTTTGGAACCNGGGTTNAAACCCCCGGNTTTNA 
ACCNCANAACCGCAAGANAACGGGNGNAAAAAAAGGGAAACANANCAGCNGTCCAAAGAA 
AACAAAANGNGGGCAAACC 
Sequence 2835 

GGTGGCGGCCGAGGTACTGATCATGGAACTCTCGGGGAGGAAATGATGTTTTCCTTCTAC 

CCATCTTATGTTCATTGGCTGGGGCTCCTGGAACAGAAGACAGATTTACAAAAGAGAAAG 

GCACACAAATTTATGTMTATAAGTTTTACATGACATGGGAGCCTTTATAAGGAAATGAC 

CCAAGGAAATGGTTAAACCTGAGTGGTTTTGNGTTAGGTTTGATGAGCAATGAAAAGCTA 

TGGAGAACTATGATAGGAGGAGTGTGAGCTAAACGCAATGAACTGGGGGAAACT 

Sequence 2836 

TACTTAGG GCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTA CI 11 1 I I 1 I H I II II 1 1 

minim CTCAAAACGGCCTTCCTTAGTCCTGTAAAACCTGGAATGCACATAGTCCTGT 

AATGGCCATCCCAGAAGTGAATCTACATGATTATTCAAATTCAAATACCTAANAAAAAAA 

ATCACTTGAAGTTNTGCAACTTCCCAAATGCAAATTCCTGANAAAGAAACTTGACTGAGC 

CAGTTCAACTTTCTGTATTGGATTANAAATCCTATCTTGCTGGCTGGTCTGTANATTGGG 

TTGCTCTTGAGTCAAMTGANCCTTTTTTTAMGTTMGCCNCTTTTGCCCNTTNGGGGG 

GGGGCCCCNCTNGGGGGGGGNNAMAAMTAATTTNNCCCGGGGTTTTNTTTTTTTNCCC 

CCNAAAAANCTN I MINI f I CNAANCCCCCCNNAAAAAAANNGGNCCCCCCCCCCCTNG 

NNGGGGGGGGGGGGGGGTNTTTTTNTCCNCCCCCCCCCNCAAAAA 

Sequence 2837 

CCGGGCAGGTAC I I 1 1 I I I I II I I I I I I I I I J 1 1 I GGGTTTTTAGTAGAGACAGGGTTTC 

ACCATGTTGGTCAGGCTGGTCTTGACCTCCTGACCTCATGATCCACCGGCTTCGGCCTCC 

TGAAGTGCTAG GATTA CAGGTGTGAGCCACTGCGCCCAGCCCTGAGAAATAGTTCTTCTA 

ATTGTCATCCAGTTTTCATCTGAGTCCTGTTGTTCTTTGGATATGTGCCCTTCCAGAGCA 

CAGCAGGGGTTGTTCAAGTCTTCCANAAAAGCAGCTCTTGTTTCTCCTCATGTGGTGGGA 

GTGGAGTCAGAGCGTGGCTCAGGCCCCACATTCTCAGCTGTTTGGATCTGGGGACTCGAA 

GTTTCTGGTGGTTACTTCTGAAAGTCTTTTCCAGGATMTTATTCTTGCTCGGTTTCTCT 

GCATCTCTGACAGGCTGGTTTNCCTGCTTCCCCGCTT 

Sequence 2838 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC I 1 111 1 111 111 1 1 1 1 
CA GAAGAC GA TAACTCGC TTGCTGTAAGTCAGGAAATAAATAAATTCTAGGAGCCGGGCA 

ATA I 1 1 I I lAACl I I I I I I 1GAGACAGGAGTTTGCTATGTTGCCCAGGCTGGAGTGCAGT 

GGTGCGATCTCAGCTCACTACAACCTCTGCCTTCTGGGCTTAAGTGATCCTCCTGCCTCA 

GCCTCCCMGTAGCTGGGACTACAGGCATGAGCCACCATGCATGGCTAATTTTTGTATTT 

TTTGTAGAGACGGGGTTTCACCATGTTGTCCAGGCTGGTCTCAA 

Sequence 2839 

AGGTACTTTTTTTC I I I I I I I I I 1 1 I I I II I GGAGACAGGGTGTTTATCTGTCACCCAGG 

CTAGAGTGCAGTGGCGGGATTACTGCTCACTGCAACCTCGACCTCCTGGGCTCAAGTGAT 

CCTCCCAGCTCAGCCTTCAAGAGTAGCTGGGACTGCAGACCTGCACCACCACGTCCAGCT 

GCCCGGTTAA I I I I I I I CTGTCGGTTTGAAGAGGGGAGAAGGTCTCACTATGTTGCCCAG 

GCTTGTCTCAAACTCCCGGGCTCAAGCAATCCTCCCACTGTTGGCGTCCCAAAGTGCTTG 

G GGTT ACMGGTGTGAGCCACCACCACACTGGGGCTCTGCTCTGCCTTTCTGAGTTTTGG 

GTTTTCTGCTTATGGNGGGGGAGCTTTGTTCCCGTTCTTCCCCACAAAGAACCCAGGGAT 

GTGGCACMGCTTCCCTGCCCGTTTTTCCTTTAACTTCAAGTTGGG 

Sequence 2840 

CCGGGCAGGTACAACTGGAAACAGCCACCGGAGAGAGAACTGTCTCGCCTTCGCCGGCTT 

TACCAGGGTCATCTCCAAGAAGAGAGTGGCCCCCCACCTGAGTCAATGCCCAAGATGCCC 

CCTAGAACACCAGCGGAAGCCTCCTCCACTGGGCAGACAGGCCCTCAGAGTGCTCTGTAG 

GAGCTGTAGACTGGGAAGAGAGGCCAGGCGTGGTGGCTCACTCCTGTAATCCCAGCACTT 

TGGGAAAGCCAAGGTGGGCCTTGATCACTTTGANTCCCAAGGAAGTTTTTGAGACCAGCC 

TTNGGCACCATGGTGAAMCCTTTGTCTTTACCAAAAAATACAAAAATTTAGCTGGGTGT 

GGTGGTGCACACCTGTAGTCTTAACTATTGGGGGAGGCTAAGGTAGGGATTCACTTTGAT 

TCCCAAGGAGGCGGAGGGTTTrGCANTTGAGTTTGGCANTTCAACACCCCTTGCANTTNC 

AGCCTTGGGGTGGACAAGCTTAAAACCCTTTNTTTTCAAAAAAAAAA 

Sequence 2841 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCCATGTCAATGGAGTAAT 

GCTCCCAGGAGAGTTATGGCTGCTCTGCTATGTCATGCAGGTTGTTAGGGAAGTAGGGGA 

AAGCTAGAAGTTACAGGCCTTACCCAGCTCCCATGCAACCCAAAAGGCCAGTCTCACTCC 

CACCGTGCCCCACCCTGACAGCACCMGTTTGTTTCCAGGCAGTGAGTGAGCAGGGCTGA 

GMCTTGTCCCAGGCTACCAGCCTGCCAGCTGAGAAAGAAAGCATGGCTTTTGCATCTTT 

TTGCCTGTTGAGTCTGCGCACTGGATTTATGCCCTCCCTCGAGTTTTGGCCGGGAGATTC 

ACGTTTTGGTTCMGTGGTTACCAAAGTTTACTTGGGGAGGTTTCCTTTTCCTTTGGGTC 

TT 

Sequence 2842 

CNAA TT GGAGC TCCCCGCGGTGGCGGCCGCCCGGGCAGGGTAC 1 1 I 1 1 1 I M 1 1 1 I I I M 
TTTTGTAl I I I I AGTAGAGACAGGGTTTCATCATGTTGGCCAGGCTGGTCTCGAATTCCT 
GACCTCAGGTGATCCGCCTGCCTTGACCTTCCAAAGTGCTGAAATTACAGGCATGAGCCA 
CGATGCCCAGCCTGAGGAACAGATTTCTATATGGCAAATAATAAAGGCAAATAAAAATTA 
ATGCTAAAATAGAATGAGGAMGTATTNTTTTNTTCACCAGAATGGTTGTNANCANAAAT 
GNTTTGCACCAGGGTGGNNTTNAAAAAAACCCCNCNCTTNNATTTTGGGTTNTNCCTGGG 
GGGNTNTNGGGNNCATNAAANACCNTTTTANNNTNTTTTTTTNANNNAAAAAMNTT^ 
Sequence 2843 

AGGTACTTTAACTTC I 1 1 1 I I I I I 1 1 I 1 I I GAGATGAGTTTTGCTCTTGTTGCCCAGACT 

GGAGTGCAATGGCGCAATCTCGGCTCACAACAACCTCTGCCACCCGGGTTCAAGCGATTC 

TCCTGCCTCGGCCTCCTGAGTAGCTGGGATTACAGGCATGCATCACTATGCCCAGCTAAG 

TTTGTAi I \ I ! AGTAGAGACGGGGTTTCTCCATGTTGGTCAGGTTGGTNTCGAACTCCCG 

ACCTCAGGCGATCTGCCCGCCTCGGCCTCCCAAAGNGCTGGGATTACAAGTATGAGCCAC 

CGCACCCAGCCTAAGATCCMGATTCTTATGTTITCTCTCCTTGCCTTTTGGAACCTGCC 

CGGGGCGTGAGCGGCCCGCCGGGCCAGGTACCAACNAGAAACNCAAACACCTTGCNCAGT 

NTNTCNAGGCACCNTTTCCAAAMCCAAAATTTTGGANAAGGTGAAACNTTAACTTNATA 
TNGGCTTAAATTTTAA 
Sequence 2844 

ACCGCGGTGGCGGCCGAGGTACTTTTTTTNTTNTTT A I I I I I I i I ATT A I 1 1 I I I NTGGG 

GACGGAGNGNCCCTCTTGNTGCCCAGGCTGGAGCGCNATGGCGTGATCTTGGCTCACTGC 

AGCCTTCGNCTNCCGGGTATCAAGTGATTCTCCTGCCTCAGCCTCCCNAGTAGCTGGGAT 

TACAGGCATGCNTNNACCATGCCCAGCTAATTTTGTATTTTTTAGTANCAAAACCGGGGG 

TTTCACCCATATTGGGTCAGGCCTGGTCTCGAACTCCCAGACCTCAAGGGTGGATCCCGC 

CCACCCTCGGCCTTTCNCNAAACCTGCTGGGGAATTACCAAGGCCGTTNAAGNCCAACCC 

GNCGCCCTNGGCCANGGGGGACCTNATACTTCTTTTTTTAAAAAAAAAGACATTTT^ 

GGGGGGCNTCACCACCCNTTATATTTNAAAAATTAGGTTNCCTTGCC 

Sequence 2845 

CCGGGCAGGTAC 1 I 1 I 1 I I I I I I I M 1 1 M 1 I I 1 I M I CCTCATCTTTCTTTGATAATGN 

GNGMGCCAAGGATAGGCAGGAACCCCAAGTTCCNTNTNTCCTGGGATTGTCACAAAATT 

CCCCCTNTCCCCATCGCATCTATTTTTTGTTTGTTrTNTm 

ATACCATGGGCTTTANAGGAATCATCGCTCACTGNGGGAACCTTGCCCCCTCCCCAAGCC 

ACCCCACTGNGC CCATA AACGTGCTGCCTGTAACCAATTGTTCCTGTTGAATAACAATGC 

GAGCTGAGGGGCTTTTTNTGCCTGAGCTGCAMTANATTAGGCTGCTCCCTTTTATGTGN 

GCAAANACATTACCCAAAAGC 

Sequence 2846 

CCGCGGTGGCGGCCGCCCGAGCAGGT ACCTCAGAATGTAACTGTATGTGGAGGTCTTTAA 

AGCAATGATTAAGTTAAAATAAGGCTATTAGAGTGGGGCCCTAACTCTAAATGACTGGAT 

TTATATGAAGGAGCTAGAAGAAGGACAGACACACCAGGGTCCTGTGTGCACAGAGGGACA 

ACCATGTGAAGAGGCAGCAAGAGGCCAACTGCAAGCCGAAGAGAGAGGTCTCAGGGGAGA 

CCAACCCTGCCAGCACTTTGATCTCAGGCTTCCAGCCCCCAGACCAGTGATAAAATAAAT 

TTCTGCTGTTTMGCCACCCTGTCTGTGGCATTTTGTTCTAGTGGCCCTAGCAA 

Sequence 2847 

CCNGGGGCGGCCGAGGTACCATNANGCTTGCAGGGGCTGAAGCATGGTTTGTCCANAACC 

CCAACCACCAGGTCTATCGNNCTCTTTCTGNCACCTTTTrNCTCTTTrTTCCTTCTNCCC 

TTGCACCTGAGGNCCTGGAAGGCCTTGATGAGGCCCAGCAAACAGGCATTCTCACAGCTG 

GGTTTATAGTCTTTGGGCCCCTTACTCAGTATCCTGGGAACCCTGGGCCAGGAAGTTAAC 

AGTGGTCAATCANAAANTNCTGAANAAAATCCCCCTCCCCCTG 

Sequence 2848 

CCCCCCGNG CNGGN NCANNTTTTGGGCNNTTTTTGGGT1TTCAGNANGGTTNGTGGAGNA 

TCCCCNGNGGTTTTTNNNAAACCCCCCCNCNNAACANAGACCAAANGGGGGGNNGNGNAG 

GGGGGNGGGGNCNNTNAGAGAGGNGGGGCGGGGGCAGCGGGGGGGNAAGNNGGNNGNNNA 

GGGGANGGGGAGGGGCNNNCCCCCANACNNGGAGGNGGAAGGGGAAAGAACGGCNAGGGG 

NAAAGGCCGGGGGGCCACCNAGCNCNGGGGGNNCNACNGANGGGGGAAGGAACGGGGGAA 

ACCAAAGGCCCNNCCCCCACCCAANAANGGCCNGGGAAAAACCCACANNCNGGGNAAGGA 

NAAAGGGGCCNGANGGCGAAGGANGGCNCCCAAGCAGGAGGCGGGNNGGGGCCGCAACNG 

CCCCNAAAAAANGCCCCCGGGNNNACCCCAAGGGGGGANGGGGGNCANAAAAGNNGGGGG 

GGAAGGAACNGGGCCNNAGGGNGGCCCCCNGNNAGANGNGNNNNGGCGGAAANCCCANGC 

GCCNGGGGGGNNNANCCAAGGGGGNGGGA 

Sequence 2849 

GAGATGCAGTCGATTNCATACCTANTGGGTCCCANTCCTNNNTNNGGNCNGTTGNGAAGC 
CGGATAGTGACTGAGATCACTGGGTAGACCTTGTCCACCTTGGCATTCTTGTCTGCCAAG 
GTCCATGGCCCATGGGGATGGGGACAATTTGAGTGGG 
Sequence 2850 

GGCAGGAACTNTTCTTTNNTCTTTTTTTNTNAAGTNAGNGGTAATTTAAAMTCTGAMT 
ATAGGCTGGGCGTGGNGGCTTACGCCTGNAATCCCAGCACTTTGGGAGGCTGAAAGTTGG 
GGCNGGATTCATCTGAGCTCGGGAGTTCAGGGACCAGCCTGACCAACATAGAGAAACCCC 
GNCTCTACTAAAAATACAAAAATTAGCCANGGCGTGGTGNGCACCATGCCTGTAATCCCA 
GCTGTTTCAGGGAAGGCCCNANGCCANGANAAATTTGCNTGGAACCCCGGGGAGGGCCGG 

GAANGTTTGTTGGATGGAGCCCCGAAGATCANCCMTTTGNNNANCTNNCAMCTTTTGG 

GNTTGAACAAAGAAGCCNAAAAANTTCCCCAATTCTTTCATNAAAATACAANGAAAACNT 

TANANAAAMTTTTTTGAAAAATTAATGGGGACCCCNTGGGAAANGAGGCCCNNTAAAAA 

AAAAAAAAAA 

Sequence 2851 

CNANCGGAAGGGGCCCCNGGGGGGGGGGGANAAGGGCAAAACCCNGGNCTNAGCAACCAC 

ANGGGGGGGCCGGAGCCNNGGGGGGCCAGGACCAGGGGAGAGGGAAGCCCCAGCCCNGAG 

GCGNNCGCACCACNCNNAGCACCAGCGCACCAANACNGCAGACGAGGAAGGAAGCACAAG 

CNCCCCACNNNACAAAGGGGAAACCGAGGCGCGGGNAGCGCGGNCCCGGGGACGGCCGCA 

CAAGAACNAGGGGGACCCCCCCGGGCGGCAGGAGNGCCGGNNAGAAGCCAAAACGAANCC 

GGCGAACCNGGGAGGGGGGG 

Sequence 2852 

AGGTACGCGGGGATGCGCAGTCGTGAGTCCTCTTGTCCTTGAGCGTCAACCTTCTTTCCC 

TGMGTGGCTGGGGTTCCTGTTTCCTTCTTTGATTGACAACTTGTGTTAACCCTCGCACA 

TCTCTGGGCCAATTTTTGCTTGAAAATGGCAGCTCCCGAGCAGCCGCTTGCGATATCAAG 

GGGATGCACGAGCTCCTCCTCGCTTTCCCCGCCTCGGGGCGACCGAACCCTTCTGGTCAG 

GCACCTGCCGGCTGAGCTTACTGCTGAGGAGAAAGAGGACTTGCCCGAAGTACCTGCCCG 

GGCGGCCGCTCTAGAACTAG 

Sequence 2853 

CGGNGGCGGCCGCCCGGGNATGGTACCCTCTGTCACGGCTTCCTTTTNCTGGAAAAGGGA 

ATTTCCCAACCCCGGGTGAGGCAATGCCCCGCCCTGCTCCGTGGGCTGCACCTGCTGTCT 

GTCAAGCCCCAATGAGATGAACCCTGTACGCGGGGGCCTGGGATCTCAAAATGGCGGCCC 

CGTGCGGAAACAGCGTNTGGGAGCANNCATGTTGCCTNCTGAACAAAGCCGTTGAAGATG 

AAGAATGGGCAAAATCGCCCCATACGGAACAAGCGCANCCTNGGGAGCCCGATACCTGGC 

NNGCGAACACCAAACGGGAGAATTTCGCCAATATGGATGTGACAGCGGTTCCCATTAAAG 

CGGTGATAGGGATTTTT 

Sequence 2854 

CCGGGCAGGTACGCGGGGTGGGCATTCTGGGTAACAGAGCTATTTACTTCCTGCGGGTGC 

ACAGGCTGTGGTCGTCTATCTCCCTGTTGTTCTTCCCATCGGACGAAGATGGCCCTGGAG 

ACGGTGCCGAAGGACCTGCNGGCATCTGCGGGCCTGTTTGCTTGTGTTCGCGTGGTCAAG 

ACTAGTACCACCAGNTTTAGAATATGATGGCTTGTNGACAACATGTTGATGTCATATGNT 

ATCAAAATCGAAACGGNGTCANCTCCGAAGAAGGATGGGTTATTATTGACTTGCACCTTA 

GCCTNTTCGCTTTTGCATGGGGATANCCATTTNGCTCATTNATGAAGTTNCCATGTANTG 

TACAGGCCTTGGGGrvrrCNTTTCAAAGGTNTNNNMNCTGCAGNTCCAGTTMAACCTTT 

Sequence 2855 

CTTTCATGTGATCTTTGTGGCAGTGGGACAGGAAGTAGGCGCGGGCCCTCAGGTTCTCCC 

TATCGAAGCGGTCTATGGAGATAGTTGGATACTCGGCCATCTGCCCCTCGAAAGAACTCA 

TAGCGCCGTCGATCCCAGAGTCCGGGACCCCAAAACCGCAGCTGAAGCCAAGGCCAGCCC 

TGACNCGCCCCCGCGTACCTCGGCCGCTCTAGAACTAGTGGGATTCCCCCGGGCTGCAGG 

GAAT TNGATA TCAAGCTTATCTGATACCGACCGACCTTCNAGGGGGGGGCCCGGTTACCC 

AAGCTTTTTTGTTCCCTTATAGTGGAGGGTTTAMTTTGCGCCGCTTGGGC 

Sequence 2856 

GGGCGATTTGGAGCAAACCCCGGGGGCGGCCGCCCGGNTTGGTACCAAAANTNCAAACNA 

CCANTTTNGAANCCGGCGNNGACGNNNGCGGNCCNAGCTACTCTGGAGGCTGAGGNGGGA 

GGANCGCTNGAGNCTGGGAGGCAGAAGTTGCAGNAAGCCGAGATCATGCCACTGCACAAG 

CTAGGTG ACAG AATGAGACTCTGTCTCAAAAATAATTAAAAAGCCTCTGCCCCAAACTCG 

TTAAMGATTTTATAACCACAACTGCTGNTTCTGNGNAGATGCATCTGCATGCCCAGGAG 

CAGTAAATGCMTMMNCATTTGGNTATACTTTGAACACAAAATAAACGGGTGAGGCTT 

TTACTTTCGAAAAAAAAAAAAAAAANAAGGGNCCTCGGCCGCTCTAAAACTAGGGGGANC 

CCCCGGGCCGCANGGAAATCGATA 
Sequence 2857 

ACATTAATTTGCGTTTGCGCTCACCTGCCCGCTTTTCCAGTCCGGGGAAACCTTGNCGTG 
CCAGCTTGCATTTAATTGNAATTCGNGCCCAACCGCTGCNGTNGGAGAGGCCGGCTNTTG 
CCGTATTTGGGGCGCCTCTTCNCGCTTTNCTCGGCTTCACTTGACTC 
Sequence 2858 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTGTCCTCAGCTTGGG 

CTTCTTCCTCCTNCATCACCTGAAACACTGGACCTGGGGGTAGCCCCGCCCCAGCCCTCA 

GTCACCCCCACTTCCCACTTGCAGTCTTGTAGCTAGAACTTCTCTAAGCCTATACGTTTC 

TGTGGAGTAAATATTGGGATTGGGGGGAAAGAGGGAGCAACGGCCCATAGCCTTGGGGTT 

GGACATCTCTAGTGTAGCTGCCACATTGATTTTTCTATAATCACTTGGGGTTTGTACCTG 

CCCG GACACATCCAGTAGGCTAAGGGGATGCTTTCCTTTTCTGGGGTTTTTCGGGGGGTT 

TTTTGGAGCGGGGAGAGGGATGMNGAGGTGCTCCCCTTMTTTCTTTATTGAGAATGAT 

GCCGTGGATACTTGAATTTAAGCANTTGTCACATGGGCAGTGTCCTACCTGGGG 

Sequence 2859 

AGGGCGATTTGGAGCTCCCCGCGGTGGCGGCCGGGCANNTACTTTGCGGTTTTTGGGACT 

TGATTTTNGCAGAGGGATCGGGCACTGAAGGTGCAGTTCTCAAAATCACACCTGNAGGCT 

GGCTCCTCGCTGTGGGTATCCAGGTGCTTCTGGAGGTCAATAAGATTCTTGCAGCTGTAG 

TCACAACAGTCACATTTAAAGGGCCGGTCCTCACTGTGACGAAAGCGCATGTGGTTGCGG 

AGGGAGGAAGGCAGCGGGCAGGTCATGTCACACAGAGGGCACTTATAGTGATTCACATGG 

TTGCGCA 

Sequence 2860 

ATGCGTTGCNGCTCACTGCCCCGCTTTCCAGTCGTGGAAAACNCTGTTCGTGCCCAGCCT 
GCATTTAATGGAMTCGGCCAMCCGCCNCCGGGGGAGGAGGGCCGGTTTTGCCGTATTT 
GGGGNCGCTTCTTCCCGCTTCCTTCGCTCAACTGGACTTCGCTTGCGGCTNCGGGGTTCG 
NTTCCGGGCTTGCTGGGCCGAGGCCGGGTATTTCANCCTTCAACTTCAAAAG 
Sequence 2861 

CCCTAGGGCGTTTTGGAGCTNCCCCCGGTGGCGGCCGAGGTACI I I I I I I M II I I I I I N 

CCCGGAGNTTTNTAANAAGATTTATTTAGCAAAAATACATATAGCCATTATTGCAAGACT 

TAMTGAGA TGNT AMTGTTCAACCCAATTTTCTTTCCTGGATMGTTTTTCTTTCATAT 

CCCTGTCAGTTTTGAAAACATAATACCAGAAGAAGGGGGGCCCAATTCCACAGAGAGCTC 

CCAAGMTGAGTTTCTGGGAGTGAGTCTGAAGTTGAGATAAACCTTTGCTGATCTTGCTT 

ACGTTCAATGCATCTGGGCAGCGTCTTTGATGAGCCCTGGCGGTTAGGCTGGTGGCACTG 

MGCAGGCCTCCAGGGTCTCCTGTTTAAGCAGGATTTTAAGGCCAACCCTGC 

Sequence 2862 

CATCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTTGGGA 

GGCTGACGCAGGAGGACCGCTTGAGCTCAGGAGTTCAAGACCAGCCTGAGCACCATAGTG 

AGACCTCATCTCTACTAAAAAAAAAAATAAAATACCAGGCATGGTAGCATGTGCCTGTAG 

TCCCAGCTACTCTAGTCCCAGCTACTTGGGAGGCTGAGGTGAGAGGATCACTTGAGCCCA 

GGAGATCGAGGCTGCAGTGAGCCATTATCACGCCACTGCACTCCAGCCTGGGCAACTAAG 

CMGACCCTGTCTCAAAAAAATTTTAMAAATTTAAAAAATAAGAAAATCCAAGCTAGGT 

TGAAATCTGAATGTTGAGCAGNTCAGTGAGGCACAAACTTAGCTTAAGAAAGTCAACCCT 

GCCCACTTGCCATTTTGAAGGTTATTACTAGCCAAAATTACN 

Sequence 2863 

TA TAGG GCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGAAAATTAGCAAGGAGA 

CATTT TCTG CATT GTGAGAAATCMCATAGACACCTTAAAGACCCCTTTGAGAGTGTGGC 

(Mil GMCTTTTCAGATTTTGCTCAGTGACCTGCTAACACTTACGTGAGAGGCTCCAGG 

TGTAAATAGAATCTAATGGCAGMTCTGTAAGTGTAAACAAGCATCTTAGGAGTGAGAGA 

TCAAGACCACAAAATGTCCAGAGCTATGACCACAGCTATACCTACCCATAAAATACGATA 

CTGGAGTAGGGTATTTTTGTCTTTTTTCTTACCTAAGAGCTAGCTAATCAGGACAGGTGA 

TGGCAGGTTCTGGAGCTCTACCAGGGCAGGTCTATTTTCTTTTTT 

Sequence 2864 
AGGT ACAGCTCTTCTCTTGTCTTTCTGAGCTCTTACCAGAATTGCCTTCCGTGTCTTTTT 

TTTTGGAGATGGAGTCTCACTCTGTCACCCAGGCTGGAGTGCAGTGGCGCAATCTCAGCT 

CACTGCAACCTNTGCCTCCCTGGTTCAAGCGATTCTCCTGCCTCAGCCTCCCGAGGGGCT 

GGGACTACGGGTGCACGCCACCATGCCTGGCTMTTTTTTGTATTTTAGTAGCGACAGGG 

TTTCACTGTGTTGCCCAGGCTGGCCTTGAACTCCTGAGCTCGGGCAATCTGCCCGCCTNA 

TTCTCCCAACATGCTAGGATTATAGGCATGAGCCACCACGCCCTGCGGTGTCCATATTTC 

TAACAGCAGTCACTTCAGAGCCATCTAGGTTTCTTCTAACATGCACCTAAAAACTCTTTG 

GCCTNTACTACCTAGNTNTGAAACCATTTT 

Sequence 2865 

CGGCCGCCGGGCAGGTACAGNTGCATCANCTGCTCGTAGGACATGTCCAGCAGCTGGTCG 
AGGTCCACGCCGCGGTAGGTGAACTTGCGGAAGGTCCGCTTCTTCTTCTGCTCTACTTCT 
GCCACCCGCGTACCACGGCTATCCTTATAGCTTTTTAAT 
Sequence 2866 

CGCCCGGGCAGGTACTTTCCCCTCCCCCAGAAAAGCAGGACTTGCCCGCTAAGGGTGAAG 

GACCAAGGCAGGCGTCCCTGAGTGGTCTGACACCTTTGAAACGTGAGTGAATAATCAGAG 

AGGTGTCCCTGCAATGATTAAACACCAAGGAAAGGCTGCCTTCCCAGTCTGTGACCAGCG 

CCAGAGTTTTGGGTCCACGGATAAAACGTGTCTCGTTTGTCTCTACCAGAAAATGAAAGG 

AATTGAAATTAAGAGAAGGGAGAGATTAAAAAAAAAAAAAGAAAAAAAAAAAGTACCT 

Sequence 2867 

CCGCGGTGGCGGCNCGCCCGGGCAGGTACTTTCCCCTTCCCCAGAAAAGCGGGACTTGCC 

GCTAAGGGTGAAGGACCAAGGCGGGCATCCCTGCATGGTCTGACACCCTTGAAACGTGGG 

TGAATAATCAGAGAGGTGTCCCTGCAATGATTAAACAGCAAGGGAAGGCTGCCTTCCCAG 

TCCGTGACCAGCGCTGGAGTTTTGGGTCCACAGATAAAACGTGTCTCCTTTGTCTCTACC 

AGAAAATGAAAGGAATTGAAACTAANAGAAGGGAGAGATTAAAAAAAAAAAAGAAAAAAA 

AAAAGTACCTCGGCCCGCTCTAGAACTAnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTGTTCCCTT 

TAAGTGAGGGTTAnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnAGCTGTTTCCTGTGTGA 

AAATTGTTATCCCGCTCACAATTCCACACAAACATACGAACCCGGGAGCATAAAGTGTAA 

AAGCCTGGGGGTG 

Sequence 2868 

CCGCGGTGGCGGCCGAGGTACAAATTGTCGTTTTTATTCCTCTTATTGGGATATCATTTT 

AAAMCTTTATTGGGTTTTTATTGTTGTTGTTTGATCCCTAACCCTACAMGAGCCTTCC 

TATTCCCCTCGCTGTTGGAGCAAACCATTATACCTTACTTCCAGCAAGCAAAGTGCTTTG 

ACTTCTTGCTTCAGTCATCAGCCAGCAAGAGGGMCAAMCTGTTCTTTTGCATTTTGCC 

GCTGAGATATGGCATTGCACTGCTTATATGCCAAGCTAATTTATAGCAAGATATTGATCA 

AATATAGAAAGTTGATATTCAACCTCACAAGGGCTCTCAAAGTATAATCTTTCTATAGCC 

MCTGCTMTGCAAATTAAMCATATTTCATTTTMCATGATTTCAAAATCAGTTTTTCA 

TACTACCCTTTGCTGGAAGAAACTAAAAATATAGCAAATGCAGAACCACAAACAATTCGA 

ATGGGGTAGAMCATTGTAAATATTTACTCTTTGCAMCCCTGGTGGTATTTrATTTTGG 

CTTCATTTCMTCATTGMGTATATTCTTATTGGAAATGTACCTGCCCGGGCGGGCGCTC 

TAGAACTAGTG 

Sequence 2869 

GCGMTTGNAGCTCCCCGCGGT(3GCGGCCGAGGTCTMTTTGAATTTGTAATGAGTCTGA 

TGGTATATTTCAATTTTTTGCTTTGAGGGACTGGCTGCTACATTGCAGMTATCTTATAT 

CCCTGACTGCTTTCCACTAAATGTCAGTGGTGACCCCAATCCAATATTATGACAACTGAA 

CATGCTTATGCATCCCTCATGCCTTTATTTTTTATTTTGGGAMTCTTTCAGCT^ 

TTTGCTGATATTTATGTGATTCTTTGTTCTGCAATTCAAATTTCTGGGAGCCAAACAGTC 

TCCTTGGTTCAGATTACTGTTTTTTGACTAGAGCTTCTCGCTTCAGATTCTGTCATAAGA 

TTATGGCTTAACCTATGGTTGTCCTTTGATTTGGTGCCATATGAAATAAAACATTATTTT 

CTATGGCTATGTATTAAGAATTTTGTGCAATTCTGTTTTTCTTAGAAGGCTGAGGGTGTG 

TTGTCAGACACCATGACTGATGTGACAGGTGTATTTTATTATGC 
Sequence 2870 

ATAGGGCGMTTGGACTCCCCGCGGTGGCGGCCGAGTCCTACCCTTTACTTTTTCCCCAA 

GACCATCTCAGGGTGGAGCATTCTGTCTAAGAGAAGAAAGATAAGGAGGCTCCCACCCAC 

CTCTCCCAAGAGCAGACATTAAACATCTTTGTGCTTTGAAGAGAGTGAATTTGGATAGTC 

TTGTGATTCTCAAGACTAACTTCCAGAATTATACTTTAACCCCTTCCAGATATGGTCCGC 

CTTTGGCATTGTGTGTACCTGTGATGGGGCGTGTGGTTTCCGGTTGTCTCACCTTTAATT 

GTCMCCTCCAGTGTATGACTCTAGAAATATGAGGAAMGCTTTTCAGTTTTTAAAATTG 

CCATTTAAATTTAGTCTATTAAAAACAAACCTAGAGGTCTTGGGTTGCAGTTGATTTCAG 

AGTATATTAATTTAGTGGGTCCCNAAAGTATTACATNTATTTATATTCTGGAATGAAAAG 

G 

Sequence 2871 

CCGCGGTGGCGGCCGAGGTACTCCTTCGTAAACCATGGAGAGCCAGCCCAATGCACAGCA 

GTGGATATCATCTTTCTCAGAGTCCAGTATCACAGAATCACGACTTTGTCCAGCTGCAGG 

TGCCTGCAGGTCACACTGGCTAACTACTTCTGTGATGGGCTCTTCTTTCTGAGGTTCTGC 

CMCTTGTCTACTACATAGGGTTGATCATCCTGTTCAGGAAATATTTCTTTCATTTGCTC 

TGAGCTTAATATTGTMTTTGTATTTGATCTGCTGGGTCTTTGGAGTCAGGACTGGTTTT 

ATCAGCAGTTTGATCTTCTGAGGTCTGGTATGTAGTTTGCTGGCCCACAGAACCTTCACG 

TGTATTCACAGCCTCAATGCCATAAGGAAACTCTTT 
Sequence 2872 

CCCTTAGCGTGGTCGCGGCCGANGTACAAGGTGAACCCTGAACATTCTGTGTCAGAAAGC 
TAANG AAGTG CCC AG AG AATG ATAAATCAATCAAAAGAACAAAGCG ACAAG CATG AA AGG 
CCAAATA TGAGATMTTTGGGAATTTAAAAAATGAACATAATGGATCCGTAACACATTGA 
A I I I n lAMATTCTTGAGCCTGTMTGATACTGGAGGAAGGAGTCTTTTGTTTAC 
Sequence 2873 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACGCGGGGTGGGGATATTTTGTCTCACAGATTG 

TAAGAAAGGGGTATGGAAATCCCCAGGCAACAGTGTGCTCCTCAGCTTGCTGAAACAGAC 

CAAAGACTATGTTCTMTCAAACCTTCCAGGMTCTCATGGAMTTTCATTTAATGCCTC 

TCCAGGCACTTTTCTGAAAGCCCCCACGTTAGGGATGTCTTGGCTAAGACATCTCTCATG 

GTATCCACAGCAACCCTGATGAAGCTCATTTCTGGAGAAGAAGAAAGTCTGTCAACACCT 

CTGCTAAGTCATCATTCTCCCATNCTCACTGCAGCAGTTCCTGGAATCTCATGAAGGGAA 

GCTTGGGACCCACCGCACACCCTNTGCAAATACCTCACAAGTAAACTTGGNAAATGCCG 

Sequence 2874 

CCCTTAGCGTGGTCGCGGCCGAGGTACAGTCCTAGCCACAGTAGTAATCACCACTGGCCT 

GACTGAGCCCTCACCCTTTATACAGTGTCTCCtGCCACCCTCCTGGGAGAGGCTGTTCTC 

GGCACAGCTGGCCTGGGGTCACACAGCTGGTAGGTGTAAAGCAGGCATTGGAGTCCAGGT 

AGTCTCACTCCGTAGCCTGTCTCTTTAGCCACTGGAAATGTAGAGCAAAGCGAGAATTGT 

CCAAAGAGATAAGCTAATAAAGAGGAAAACAGGCTGGGTGCAATGGCTCATGCCTGTAAT 

CCCAGCAGTTTGGGAGGCCAAGGA 

Sequence 2875 

CCCTTAGCGGCCGCCCGGGCAGGTACATCACCCTGCTGAGGGACATCCAGGACAAGGTCA 

CCACACTCTACAAAGGCAGTCANCTNCNTGACACATTCCGCTTCTGCCTGGTCACCAACT 

TGACGATGGACTCCGTGTTGGTCACTGTCAAGGCATTGTTCTCCTCCAATTTGGACCCCA 

GCCTGGTGGAGCAAGTCTTTCTAGATAAGACCCTGAATGCCTCATTCCATTGGCTGGGCT 

CCACCTACCAGTTGGTGGACATCCATGTGACAAGAAATGGGAGTCATNAGTTTATCAACC 

MCAAGCAGCTCCAGCACCCMGCACTTCTACCTTGAATTTCACCATCACCAACCTACCA 

TATTCCCAGGACAAAAAGCCCAGCC 

Sequence 2876 

CCCTTAGCGTGGTCGCGGCCCGAGGTACAAATAGAGCTGGAATGATGATGATCACAATGA 

AGATGACAACAGTGGTGGCAATGACACGGCTGCCACTACTTGGAGTGGCCTCGGAGTTTA 

ATATCAAAGGCTGGCCTGGGTTTGAGCTGGTTGAGGGAGAAGTGAGTGTATTGCATACAT 

GAGAAAACCTiGAGCAAAGGAAAGATGAGACTCAAATGGCAGCACTAAGACCATTTTGCAC 

TGGAGTATCAGATATATTTTGAAMGMGACTGTAGAGTGACTTTTTGAAAGACATGGGG 

GATTAAGCAGGTAMGAGAAGCTTGMGTGGTATTTAGAGATGCTACATTTAGTGACTTC 

AGAAGGTATAAACTGAAGCAGCAATGTAGATTAATATTCAAGATATGGGGATAAAAGGGT 

GACTAAAGTGGGATCCAAAGCAAATGGGTGCCCGAAAAACCANTGAAAGAAGAACTTGAT 

GGAGGCTATGATCTMTTTACATTTMGGGGAATGAGGAAAACCAGTGGGMGMTTTGC 

CACCCTTGMAMTTAAATACTGGAAGTTCTGACCCAAAGAAATTAGGGTTAAAAAAAAG 

AMTT AA AAN G CCTTTCTTTTTG G A 

Sequence 2877 

CCCTTAGCGTGGTCGCGGCCGAGGTACCTCCTTCAGTGAGTTGTTGGCAACTAGGCAGTG 

AAATGAGGGAGTGACCGCTCATGGCTCATGAACTTTGCAAGTTGTGCAACTCAACTGTCC 

CACCTTGAMGGGCTCAGGMGGTMAGAGTGTTGTTTTGGGACACAGGGATTGTTGCAT 

TTMATCTTTGTGACTGTGGTTTGGTCTCAGACTGGTTT1TCATGCATMGTTCTTCTTA 

CCCTCAGGCATCTGGGATTTTACCTGTTTTTCTTCCAGTGGTGTGCCCTCTCCGCCACAT 

TGTGGMGCGTGCTTTCTCAATCCTTCAGGAAGAAATTCAAAGACTTTNTTTTTTAA 

Sequence 2878 

CCCTTAGCG TGGT CGCGGCCGAGGTACATTATTTAGTGTAATGCCTAATGTTAGTCCTCT 

TTATAGCATAT7TTATACAATTTCTATTATTTAACAGGCCTTGGGMTTCAAAGATGAAA 

TGTGTGTCCCTCAAGTAACTCACTAATCTATTGTTATAAGTAGATTGTAATTCATGAGTG 
CTGTAACAGCTATCCTATGACAGCACTGAAGAGGATGTGGACTTCCTAGATGAATATGTA 
TTGAGTTGTAGGGGAGTAAGAATTCACCTACCAGTGAACTAGTGGACCTGAAGTGGTATT 
CCAGATTG 
Sequence 2879 

CCCTTAGCGTGGTCGCGGCCGAGGTACAGTTCTCAGTTTTTCTTATAGGAGAAATATGGT 

ATATGTTTATMGAATCTTTTATGAGATTATAGATTTCAATGCTGTGGATAGTGTCTTGC 

ACCCAMCMGAMGTCCATMTGGMTGATCTTCCCTCAGCTTCCTATCGATTTAGTTA 

CCTCTTGAMGCACAAAAATTAAMCATTGCCATATGTTGMTTTTTAAAAAGCACTTGG 

AGTGAGCGMCATTTCCTGATAMTGCCTTTTAGAGATAGGTTCTTGATATTCAGACATC 

TGCAGA MTGTT CTGGTTCCCAMGTCATTTCACTTCGAAATAAAACACAGCTCCTrCAA 

ACAGCACTTTTTCCACATAAATCTAAGTTGCCTCTCCCTGTGGACATTCAGAACTGATAG 

MCAMCACTACTCTTTTGMTTTGATGGTTCGTGTCCTTTAAAGTGTTTGAGGACCTAT 

GCAGAGCCTGTACACTTGGGGTAGTACCTGCCCGGGCGGCCGCTCGAA 

Sequence 2880 

CCCTTAGCGTGGTCGCGGCCGAGGTACAMTTTATTAAGCTCTAGGTTCATCAGATGACC 

AGGAAAACATAGACTATCAGGACACAGACAAAGCAGAACTGCAGAATTATAGAAAATTCA 

AGGTCTCATGGAGGAAAAGCAGTATCTACAATAAGAAAGAGACCCCAAACAGCACTCTAG 

ATTTCTT TMGA GGGATTATATTTCCTCCTTCCTCCCATTCTCCCTTCATTTCTTCCTAC 

CTCCCTCTTTTCATCCTCTTCTCCTTTTATCTT^ 

CTTAGTTACTACTATGTTCTAAGCACTG 

Sequence 2881 

CCCT TTGAGCGGCCGCCCGGGCAGGTACAAMTTGCTMTTTTATTCCAGTGATGATTAT 

TTTTMCTTTGGTGATTACGTATCAAATTAAGGTAATTCTATAAAAATCTACAGCCAGTA 

MCTGAAACTATAGCTAAAACTGTAGGTACTGTTTTTATATTTTCTTAGATGCATGTTGT 

TTTATTATMTAGTTTMTGT AGTGGT TGAATGGMTATGTGGTTTTCTTCTAGAATACC 

TTATGTGATTTATTCT ATGAC TTTTTTCCTTTGMCCCTTATGACTCTTAGTATGTAAAT 

TTTCTTAAATTATACTCTTTTGTTGTTTATTCTAMCTACTTGAGGCAGATCTCAAAAGT 

TGMTAATTACTGTGCATCTCAGAGAATTGTCTTGTGTCTTTCAAATATATTTAGTTGTC 

TTTAGCAGTMCTCTTACATGGTTATTTGCATGACTTMTAAACATTTCAATGCCCCCAA 

GTTACCTAGCAGAAAAAGTCCACTTTTAMGACATCTGTMGAGAGAACTTTATTTTTCT 

TAAATTGGAAGCACATCATTTAGTCCT 

Sequence 2882 

GTACACAGCAAATACMTCTTATGTCAGTTATAMGMTMTAAATAATTTTTTGAAAAG 

CAMGTGACTTGTCCATMCCACMTATTTTTCATTTTCAMTCTCATTTTTAT^ 

TGCCACATTGCCTATAATTATACAGCTAACAATTATCCCATGCTCAGAAGAAATAAAACT 

GACGTAGTTAGTAATCAGTNATATATCATTCCCTTAAAACAAGTATTGCTGACTTAATAT 

GCATAAAACTGCTGTCACATACCAAATATGACCAACAGGTAAAGGGTGAGAACAATTTCA 

CTGACTACAATATTTATAGCAAAATAAAGTGTCTATATAGTGGGAAAGAAATATGCTTAA 

AAAGAAAATAGG GAAA ATGCCCAGTAGAAGAAATAAACTAAATTATGCAACCTGCCCTTA 

AAMAAAAGGCCTTTTATTGGTATTTCAACCTACCCAGAATT 

Sequence 2883 

CCCTTAGCGGCCGCCCGGGCAGGTAC I I I I I I I I I I I I I I I I I I I I I I I I I I I I GCGTGT 

GANACGMGTTTNNNTNTTGTTGCCCAGGCTGGAGTTTGACCCACAAAATAAGGTTCCCA 

NATCACCATCAACTAAAGTTCAAAATAGACAAACCCTATCCATCCTANCCAGNGATTATT 

TAGTATTTGNGGAAAGCCTNTAAATTTAAAGGGTCCACGAACAAAGACNGAGACNAAAGA 

ATNAGAGCNACATGTAGGAAAAAGAGATTTACACAGACTNTAAAACATTAACAAAACCAA 

GATTTGTATTACCAGAAMAGAGTTNTTTTGTNTAAGAAACAAAAATAAGNTGNTNTNGG 

GAAAAAAAAAAAAAACAGNTGGAAGCTTAAAAGGAAACTTAAGGAAACCTNCCNGAAAGA 

AGAATNAAATGNGNGAAATNGAAATAGGGAGAGAAAACTTNAAAGCCCAAAAATAAAAAT 

TGAAATGACNCATCNTTTTNNGGAACCCAAAAAAAAAAGAAAAAAA 

Sequence 2884 
CCCTTAGCGTGGTCGCGACCGAGGTACAGCAGGCCTAATGATAAGACAGCCTACCTGGGC 

TCCAGCCTGCCTGTMCTGTGTGMCTTAGCMGTCCTTCACTCAGTCAGACTTTGCTTT 

CTACATCTGAAAAAGTCAAAAGAAAGAAAATAAAAACCTCAGCCTACCTCCCTCTCAGGG 

TTGTTACAAG GGGG AAAAGAGCATAAAACAGATGCAAGAAGTGCTTCCTTAAAGAGTTCA 

ATATGMTTGTTTTTCTGTTTAGGTTATGTTAAATCTATTTCATATTATTAATCCCT^ 

AGATGACACCCATCAAACAAACACCCAAATGGTCCATGTCTGAAAGACCACAGACTTTTC 

TTTTGGGAAAAAAAAAAA 

Sequence 2885 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACTTT N I I I I I I I 1 I I I I 1 1 I I C M I I 1 I GAGA 

CAGTGTCTC GCACT GTCACCCAGGCTGGAGTGCAGCGGTGTGACAGGCACATGCCACCAT 

GCCTGACTAATTTTATCGTTTTC I I I I I I I I 1 1 CAGGTAGAGATGGGGTTTCACCTTGTT 

GTCCAGGCTGGTCTCGAACTCCTGACCTCAGGTGATCCACCTGCCTTGGCCTCCTTACGT 

GCTG.GGATTACAGGTGT GAGTCA CCACGCCCAGCCATTGATCTGTTGCCTCTTACTAGAG 

CTTCTTGTTGGATTCCAGTTTTTTCCCACTCTATTTTATTCAm 

TAGATCTGTTCATTTCATTTACCCACCAAMAAGGATMGTTCTTACTCTTTGAAATAAA 

AATCAAACTCCTTAGTCTGCCTCTGAAATAAAAATC 

Sequence 2886 

CCCTTAGCGTGGTC GCGG CCGAGGTACCTATTGTATCAGAAAMTGCTMTTMTTTTTT 

GCACATAAAGGGCATTTTAMCTTGGTTTTATTCTTTGTGATAAATATGGATGATGAATG 

GTAATGTTAAACAGAATTCAAAAGTTATCAGTTTGGCTAGCCAGACACAGTAGTATATGC 

CTATAGTCCTAGCTACCCAGGAGGCTGAGGCCAGAGGAGCCCGGAAGTTCACGTTTAGCC 

TGGGCAGCATAGTGAGACACTTGTCTTTTATAAAAACMCAGCAAAAMTGATCAGTTTT 

GGGGATAAGTAAAGACAAAATGGC 

Sequence 2887 

CCCTTGGCCGCCCGGGCAGGTACGCGGGGAAAAAAATATTTTTATCTTTTAGAATATCAG 

TTTGTCCCTTACAGT MTG TAAAATTACATTTATTTTATCTTTAGGATATAACATCTNAA 

ACAGAGGCTATTAACTTTTACACTATGTGTAMGAMCAACTTGAGAGGCMCTTTATTA 

TGCAACCACGATCATGAAGGGGATGAATAAACCAACTACTTAGAAAAAAAAGTGCTCTAT 

TATCATGGAATGGTTTT CATTG ATCCTTTCTTGTATTTCTGGAATCACTAGAACTAATCT 

CCAATCTATGCACTGMTTTTTGACATACGATAAAATGCCATGATTTNATTAATGGATTA 

AMTACTTGTCTGGMGTATANCTTTTTTAMTAATTNCAGTGGATGATAATTTNATTGA 

CMGGTGACANACCTAAATTCTGTTTTCNTCCNTGCCTTGCAAMNGAAACTTTTAGGGG 

GATTAAAA 

Sequence 2888 

CCCTTAGCGTGGTCGCGGCCCGAGGTACAAATAAAGTATTCCAAGGGTTCAGAATAGAAA 

ATGATTTCTTCCAGCTTGGGGACATTTGGGAMTTGGGATATCCTTTGGGGAAATGTAGT 

AATCANTATATTCTGGGAAMCATAGTAGMGMTGAATAAATAAATTCCATTGAATTTG 

GAATATGTTGCCATTCTCCCTGTAACTAATGCTATCANGATAAAGTAGAAATACCACATT 

TCANAAACA GCTGG AGTAGACAGGTCTTCATAGGCTAGCTTGGGAAACCTAATAACTATT 

MTAATGAAATTTTAATTATACTCTTGGATTCTMACAMTGMCACACAGTGATCTTTT 

TGACTTGCTGCTTGGTTAT 

Sequence 2889 

CCCTTAGCGTGGTCGCGGCCGAGGTACTGATTGTTGAACTCCGAGTCACACTCAT7ATGA 

CCTGGGAGGCAGATGTCTAGCCTTCAGCTAATTATGATGGGCAGTATGGAGGAGAGTGTG 

AGTCAAACCTTCCATGGAGGAAAAGGAGTGAACACTCAGGGAGACTGGAGCCGTTTAAAA 

GGACATGGGGAGGATCTMTCAGGATCTTGAAMTTTMTGTTCTCAAAAGTTATTTCTA 

TCAGTCTTMGTAATAGATGMTTTCTTTTTGCCACTGCAAAGATGATACTGTCTTGATA 

ACCATTCTCTATCATCATGCC 

Sequence 2890 

CCCTTTCGAGCGGCNCGCCCGGGCAGGTACCACCTGACTCAGCTGGGTGCATTTGACCAG 
TTTTGCCCCCATCTGTAAATATCCCTCAGTCCCCATGGAGCTGTAGAGGTGAGTGATGAA 
GCAGTGACAGCTCTGGGTAGGAACTCAGCTTAATTTCAAGTTCCAGACCCTCTCCATCCT 

CAGGTTCTTCTTCTGTAAAATGAAGATCCCTTTGCGGTGAAACGTTTTGCAGCTCTGGAA 

TCTGAGGGTGTAGGAGGAGCTACAGAAGAGGGGTATAGGACTCACCTGAATAAAGGTGGC 

ATCTGTGATTAC TGTGG GCGAGGGTGAGAGGGAAGAGGATACTGACCGCTAAGCCTTGCC 

TTCTTAGTCCAAATTTTGGTTGTG CATTAAATATGTCTTAA 1 1 I I I I 1 1 ATAACTTAACC 

TCACC 

Sequence 2891 

CCCTTAGCGTGGTCGCGGCCGAGGTACAGAAGGCTAAGTATCGCTAATTTCTATTCAGAG 

ACACAGGGACTCTACCCTTGTTCCTGGCTTTATAAAGCTGCTCAGTTCTAATGTGCACTG 

CACACAAC AGCT GCCAGGGCCACAAACACCCCCAAGCAGACGTTAGGCTCTCAACCCCTA 

AGATACAGTTT TCCTC TCAATAGCTCATAAGACTCTGCCCTCTGACGTGAGAAAATAGAA 

TTCTGTTGTGCTTTTTAAGTATGCCTATGGCATCTTTTAAAATATACCATATATGATCCA 

GTATTTCTTTGTGTTTGGAGTGTGAATCTGCAGGTGAAAGAGTGTGTTCTACACGTGCCC 

GATCCATCCTCATTACACAAATTCACCCTCTACTTTAAAACAGGGAATTTGGGCCT7TGA 

CAGGAAACAAGCCGGACAGTCATGGGGCCAAGCTGGAATCAAATGTAAATATTTAATTTC 

CTCCTGTTAAAAGCAGAGTGGACCCACAGTCCCTTCCAAAGTAA 

Sequence 2892 

CCCTTTCGAGCGGCCCGCCCGGGCAGGTACCCAACTAGGAATTGGGGCTCAGCAAAGTGC 

CCTGGAGGAAGTGATACCCAAGCTGAGAGATGAAGAATAATGAGTGGGGTAGCCAGGGAT 

AGGTGAGAGGGAACATCCTCCAGGCAGAAGGAGAAGCAGACACAAGGGAGGGTGTGGCAA 

GAGATGAACCTGAAAGGCAGTCTTGGACCATATCAGAGAGGACCTTATAAGCTCATCATG 

AAGCTTAAAATGAATCCTTANGTCATTGGACAGCTGTGGAGAAAGAGTTTGTAAGGGGAG 

GGAAGTTATATGAATCACAGGAATTTTTTAAAGGTGGCTATGGTAGTAGTGTGAAAAGAG 

AAATGATTAACAGTANGCAAGCTCAGAGAAAGAAGCCCGCCAAGGAGGCTGTCACCATAA 

TCCAGCAGAAAGAAGATGGTGGCCTGGACTGGGGAAAGTGACTGTAGGGGATGGAGAGAA 

ATGGATGGGCTTGGGAAATGAGAAAGGGATAGGATCCACAGGGCTTAGGATTTGAGTGAA 

TAGAAAAGAGAGAGAAAAGTTAAACCAGACC 

Sequence 2893 

CCCTTAGCGTGGTCGCGGCCGAGGTACCACTGCACTCCAGCCTGGGCGACAAAATGAGAC 
CCTGTCTCAAAAGAAAGC CCTCT CCTTAGCTGAGCAGAGGAAGGGAAGGAGTGTGGCTAT 
GAGMTATGATTTATGCCATTTTCTGTTTTTAAATCTAGAAGATCTTCTAAGCACAAATA 
CA GCTACMTGAM TATTTTACAGACAAAATGTTAATAGACCATATTCTTTGAATTAAAT 
TTGTTTTTAATTTTCTCTACACAI 1 1 1 1 1 1 1 I I CCTGGAGTCTCTTAGCTCTAAATATAT 
CMTCAGATTTATMT TTTTT TTACCTGATTCAGATGTCTTACATTTTTATATTAAATGA 
ACCTTAAGCATGATTCTTTTGGTAAGCCAGTATGAATGCCAGTGGTTGGGGGGCGGG 
Sequence 2894 

CCCTTAGTGTGGTCGCGGCCGAGGTACAGTCCACAAAATGGTTTGCTTCTGAAAAAGCAG 

TAGTAGTCTTACATCCAANATTATTCCTAAAATAAAGTTTCGACCCACATATATAATGAA 

ATTGAAACTGCCTACATTGGCAAAGCCAGAAAGAGAGTAAAAAAGATGGGGTAGGCCAGT 

TGCGGTGGCTCATGCCTGTAATCCCAGTACCTGCCCGGGCGGCCGCTCGAAAGGG 
Sequence 2895 

CCCTTAGC GTGGT CGCGGCCCGAGGTACTGTCTGTGACAAGAGCAAGAAATACATTTTGA 

AGTGGTTTGTTTTAGTAGCTAAAGTTATCCTGACTGATCATTTGATAAAGGATATTTACC 

AGCAGTTCACAATGAACATACTACTACATGTATATATTAAAGATCATAGCCTGCACAATA 

AAAAGAAGAAAATAAATAAATTAGAAAATAAATAGTTTAGAAACAAAGACACAGTATTAG 

TTCTAGTTACAGAATATGATTCTCCAAATAGTATATGCTATCAGCTTCATCAACAAGTAA 

CCAGAATACCAAAGAAGACATAGAAATGGCTAACAAGTATATAAAAACATGCTCAACATC 

ACTAATTATCAGAGAAATGCAAATCAAAGCCACAATGAGATATTAAGTCACACCTGACAG 

GATCACTATTATCAAAACTACAAAAGTCAATACTTATTGGCAAGGATGTANAGAAATTGA 

AACAGCATGGAGGGTTTCTCAAAATATTAAAAATAGAACTACCATATGATCTGCAACCAC 

TTCTGGGTATTTATCCAA 
Sequence 2896 

CCCTTAGCGTGGTCGCGGCCGAGGTAC I I I 1 I 11 I 1 I I I II I 1 1 1 1 1 I I I 1 I I I I I I GGA 

GACGGGGTNTNACACTGTTGCCCAGGCTGGAGTGCAGTGGTGCAAACATGGCTCACTGCA 

GCCTCGACTTCCTGGGTCCAATNAATCCTCCTGCCTNANCCTNCAGANTAGCTGGGACTA 

CAGGCATGTGTCACCACTCTAGACTAATTAAAAACAAAAAATTGTAGAGACAGTGTCTCG 

CCATGTTGTCCAGGCTGGTCTNAAACTCCTGGCAATCCTTNTGCCTTGACCTCCCAATNT 

GCTGGGTTACAGGTNNTGAGGAACCTGCCTGGCACCCAGTTTT^ 

TACTGTNCCCMCCAAAATGACTTCTTATGMGCTTNTCATCANTTMTGAGCTTTATTA 

ATGANANCTTCAACACTGGNCAAGGGGGGAAAAAT 

Sequence 2897 

CCCTTAGCGTGGTCGCGGCCGAGGTACAGGCGTGAGCCACCATGCCCAGCCTGAATCCCC 

CTTCTCAMAGTAACTTCCATTCACTTTGTCCCTGGCAGTCTGCTGGTGACCATGATTTC 

TAAGGCTCACTCATGGGACCAAAGGCAAGAAGAAAGGGAGGAGGGTGGTAACCAGATAAA 

CTTTGCMCCCCTTTGATAGGACTTTTTGGGTGAGAATTTTTATGAAAAATAATATATAC 

TTCTACTATAGATCTGCCTCTAGTTTCAACCTCTAGTTTTAGCACTG A I N il I CCCCTT 

MTTTCCAGMGCATGATATAATMTCTTTTTCAAAGAAAGTGGG 

Sequence 2898 

CCCTTAGCGTGGTCGCGGCCGAGGTACCCTGGGGGTGTCATCAACCAAGATTAGAAATAT 

GGGMGATGAAATGGTTTTTAGGAAAGATMTGATTTTMTTTTAGAACTGTCAAATCTG 

AGGTGAATCCTTCAAAAAAGTCAAATGATAGTTCCAAAGAGTGACCATGAAATCTGACAA 

ATAAGAGTTATCTTAGCAAGGAGCAGTCTCATGGAAATAATAAAAGCTGAAACAAATAGT 

AGTGATTGAGGAGTAAATGAGAGGTGAAGCAGTGGAGATAATACAGGTTCTGTTTGGAGG 

AGACTTGGCAATAAAGGTGAAAAAATAGGATGGATGAGATAGCACTCAGCAATGAAATGT 

AACAGAATTGATGCAACACGGGTGAATCTCAAAATCATGCCAAGTGAAAGAAGTATAGTC 

CCTATAATCACCTATAGTCCCAATTACTCAGGAGGCTGAGGCAGGAGAACTGCCTGAGGC 

CAGGAGTTTMGACCAGTCTGGGCCMCACAAGCMGACCCCCATTCTTTTAAAAAAAAT 

TAAAAAACATTANCCAGGCATTGGTGTGGTGCCCCCCNTAGTTCCCCAGCTACTCAAGAG 

GC7TGGGTTGGGAGGACTACTTNGNNCTCNTGAGTTNAAGACCAGTCTNGGGNAATCAAG 

CCAGANCCTNNTTTCTTNAAAAAAACAANCCATAAACCTT 

Sequence 2899 

CCCTTCGAGCGGCCGCCCGGGCAGGTACTCCTATCAGTGATGTATGAGAGTTACAGNTGC 
TTCACATGTCTGT CAAC ATTTACACTGTCACTG I I I I I I ATATTAGCCATTTTAGCATAT 
ATGAMTMCTGATTTTMTTTACATTACA 
CATATGC7TATTTTTCATTTGTATATCTTTCM 

7TAMGTTGGGTGTCTTTTTATTGTAGGTGGTTATTCTTTGTATTTTGAATATAM 
TTTG ATAGATTATGCCTATG A I Mil I AAAGACTCTATGTNTTCACTTATTGGGGATCTT 
TCGA AGGAA CAAAA I I I I I I I I A TAAAAATCCAACTTTTATCAGG I I I 1 I I I AATNTTAA 
TGGNTTTTTATNGACCCTCAAMGTCACCMTAGGAACATNATTTTTNTTTCCTTANAAA 
GTTTTTATGGGTTTTGGG 
Sequence 2900 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACATCTCTCCATATGCAGCAGGMTGTCTTTTT 
TATTTAAGGGCACATTTACACAAAGAGCAAATGGTCCAGACTCAGTCTTCCATAACCTAG 
AGTCTGA ATT GATCA CTCTATAAAATGGAAAAATGTTNCTTTTC I 1 1 1 11 I I I I II II I 1 
TTTTTTTNGGNTTTTMCTTTTnTGAGACAGNGTCTCACTTT 

GCAGTGACACGATCACAGATCACTGCAGCCTTGACCTCCTGGGCTCAGGTGATCTTCCCA 
TTTCAGCCTGCTGAGTAGCTGGGACCACAGGGCGCGTGCCATCACACCTGCCTMTTTTT 
GTATT 

Sequence 2901 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACTGTAGTTATAAAACTGACACTGTTTACAATT 

MCAGTGTTCTAGAATCCANTTGTTTGGAGGGTATTTTACATTATGAAATATTGACTTCA 

GATGGTCACTGCTATTTTCGAGATCTATGACTATGTTTCAAGGAGATCCATTGGTCTGAC 
AAAATAAGAGATGTATATTCCTGAAAATTGCATGTCCTCTGGAGTCTGTTGTCTGGATGG

TATCAGAGAGGTATTAGAACTGTCCTCAGAGGTCATTACAATTTTCTCATAGTGTTTGTC 

AAGGAAAGAGTTGGGGTGTGAAAGGTCTTCTATAGTATTGGGTCCTAACCAAGTCAGGGA 

AGAGAAGCAAATGCC 

Sequence 2902 

CCCTTAGCGTGGTCGCGGCCGAGGTACTTAAGTAAAGAAATTAACCTCTAATAATACAAT

ATTTTTATTAGTTAAAAATAAAGGACATATGTATCTACACACACATATATGCTATACATA 

AATACATATGAGAAAGACAAAAATTGAGAGATTAAGAGACAGAGAGAGTTTGAGATAGAG 

AGTTCAAAAAAAAAGCAAA TGAT ATAAATTTACAAGAGTAAATAGAATAATCATGTGAAG 

GGGTATTAGGAGTTCTTTGTTTTATCTTTTAACATTTCTGTAAGTTTAAM 

AAATAAMGATCTAAATATTTTTCAAAAAATAAMCCAMTTTTTGAAAATCTAAAAAAT 

TTTCAAAA AATAAAAACATAAACATAAAAAC 

Sequence 2903 

CCCTTAGCGTGGTCGCGGCCGAGGTACTAACAAGGCCAGAAGCCTTCTACCTTCAGTCTT 

GCTCTGGCAACCCCACCTCTTCTTTCATCTTCTCCCATCTAGGGAGATGTCTAATGGGCA 

TGATAGTTAGAGGTGGTGAGAGGATAGGGACAGGAAAGTCCTGCCTGTGGGACAACAGGG 

ATCAGGGACTCAACAAGTCTTAGGCCTCTCACCAGGACCAGCATGTTGGGAAGCTGCCTC 

TCCATCCATATTCAGTCTCAGAATGGGCAGCTGCTTCCCCTGCACCCTCATCTTGGTCTG 

GTCTTGGTGATAGGGTGGGG 

Sequence 2904 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACAGTGGTGACAAGATCTCAGCTCACTGCAACC 

TCTGCCTCCCAGGTTCAAGTGATTCTCCTGCCTGAGCCTCTGAATAGCTAGGATTACAGG 

CACGTGCCACCATGCCTGGCTMTTTTTGTATTTTTAGTAGAGACAATGTTTCACCATGT 

TGCCCAGGCTAGTCTCAAACTCCTGACCTCAAATGATCCACCTGCCTCAGCCTCCCAAAG 

TGCTGGATTACAGGCGTGAGCCACTGTGCCTGCCCTAAGCCTGTGTGTTTTATTCTTCTG 

ACTTGCAGGCTAAAGCGGCAGCTCTTCCATATCTCATTGCTATCTCCTAGGGCTTCCGCT 

AGGAGACTGATCTGGGGCTAGAGGCCTCCCTCTGTGTACACGAGAATGCTGGAAATGTCA 

CCTCTCAGGGCTCTGCCTGCCTCTCAGCCCTGAAAGCCATGGTGGAAAGGGGTGGCGCTT 

GACATAGACATCTGAGGAAAAGAAGTGAGGGAGGGTAAAGGGTGGTGCAGTAAGANGAAG 

GGGTNGGGAAGG 

Sequence 2905 

CCCTTGGCCCGCCCGGGCAGGTACATTATCATTATTTTATATCTTGCTCMTCTTTTAGG 

CGTCM TCT CACAT CAAAAAGTTTATTGGCCTATAACGGCACGAGCCAACTTCAAATTAT 

TrTTTAAMGTTTTGTGTATTTGTAGATTATGAATCTCTGGTMCTACTTGTAACATTTG 

GCTCTGTTTTCTGTGAATATTTATCCCTAGCTAAAAAAAATTCTTACATCTTCAGAAGAG 

CTATAATGTTGACACCTAACTACACTGTAATACTACACAGTAATAGGACACTAAGCAGAT 

AGATAGGTTTTTAAAAMTATTTGTATTTTCAGCACCTGGTAATACAGTGGTCTA 

Sequence 2906 

CCCTTAGCGTGGTCGCGGCCGAGGTACTTCGTTGGTGCCTCAGTCTTTAAGGATTAACTA

GGMGAATTTTCTTTCTGCATAGMACTTTATMAATAACGACATTGTTAATAATTCAGG 

CATMTATTACATTATACCTTTCTTGAATGCTGACGTTGCATACAAGGGTGATCTGTAAA 

CCTCCATCATT TCCTTG CTCTCACTTTTAGACCTATTTCCTGACATAANAACCTTGGTGG 

GTAGTTCTGGA I I I I I I I I i I NCTTCTTCTTTTNTTATTTTTGAGACAGGGGTCTCACTG 

TGTCGCCCAGGCTGGCATGCANTGGNTGCTCAAGTGATCCTCCAGCCTGTGCCTTNCTGA 

GTNAGCTTGGGACAACAAAGCACAANGCCACCAGGTNCTGGCTMTTTTT 

Sequence 2907 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACATCACCCTGCTGAGGGACATCCAGGACAAGG 

TCACCACACTCTACAAAGGCAGTCAACTACATGACACATTCCGCTTCTGCCTGGTCACCA 

ACTTGACGATGGACTCCGTGTTGGTCACTGTCAAGGCATTGTTCTCCTCCAATTTGGACC 

CCAGCCTGGTGGAGCAAGTCTTTCTAGATAAGACCCTGAATGCCTCATTCCATTGGCTGG 

GCTCCACCTACCAGTTGGTGGACATCCATGTGACAGAAATGGAGTCATCAAGTTTATCAA 
CCAACAAGCAGCTCCAGCACCCAGCACTTCTACCTGAATTTCACCATCACCAACCTACCA
TATTCCCAGGACAAAGCCCAGCCAGGCACCACCAATTACCAGAGGAACAAAAGGA 
Sequence 2908 

CCCTTAGCGTGGTCGCGGCCGAGGTACTTTGCTAGTATTTGTGGGTTTTGTATTTTACAT 

TTAAATCTTAATGTAAGATTTAMTGTAAATCTTAAACATCTTAAACACATATAGTTTAA 

GGTGTGAGGGGAGATTTAACTAATCTG I II I 1 I CCTTTTATTCCACTTATTAAACCCAAA 

TGT TTMT CAGTTGTCCCTAATATTGTTTAGTGAATMTMTTGCAMTAATATGGTTTT 

TATATTTTGCMGGCAATTTTCCTTCATCCTTTATTCTTCTm 

CTATTTCTACATTATCATCATCCATCATCCATTATTATT 

Sequence 2909 

CCCTTAGCGTGGTCGCGGCCGAGGTACATATGGATTTGATACGTAGATATAGATTTAAGA 

CTAGAMTACTTATAGATATATTGGTATATAAATGGTATTCCAGTGCCCTTTTATAGGAT 

AAAATTCATTCTTATATTAGAAAAATCCATTCATACATMTTTTTGATTATTCAAGACAT 

AAATGTTGAACAACTAAGTGCTAGACAATATGGTAGGTGCAAAATTAACACTAGAAAATT 

CTTCCC CAGAG CACTGATTTATCATTACACMGCACAMTCTTTAATTCTCTAATCACAT 

AAMTTATTTTCTTCTMTMCTTCATCTCAMTTCCCTTTGGTTTCTGTGAAAAGMTC 

TTATAATATGTCACTATACATACAAATTATGAAAAATTTATGCAAACATACATCCTTTTC 

AGGMGMCCACAAATCTTCCTGCTTGAGGATCATTAGAAGGCAAATTATTTAATC 

Sequence 2910 

GGTCGCGGCCGAGGTACTTAGAACACTGTTGATGGAGTATATCTGTTGTGAGCCTGGTTT 

CTGTGTATGTGTCCAATTCAGTGATAGGAGAAGAAAGCCTGTAATTACATTTGCATAAAG 

TGGTTGGCTGCTAAAGAGATGTAATTCCATTATTCCAAACTTTTMTTCATATTTTAAAA 

TATAAAMCTAGTATATATATCMGGAATATTTTAGGAAATCTTGTAGTTACCTGTAAGT 

GTCTAACTGTGT ATTCA ATCAGTAACCAATATGGAAATTGCAGAGTTTCTTCATCAGAAG 

ATATGAGTTTAMTmATATTTGGCATGCACAGCAATATATTAMTGCTGTGCTTAACT 

AGAMGTATTGTGCAGTGTGATGGCCT7TTMCCTCACAATATACATTTTATTTTTATTC 

CCTAGACTTGCACATGMGTGCATATGTTCATATTTACCCGTTTGGAGATGGGTANGATA 

CMCCAGCATAGCTCCATCTNCATGGNCCATCTTTGATTTCATGCCNCCTTTCCTTT 

Sequence 2911 

ACC CNTAN CGCGGTQGCNGCCGAGGTACTATTTGGGGGATCAAGATNTGNAANATTTAAA 

CTGTTTTTGGTAAAAACTAGCACTGTTGGGAAGAATGCTGTGGAATACCACAAACTAGTT 

TNGCAMGGGGMMGMTGTTTNANMGTGCTGATTGTATTTTAAACATCAGTAGTGAT 

AATACAAGGAAAGCTTTTA 

Sequence 2912 

CCCTTAGCGTGGTCGCGGCCGAGGTACAGTTCTCAGTTTTTCTTATAGGAGAAATATGGT 

ATATGTTTATMGAATCTTTTATGAGATTATAGATTTCAATGCTGTGGATAGTGTCTTGC 

ACCCMACMGAMGTCCATMTGGMTGATCTTCCCTCAGCTTCCTATCGATTTAGTTA

CCTCTTGAAAGCACAAAMTTAAMCATTGCCATATGTTGAATTTTTAAAAAGCACTTGG 

AGTGAGCGAACATTTCCTGATAMTGCCTTTTAGAGATAGGTTCTTGATATTCAGACATC 

TGCAGM ATGTT CTGGTTCCCAAAGTCATTTCACTTCGAAATAAAACACAGCTCCTTCAA 

ACAGCACTTTTTCCACATAAATCTAGTTGCCTCTCCCTGTGGACATTCAGAACTGATAGA 

ACAAACACTACTCTTTTGAATTTGATGGTTCGTGTCCTTTAMGTGTTTGAGGACCTATG 

CAGAGCCTGTAACACTTGGGTAGTACCTGCCCGGGCCGGCCGCTCGAAAGGGGC 

Sequence 2913 

CCCTTCGAGCGGCCGCCCGGGCAGGTACAATTGGGTAGGCCCAGAAGAGGAAAGAGAGCA 

AAATAGAGTAGTCAGGAAAAGAGGGCCCAGTTTGGGCGCCTGTTAGCTCCTGTGTGGTGT 

GAAAGGCACTCAGATGTTCCTGCATCTCTGATGGAAAACACAGAACCTGGCTCAGCTGAG 

AGACAACAGCTCTGTCATGGGGAGGTTGAAGTGACCCCACAGCAAAGTGGAGAGGAAGTT 

GTTGAACAGAAGCTGTGGCAGAGACTCAAAGAGGCCTATAATGCTGGTGAGAACTATGGT 

GGCCTGAGTCATC 

Sequence 2914 
CCCTTAGCGTGGTCGCGGCCGAGGTACTCACCTGCCACATCCTAATCTCAGCTCCTCTCT

CTCCCACCCTCATCCACCTTCCATAACATGCTTTAGTCCCACCCTACTGTCAGTTATTCC 

TGCAAAGAATACAACTTACAGCCGGGCGCGGTGGCTCACACCTGCAATCCCAGCACTTTG 

GGAGGCCGAGGTGGGTGGATCACGANGTCAGGAGATTGAAGACCATCCTGGCTAACACGG 

TGAAACCCCATCTCTACTNAAAATACAAAAAAAAAAAAATTAGCTGGGCATGGTGGCGGG 
TTGCC 

Sequence 2915 

CCCTTTGAGCGGCCGCCCGGGCAGGTACTTTGTTTATGGATCTGTTGAATCTGAGGAGAA 

GTGATTGCTTTGAGTTAGCTAGTTTTAAATTCATGACACTTTCCCAACATAATTAGGTAT 

TTGMCCTMGTGTGTGCTAGTGTATTGTATATTTTTCATTMTTACTTTATTGAAAGTT 

GCmGGGMTCMTTCACTTCAATATATTTTACTGTTCTCAGTGAAGAATGTGACTAAT 

MGTATTTGGACTTTCAAAMGTGATACAGTCTTTGTGATAGTTTTTTACCTTTACATTA 

ATGAGTTGGAAAATAAC 

Sequence 2916 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACCACTTCACACCAGCCTGGGTGACAGAGTGAG 

ACTCTGTCTCGGAAAAAAAAAAAAAAAAACGAAAACAAACAAAAAAATAAAATGTTCACT 

TTTGGGCTCTATGCCCAACAAGTGAGGTTGCTGGACTGTGACTACAAGGACACCTGAGCT 

GTAACTGTGTCCATGCATCTCATTTTCTCTTGAGAAGACTAAAGAACTGACCTTTAAAAT 

AAACACAGCTCCATGGTGGCCCCTITCAAAGTCATCACCTCAGGAGGCCACACACGTGCC 

TGAAGGATGCTGCCTTGGCTTGCAAACCTCTCAAGCTCCCCCAGAGTAGCCTTCANGGTC 

CCCTCCCCGCTACACAACAGCAGCCAGCTTTGCTAGCAGTCACTTCTATGCTTCTCTCCC 

ACTGACAGTTACTGNGAGTGAGCCTCTGCAAGTATTAACAGCGCCTAATTAATGCAGTGA 

CCCA AGATACAAGCGGTGACAAACTCCTTACCCTTCAAGTANGGCCTNTGTGGCTGAAAA 

TTTTACAATGGGCTTGTGAAGGGGA 

Sequence 2917 

CCCTTAGCGTGGTCGCGGCCGAGGTACATTTACAGAAAGGCCACTGCAGCTGCATCATAG 

TTCTATAACTCTTCAAAGTCCCCAGGCACCCTAAGTAGATAGCTTGAAGCTTCACCTTTA 

CAAAGTCCAATCCATTGCATCAAAAGCCTACATTATCCCATTTCCTGTGGCCTAGAGATT 

ACAAATCACCTGAAGTAATACCAAAACCCAGTGAGTAAAGACGGAAGTAAATCTAGTGAG 

TTTCCAACCACCCACCGCCCCCGAGATCTTAAAGGAAGCAGTAACAAGCAACTAGAACTC 

ACATGAGAATTTATAGGCCAAGAGATTTTAATGTAAGCATTCTTATAATTAAATACCTCT 

TACAAAAAATGCCATTTAATCAGACAAGGGGAAAAAAGTANCATCAAAAGAGATGCTGAC 

AAGGGAGACAATGAAATCAAAT 

Sequence 2918 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACTTTTAGTAGAGACGGGGTTTCACCGTGTTAG

CCAGGATGGTCTCAGTCCCCTGACCTTGTGATCCACCCACCTAGGCCTCCCAAAGTGCTG 

GGATTACAGACGTGAGTCACCGCGCGCAGCCCGCTTTTGTGGTTTCAAAAGTTCTCACTT 

GCTAATGTCATGTTAGAGATACAAATTTAAAGTATGCTAAATCAGTGGAGAGAACAGATG 

TTTAATCTATAGACACTTATACATGCACTAAGAAAGTTGACTACTATACTGAAGTCCCTT 

ATAAGAAGGGAGTTGTTAAGGATAAGGAATGTGGATTGGGCTCTTTCAGAGACTCCATAC 

AGGTATATTTAATTAGCAAAGGAGTCAAAAAAACACCAGTTGTTATCAACAGCACTTTGC 

ACATGTTGGGATATGATTTGCCAACTGGTGAGACTATTTCAGGTATTAACATANAATATA 

TGTCAGCTTTTGGGGAGAAAAAAGTAGTCTAGGACAGTAAAATAGTCTTATTTGCTTAAA 

TGAACCCATAAACNTAGGCAGCATTTCTTCAAAGTGGTATTNCATGGATCACCAATCCTT 

ATAAAA 

Sequence 2919 

CCCTTAGCGTGGTCGCGGCCGAGGTACAGAGTATTTTAATCTTTAGGGGATCAAGATGTC 

AGATGCAAACAAAGCTGCCATTGCAGCAGAAAAGGAAGCTCTGAACTTGAAGTTACCCCC 

CATTGTCCATCTCCCAGAAAACATAGGCGTTGATACACCAACACAAAGTAAGCTGCTAAA 

ATACAGAANATCCAAGGAGCAGCAGCAGAAAATTAATCAGGTTAGTAATTGATGGAGCCC 

AAAAGAAATTTAGACAGAACACTGGGTAAAAAGAACACCTCTATTACCACCACCCTGATT 
ATNCCTCAANACTATGACCAGTGAAAATGAAAAAAAAAAG
Sequence 2920 

CCCTTAGCGTGGTCGCGGCCGAGGTACTGAGTTTGTGTTGTTAAGAGTTAAGGCCTTAAA 

CTTGGCTCAAAATAMTTTMCTGTGATTTCTCTMTTTTTAGTGCCTACTGTTTCACCA 

GACACCTAGAGTCMTAGTGTCCTAGAATCTACGGTGTCCAGGGCTTGAGCAAGATACTT 

TAGTGAATATAAACACTAACGAGACCTTGCCTTTTCTCAAGCCTACAGGATATCAGCCTA 

TTTGAA AAGTGACACTGGCTTTACACCCAGAAGCCCACAAAATAGAATGATGTAACTGAA 

I 1 1 ii iGGTAGCTCTCTGGGTAATTTATGTGGTGTTAGGACTCTTTGGGTTTGAAAGNAA 

AGAGACTCGTGATCCCATTCTGGCATTGACAGTAAACTTCATTGGA 

Sequence 2921 

CCCTTAGCGTGGTCGCGGCCGAGGTACAAATTATAATACACATACCAGACAAATTTTATT 

AAATAATAAAGAAGGGAGAGAGAAAGAGCAAGACACTTTTTAAGCACCATGTGCCAGGCC 

AGTGCTTG AATCT CTTCCTACGTTACTTCTCCTTATTATTTAACCGTTATTTAAATGTTA 

C TACT CCCATTTTACAGATGGAGAAATTTAGCTTAAGGTAATGACAAGATTGAACTTCAA 

GTTTTCTGACTCCAAAATATGCCCTCTTTCGACCACACCGTATTATCTCTAAATATGGAA 

MCATTTCAAAAGTAGCAAGCGTNTTCAGAACAAAAACNAAAACAATCTAAGGGGGCTTA 

CACATTATAAACATGGTTTCGGGGTCAT 

Sequence 2922 

CCCTTAGCGTGGTCGCGGCCGAGGTACTTGGAGAAAATGCACAAAATCTGACTAAGGAGC 

CATA AGAC TCT GGGTG CAAATAATTGAAAAAAATATTTTAGAAGCATAATATCAATTCAT 

CTAGTTTTAAAGTTTTCACAMTTATAATAGGGMTTCTGCTTATTTCACAAAATAATTA 

TCTGACCTTAAGAGG ATTTT TCCTAACATCACATAAACCTGATGGCTTTACACAGTGATG 

ATGAGCTATTATTTCATTTTCATCCTGATAAACTACATMGMTTGCAATTTTTATATGC 

MTTATTCATAATGTATGTTTGGATTTCCCTGTTTAAATTCTATTAGAGTCCTACTTTAT 

GAAAAATCTATCATTTAATATTCTATTTAATACTGAGATGAACATACATAAAAGCAAACA 

GATGAAGAAGATGAGTAATATCTCAGGATCTGTAGCCATCAGACTATGAGCAAATCATAA 

TATATTATTATAAGTCCCTTCTGTTACAGTCAGATATTTCAGAAAAATAACAATAATGTA 

TTAAAAACGGAACTTCTCTGGTCTTTAAATAGCACAAGTAATTCAACTGATAAATCTAGA 

TAATCTTATAGCTCTCTATACCTATNGGTTGGGTTTGCCTTTTCTA 

Sequence 2923 

CCCTTAGCGTGGTCGCGGCCGAGGTACTGCAAAATTGCAAGGGCAAACTTTGTGTCTTAC 

CACTATGATGGGAGAAGAGTTAAATAAACAACATAACAACAAACTAACTTATATACTCTG 

GTGMCACCTCGATTTCATGATNAMGGGAATATACATTTATTTTCAAGAACTGANTTAC 

ACTTTTACAAAGTTGACCATATATTAGTTAATAAAGCAAATCTGANAACATATTAAAACA 

ATTATCATATAAACTACATTCTTTGATCAGAATGAAATTAAATTAGAACTCAGTATAAAT 

AAAGAACACCAAATAATGTGCAAGTCTGGAAACATCCTTCAAAGTAACCTATAGGTCCAC 

CNNNNNGGANANAAACCAAANNCAANTNATGTTCCTGCCCGGGCGGC 

Sequence 2924 

CCCTTAGCGTGGTCGCGGCCGAGGTACGCATAGGAAGGAAGCTGTCCAAGCCCTCCATTG 

CTCTTTGAGCTCATCATGTGATTATAGGGAAGGGGAAAGGGAATGATTTCCTCAGAAAAA 

TATACTGATCACCATATATTCTTTCGGGATGCTAAACAAGTATCCAGAAAAACATTTACA 

TGTAATACATCTACAAATCCTCTTTTCCAGTAATGCTTATGTCCTGTCATCTGAGATATA 

AMTTCTTTCTGGGTTACTMTTTCTTMTTTGGCCAATTTATTTAAGATGCTCTTCAAA 

GTTTTAAATTAATTGACT 

Sequence 2925 

CCCTTAGCGTGGTCGCGGCCGAGGTACAATTTCCACTCTTAAAAATTTACTAAGGTTTGC 

TTTGTGGCTTAAGATATGGTTAATTrTTTATGAATGTTCTCTCTGTCACACTATTATTTAG 

CATTGTTAAATATCTAATGTCAATTAGCTGAGAAAGAGAAATGAGGCAAACAAATTAGAA 

AGGAGAAGGTAACATTATTACTATCAGATGTTATAAAAGAACCAACAGGACAATCTATAT 

AAACATCCCCGCGTACCTGCCCGGGCGGCCGAAGGG 

Sequence 2926 
CCCTTNTTGTGGTCGCGGCCGAGGTACAGTCCACAAAATGGTTTGCTTCTGAAAAAGCNG
TAGTAGTCTTACATCCMGATTATTCCTAAAATAAAGTTTCGACCCACATATATAATGAA 
ATTGAAACTGCCTACATTGGCAAAGCCAGAAAGAGAGTAAAAAAGATGGGGTAGGCCAGT 

TGCGGTGGCTCATGCCTGTAATCCCAGTACCTGCCCGGGCGGCCGCTCGAAAGGG 
Sequence 2927 

CCCTTAGC GTGG TCGCGGCCCGAGGTACTTTTMTGTTTCTTATACTTGTTTCTTCAGAA 

TT TAATTAGTT TTAMGGAATTTTTCTCATTATA 

TGTTTTGTTTTGTTTTTTTGATGTMGTTCTCGCTCT 

TGAGCCGAGATCACACTACTGCACTCCAGCCTGGTGAC 

Sequence 2928 

CCCTTAGCTTAGCGTGGGCCGGGGCCGNAGTACATTTACAGAAAGGCCACTGCAGCTGCA 

TCATAGTTCTATAACTCTTCAAAGTCCCCAGGCACCCTAAGTAGATAGCTTGAAGCTTCA 

CCTTTACAAAGTCCAATCCATTGCATCAAAAGCCTACATTATCCCATTTCCTGTGGCCTA 

GAGATT ACAA ATCACCTGAAGTAATACCAAAACCCAGTGAGTAAAGACGGAAGTAAATCT 

AGTGAGTTTTCAACCACCCACCGNCCCCGAGATCTTAAAGGAAGCAGTAACAAGCCACCT 

AGAACTNCCATGANGAATTTTNTAGGCCNAGAGATTTTTMTGGTMGCATTNTTATAAT 

TAAATACCTTTTACAAA 

Sequence 2929 

CCCTTAGCGTGGTCGCGGCCCGAGGTACCAGCCAGTTTTGAGATATACCTGGTTTGCTTT 
MTGTTATTAMTTGCTTTGGTGAGTTTTGCMATGGACAATTGMCTTTGTTCTGCCAA 
ATTCI I 1 1 f rA TTGCATTTAMCTTCAGGGTTTTTTATTTCCGGCTTATTATGAATGGAT 
TAGTCATTTTGTGAATAACTCTTCATTGAGTTTAGGGAGAAATAGGACGTTAGATCCGAA 
GATGCTATTTGTTTAATCACCAGAATGGACAAMTTATACTTCATTTACTATCAGGGAGG 
CA TGGGGAGAATAGAGG CCTAGGGGTGCCGTCTTGTGGCTCTTACTG Mill GTGTATCT 
GGJ N 11 I I I I I N II N IGAGAGGGCTTCCATTATGTTGCCCGTGCTGGACCCCAACTC 



Sequence 2930 

CCCTTCGAGCGGCCGCCCGGGCAGGTACGCGGGCAAGTCTGCATGGCAAAAAAGGTGTCC 

ACMGTGTGAATGGCTCATTTTTAATGACTAGCCAACAGCCACTTGTAAAAGATGTAATT 

GMATTGTTCAGAGATTAGGTTCCGTCTGCTTTGNCCTCCTTCTCAAAAGTTTTCATGGA 

TCCMGCTATTCTTAAGCATTGTTTAATCTAGGAAATAGCCTTTATGGACTAATNATAAG 

AAAGAAAAAGGCANATTTTTAAAAGATATTCCAAATTCAAAAGAAMTAAGTTTGGGCAG 

CCTGACTT TTGA TTATCTGTA AAAAAT CANGGCCAATAGTTTTGAACGGGAAGAGTAAAT 

ATGAAN GGTTTTCT AMCCAGTTTTTTCTTGGMGAAGAAGGGGTTGGNAGGTTAAGAAA 

GTTNTTCTTTTTTTNATCNAGCCATGTGCCCTACCAMMACATAAMTGGTGGTTTCCT 

GTGAACCCTCAAAAAAGTAAAATCAGTGAGTGNGATGNAGGGAATTNGGG 

Sequence 2931 

CCCTTAGCGTGGTCGCGGCCGAGGTACTTGAAAACTCTGATTCCCAGAGGGAGGGTAACA

TAGAG CAAAG CCCCCTACTTACCCACAAAGAATATTCGGTGTGTGGCAAAAAGGGGAAAC 

TTGTTATTT TAAG TCACTAATTNAAAAACCTAGCTCAATCTAATATACTAGCTCAATCTA 

ATACA CTAATTTTGT TTTTAMTTAGTGTAMCTMTTTAAACCTAGCTCAGTCTAATAC 

ACTMTTTTGTTTTTAAATGATAATACAGTAATGAATACCTAGAAGACAAAGGTTAAGGG 

AAAGGAAAAGAGAAAGAGGTGCTGAGAAAAAAAACAAGGGGAGTTTGAACTGAAAGTTAA 

AGAGCTCATTAAMTTTTGACAGATCATCCATCTTTTCATCTGAACTCCGTATCTATAAA 

TCCTGAT 

Sequence 2932 

CCCTTAGCGTGGTCGCGGCCGAGGTACCTCCCCATCAGTGACATTAACATTACATATTAA 

TTCAACAGGACATCTCCTATATGCCAGGTAGTGTTCTAAGTGCTGAGTGAGTTATAACTG 

TAACAAAACATGCCTGCCACTTAAGAAACTTACAGGGTAATGAGTTAAAAATGAACACAT 

AAATAAGGTAACCTGNTGATAATGGGGAAATGCTNGGAGGAAAATAAATTGGGGAGGCNG 

GGCGGGGGGGT 



CTG 
Sequence 2933

CCCTTTGGCCGCCCGGGCAGGTACTGAAACATCAAAACACCAGAATGAACATCAGGAGGG 

AAACATGAAAGTAGCCACTCCAAGTAAGGCTCCAGAACATACTCAAATGAAGGAACCAGG 

GCTTTACTGCGTGACCTAGATCAACAACAGAGCTGTTCTCCTTCTTTCAGCTAATCAGGA 

GCATTTAGTTGGGATTGGGAAAGAGGCAAAATATACATAGAGGTAGAAAATGAAGATTTA 

TATGGATMCTTCACAGCTGTTTTCCTGTGAACCACATTTCTAGCAATAAACTAATCCAA 

TTGGGAGGGAAAACAATATGTGAAGAA 

Sequence 2934 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACTTGTGTTGTTTGAATGGTATTTTGAAAGCAA

T TGCA TAGTTACTATTGATTCCTCAAAATGCATAATGGGGATGCTTCTGACCCGACTTGC 

CTTTTAGCAAGTTGAAGGTAGGTATGCACCGACAGGGCTCTGACTTCACCAGAACTAATA 

AGTTAGATTGCAATAGTAGGAGACAGATATAAATAATGAAGGAAGCTAGAGTTATGACGG 

CTTTCTGGAAAAATGTTGGGAATTGGAAAGAAAATGTGGAAAGAGAGTTGAGTGGAAGAA 

AGACTAATAGAGTAGGAGTATTAGTTTTATGAGAATGTGAGTAAATCTAATATTGGATAA 

ATGMTCTMGGCTACTAATACTACATTTTTGCAACTTACATTTTTGAGTATTTGGGTAG 

TAAM TCACTCCGATAAA MCTTTTAG AGTGGTTCACCTNTCAAGTCACATAATTTTGTG 

TTTTATCNACATATACTTGGTTTTTTTAG 

CNCCTNACTTTGGAMNACANTAAMCTAGCCATTTAAAAATTAATTCCAATTAATTTGC 
ATGTTAI 1 1 M I I I 1 1 I I AMGCATTNTCCTNGGGGGATNTATNGNCTTTGCAAAATTT 
Sequence 2935 

CCCTTAGCGTGGTCGCGGCCGAGGTACGCGGGGCAAGCAGGTGTTCCATGAGCTGAGCCA 

GCAGACCCATGGCATCACCCGGCTGGGCCCCTACTCTCTGGACAAAGACAGCCTCTACCT 

TMCGGATGGACTCCGTGTTGGTCACTGTCAAGGCATTGTTCTCCTCCAATTTGGACCCC 

AGTCTGGTGGAGCMGTCTTTCTAGATAAGACCCTGAATGCCTCATTCCATTGGCTGGGC 

TCCACCTACCAGTTGGTGGACATCCATGTGACAGAAATGGAGTCATCAGTTTATCAACCA 

ACAAGCAGCTCCAGCACCCAGCACTTCTACCTGAATTTCACCATCACCAAC 

Sequence 2936 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACTTTCCCAGGCTATTAGTATTATCTAATGAGT 

GGCTTAAAGATACCAGAAACACTGACTGAGGTCTCCACCTGCTTCTGGCTGAATGAATGG 

GAGC TTCTA GCCATTGAATAAGAGGCAGTAGCCAATGTTTCTTATTCTGAATGGTCCAGG 

TTGACTTTTATAAGAGAAACACTCANCCTTTTAATTGCTGAANGTCTGGCCCATAGACTA 

AGATTGCACATTCTTAGTCTAGGTTCACTTGAACTAGAGTGCACCCACTGACGATATATT 

CAGTTCAGGTGTTAACTCACTTGGATAAGCATNTGGCCTGTCCT 

Sequence 2937 

CCCTTAGCGTGGTCGCNGCCGAGGTACTAAANACTAACATTTATTGAGCACTNACTAGAT 

NCCATMACTGANATTGNCNTTGATCTNTGATTMTTAATCTCACMTCCTTGCTTTTCT 

GAAGTATNTNATTCAATCCTTTGAATGTCCCTAATGCACAGATAAGGAAATAGGCTGCTC 

TNCANATAAGGAATACANATCCTTGTGGATTCAACCCAGATCTGACTTGAGGCTAAACTC 

CTAACCA 

Sequence 2938 

CCCTTTGCCGCCCGGGCAGGTACGTTTGTTTATCAAGCTAGAAAATTCAGATAATAAAGT 

CAGCATCTGGATGAGGTMCACATTTTTTCTGCCATTCCTAACTTCCTCTCCTGGTATTC 

AAGTCATAGATACCGGCCTTTATCCCCAAAGAGTCMTTCTATTGTTCTTTCTTTCATAT 

AGATGAGAGTCAGAGAAACAMGTMGGTTTGGGAGAMTGGACATGGTATAGTTTTAGC 

TAMCATTCTMTATACAAMAAAAATTTMGTTGTTTTGAAACAAATAACACCCAAAGT 

CTACACATTTATAGTMTAMTAATTATCCAGCTAATTCAGATAAAMTTTGTTTCTAAC 

TGGGAAGGCCAGMGTCTGTATTTCTACAGACAATTTAAMTCGAGCCTGGAAGCAAATA 

TCCAGATMTTTGGATCCCCTTCCTCCAGTTTACCCAAANGAACAAGTTGTTTCAAGGTA 

GGGCACAGTTATACCATGTTTGGATCCACATGGATGAAAATTTACTACT 

Sequence 2939 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACCCAGAGGACCAGATGATTGATGGGTTCTACC 
ACGAGCATCTMTAGCGACTGCTGCAGMCTGGTGTGGAAMGGTTTCTATTTTATAAAA

GMAGTATCCAGAAGGCTAAAGGCAGTTGTGGMTTTATTTCTTCCCAGTGCTCTGATTG 

TCAAAATTTGAATAAAGCTCAGTGAAAGGAGGACAGTAATTAATGACTTGATAGCCAAAC 

ATCGTMTGATGATTTAGTGATATACTAGACTGGATGAAAGAGAGGTCACTGTCTTTTCA 

CTATGGATTCATCATAAACAGTATCATT 

Sequence 2940 

CCGTTAGCGTGGTCGCGGCCGAGGTACCAACTCATTATAAGCAGATCACAGTCATATAGT 

GAGAGAGAAGGAAAAGAAAAACCTATGTAGATCTAGAGGAATTACCTAAGAATTCCTAAT 

TACATA GGATTAAGCT CTGGGCTAACATCAMTTTTMGTAATTACTAAAATAATTTCTT 

TTCTTTCI 1 1 I I I I I I IGAGACAGAGTCTCATTTTGTCACCAAGGCTAGAGTGTAGTGGC 

TTGATCTCAGCTCACTGCAACCTCCACCTCCCGGGTNCAGGTGATTCTCCTGTCTCCAAG 

TAGCTGGGATTACAGGCATGCACCACCACCTCCAGCTCCTTTTTTTATTTTTA 

Sequence 2941 

CCCTTTAGCGGCCGCCCGGGCAGGTACAGTTCTGCTCAGAATGCCCTATCTCCTNANTAA 

ANNNNM NGNN GGAGANANGNCCTATTTTGGGAGMCTAATTTTGCAATTCCTGCCAATT 

TTTATGCATTTTGCCCTTGTTTCCATMTTCAGCACCTCCATCATCTCTTCATCACCTCC 

ATCMGATACTTTCACTTCCTTTTAAMGCAAAGATCTCTATTTAATGTAATGCTTCTCA 

GTTTATTATTTACATTTTTATTTTTTCACC 

CTTTGGTTAATGCTCTGGTTACAAAGTGGCATTGCTGTGGGGCAGTCAGCTGTAACCC 
Sequence 2942 

CCCTTAGCGTGGTCGCGGCCGAGGTACAAGGTGATACATTCTAGCCTTCTACAAAGTGCA 

CTTAGACCANGATATTCTTTCATGATGGCAGACAAATGAGGAGTTTCTTTTCATTATCTA 

ATGAAAAAAGAGTTGGTCTATATTTTCTCAGTGTATTCCTATAGGGAAGAGAGTCTTATT 

GTTGATAACCTTTCAGACAAGTAAGATGCTTCTGCATCCTATGAGGTATTCCCAAAGCCC 

CTCTTTMT CTCC ACCATTGTGCATMTTTTGTGATAATGMGCAGTTATTTTCTTCTCT 

TGATAGACATTTTATGAAGGCAGAGACTATGCCACTCTTGCTTACAGTTGTAGCTCCAGC 

ATCTAACATGGTGGCTTAACACATAGAAGATGCTCAGTAATTACTTGTTAAATGAATGGG 

ATGGGTGGGTGGATATGCCCTAATATGATGCAGCTTGATTATGCTNGAANTCTGGCTGGA 

TTMGACATCATTCCMGMGTAMTACCAGTANTCTTTAAACTGTTGCCCTGTATCTTA 

CACACCTGAMGAGACATCTTTCCAAGGGTGATNAAAAAAAAAGGTCATACCCTTTGAAA 

AA 

Sequence 2943 

CCCTTAGCGGCCGCCCGGGCAGGTACTTAGATTTGCTTAGATACCTGTAAAAGTGGGATT 

GGTA TGGGATAMACGTTCTTGTGCTGCTTGCAAAAGTCTTTGGAATATTCTTTGGGATA 

TTTTGGTGCTTAGGMGTGCTTTTCAGGTTATATTMTATATGCTTAGTCTTCATGCTTT 

CMGTCCTGTGGGACGTTMTGTTAGTCTTTTMGTTGAGTCCTTTTTGGTTATATTAAG 

AGGTAGTTCTTGATGTTTCAAAGGCCATCCAGAAATAGGAATGCCTGAACAGGAATTTCC 

AATTAAGTCGGTCAGAATCCTGAA 

Sequence 2944 

CCCTTGAGCGGCCGCCCGGGCAGGTACGCGGGACACAAACTAACCTCTAATGTCAATATA 
CTGTGAAAAACACTGTGATACAGATATGTGGAGGGTAGTATGAGGACACAAAAAAGAGTG 
ACAGCTAACCCAACCAAGGTGGTCTGGAGAGGCTTCCTGGGAGACTGATTTAGACTTAAG 
TGTTATGGACTAAGTAAGAGCTACCTGAGTGAAGAAGGAGGGAAAGGAGACTCCATACAG 
AAGAGCTGAGGAATGAGGAGTGTCCTAGGGTAGTCATGGTTTCC I I I I 1 1 I I I I I I ATGA 
Sequence 2945 

CCCTTNCGAGCGGCCGCCCGGGCAGGTACCAGTGATTACCTGCCCCTCCTACCCTGGGCC 

TGCCCCATCCCCCACTTAAGAACTACAAACTAAACTACTGATGAAAACATTTATATATGT 

TGATACTGAAAGANTCCCTGTAGAGGAATAGCAACTGGATANGTAAGCNACTGCAGAAAA 

AMGGTGGGNAAMTTTGNTTAGGGCATATCCTTTANNTTTAMCTTGMCTTTCTTCCN 

ACNACCNMATTATAGNACCCCMGGNACCTTTGGCTTAACCTGCACNCTTAATGGTTNT 

NCAMATTAAAAAAGNGGAAAACCMGGTTNGGCATTAATTGGTTTTTACNAAAAAATTT 
TTAAAAAAATTTGGGGGNATATGNAACCAACNANAATTGGTNANCTTAANATAACNNACC

NMATTNGNTTGGGNGGAAAATTCNCCNTNGAAAATTTTGGGGTAATACCCCTTTGGGGG 

AMCCCANNGMAMAAAAAGGGGGAACCAACNTTTCNAACCAAAAAAAAAGNTTTGGGG 

GAAAAAANAACCTTTTGGGGNTTAAAAAAAATT 

Sequence 2946 

CCCTTTGAG CGGCC GCCCGGGCAGGTACTATCTTTTATTTAGATATGCTGAGATAATTAC 
AGATGAAATGTTTTTGGGG I I J I I I I I CTACATAATCTGTTGTAATAACTGGTGGGAGAA 
AGTGGGTG TATAA ACACATTGCTGCAATTAAGTGATGGAGACATGAGTGTTCATTGCACT 
ACAATTACATTTTATTGAAAATTTCCTCACTCAAATTTATGGAATAGGTATCTGTTAGAA 
ACTATTGATCATTCTCTCTAAATTTGAAAAAAGAAGCTGTATTTGACTCTTATGTCCCAA 

TAAAACTTAAAAAATGCTGCCCGAGAATTGTTTGGTGTGAAAAAAAAGGAAACTACAAAG 
TCTTTT 

Sequence 2947 

CCCTTTCGAGCGGCCCGCCCGGGCAGGTACTACCCAAACCTAAATTTAGAAACAGGATTA 

GAAAAATATCATTCCTATTCTTCTTGGCTATAMTTTACCCATCAAAGTTAMTTTAATT 

ATAC ATTCTTA ATAGTCTTGGATAGACAGGGCTTCACAGATAAATTGTAATTTAAGCAAG 

AATTAi ii I l ICCTGTTTAACCCTAAACACAATAATGTGGTCTTTACTAGCTCATCTCAA 

TTGMTATCATGCCTTAGTATTTGGGCTTTATATAATTTAGTGTGTATATTMTTTCTGC 

TCCTGAGCTCTTATAAATGAAACTGCTGGGAGCTGTGCACAAGGCTTAAACTAAAGAAAC 

CTCTTAMGACAG TGTG GATATCACTGAMGTATTTTGGGGGAAMCTAAATTTTAATGT 

MMGAGMCAACTTTTAGAAAGAAGAGAACCATAAAAATAAAATGCAGCACTCCTTTGA 

CTTTGNCTAATTCCTTGNGTCTTTCTGGGTGGAATTCTGTAGTCAATGNGGTTTAAAAAT 

ACATCAGCACCCTTTGCTTCTNCCTATMTGGAAAMTTTAAATTTAAAACAAAA 

Sequence 2948 

CCCTTAGCGTGGTCGCGGCCCGAGGTACTTGTTTTTGGCCACAGGAATCTTAAGTGACTT 

AATTAACAGGTGAGTGGCATTGCTCCTGAACAGAGCTACCTAGTTATAATCTGTGTTACC 

mGCTTTAAAATGTTCTCTTTACTCCATTTCCTTAGTTGATATGCTTGAGGCAT

TGMTCCATCATTrATTCACCCACTCATTCATTTCACATGTATTTGCAGTTGCAAAGCAG 

GCAGTAGGAATAAAATTAGGATCAAAATAAGGTAGTAAGTAACGAAATTTGCTGAGTATG 

TTCTGAGAGAATAGCTGAATATACATTTCTAAATGGACGACAMTCCATTTGGAAGTCTT 

TCTCCTCCTGTTATTTTGGTAA 

Sequence 2949 

CCCTTAGCGTGGTCGCGGCCGAGGTACCGTAAATGGAAATTATCCCGCAAATTACGGCAT 

CTCCAAATGTAGCGTGGAAACTAATATTGACATGTCAACACAGGATTAATACATAGTAAT 

CAAAGAGTGCTTAAAAATTCATTAAAAATGAAGTCCCACCTCTAAAAGTGAAATGTGCAA 

ATTAMGTAGCAGTTAGATAATGTTTTGTCATTTAGGTAAAAATTACAAAAGAATCGGCC 

GGGCGCGGTGGCTCACGCCTGTAATCCCAGCACTCTGGGAGGCCGAGGCCGGCAGATCAC 

CTGANGTCANGAGTTCAAGACCAGCCTGGCCATGGTGAAACCCCGTCTCTACTAAATATA 

CCANGAAATTAGTTGGGGCCGTGGTG 

Sequence 2950 

CCCTTAGCGTGGTCGCGGCCGANGTACANCAAGATAAGTGACCTACTGAGGGACAGAGTT 

CAAAATAAAATAGGTTTCAGTCACTCAATGAAGTGCTTTGCTGGGAGATCTCGCCTGGTC 

CTGTGGTCCTTTAGTGAAATATGGGGAAACCAGCAGGGGACGTAGGTCAGATAGTTTATA 

ATGGTGAGTTATAGGTTGTATATTATAAGGAGCATTTCTCAGCACTGAAGMATTTCACT 

AGATTCCAGCTCACCTTTCCAMCTATTAMAGCAAMCGGGACTGTTTTTCAAAAATTG 

GAAAGTAMCCTGGCTTCTTGTTAGAMTTTGGAAMTATTGAGAAGTTTTAGACCCAGT 

CCCTTTAACCTTAAGGAGCTCACATGCAATAAGATGGAAGGGCAGGGGAGATCANGACAG 

AAMC TACCGA TATGGCCGTGAGAACATGGMGCTTATATCAAGTGAATTCTTCTTTGAA 

TTTACC I i I I i ATAMTCTTTTAAAAACAAATTTGACTTTAAAAAAGCCCCAATTTTAAA 

TCGGAAATGCTAATTAGCACCTTTAAAGTTAGATGAAACATATCAGTGGAATCAAAGCCT 

ACGCCTCAAATGGATGTATTTTAATGGTTT 
Sequence 2951

CCCTTAGCGTGGTCGCGGCCGAGGTNCAAACATTAAGTGTGTAATTCAATGAGTTTTGAC
AAA TAATGCA GTTTTGTAAAAACTTTGGTGAAATTGGAAAATAACCGCACCCCAGTCACC 

TTG' iGGCTACTACTAAATTGACTCTGATCACATGGAAGTGTAATAATTTTCTTT 

AGTGTGAATGCCCTGAGCCATGAGGTAACCCTGTTCTGAAGTGGTTTCCTTGCTTGTAAA 
TAA ATAA TGTTGACATGTAAAGCAGGACCACCAAACTGTAAGTCTTGTAGTAATTTTAAT 
GTATTTTCCACTCTGGACAAAAAGATGTGTCTTTATTCCTGATAAATATTAATAGTTCTA 
GMGAGGACATTTTCCAAATTAAAGAATTAGGTATTTNTTCATAATTCCCGGGTAGTA 
Sequence 2952 

CCCTTAGCGTGGTCGCGGCCGAGGTACAAATTTATTAAGCTCTAGGTTCATCAGATGACC 

AGGAAAACATAGACTATCANGACACANACAAAGCAGAACTGCAGAATTATAGAAAATTCA 

AGGTCTCATGGAGGAAAAGCAGTATCTACAATAAGAAAGAGACCCCAAACAGCACTCTAG 

ATTTCTT TMGA GGGATTATATTTCCTCCTTCCTCCCATTCTCCCTTCATTTCTTCCTAC 

CTCCCTCTTTTCATCCTCTTCTCCTTTTATCTTTTCATCATTCCTTTCTTAAGCATTTTA 

CTTAGTTACTACTATG 

Sequence 2953 

CCCTTAGCGTGGTCGCGGCCGAGGTACAAGCCGTGTCTAAGGAGCTGGTTGGAGAGTTTT 

TGCAATTTGTTCAACTTGATAAAGAGGCCTCTGATCCTTTCAGCCTAAATGAATTACGAG 

ATGAATTATCAAGGAAACAGAAAGAAGAATTATGGCAAAGGCTGAAGAATTTATTGACAG 

ATGTGTTGTTAGAAAGCCCAGTGGATGGGTGGCAGGTAGTGGAAGCCCAGGGTGAAGACA 

ATATGGAAACCGAACATGGCTCAAAAATGAGAAAAAGCATAGAAATAATTTATGCAATTA 

CATCTGTGATTCTTGCTTCTGTGTCTGTAATAAATGAAAGTGAGAACTACGAAGCCCTAC 

TGGAATGTGTTATTATATTAAATGGTATTTTATATGCA 

Sequence 2954 

CCCTTAGCGTGGTCGCGGCCGAGGTACAACGTGGGCAAGCATCTGTGCATATGAAATGCT

GTTTTTCTAAAAGATTTCTCACATTGGCAMCTGAGAGATTTTCTTCTGGAACACAGGAA 

AGATTATATTAATGCTNATAGCCATACCATGTCTGAATATGGGAGGATGACAGACACNGA 

ACGAGACCAGATAGACCAGGATGCCCAGATATTCATGAGGACCTGTTCAGAAGCAAAGGG 
Sequence 2955 

CCCTTGAGCGGCCGCCCGGGCAGGTCCGGGCAGGTACACTGCATAAAGCCAGAGTTAAAA 

CTTCACTGCCAGCCTCTGAACAGAAGGCTGTTCTATCCACACTATCACAAGACCTGGTGG 

AGTTGAGGCAACTGCTGAATTACCATACAGGGAAGAATGAATTCAAGAAAATTCCCATGC 

AAGATAGGCTCTTAAAAAATAAATTTACACAAGAAAATCAGCACTGTAAAGGTAATTGAT 

AAGCCCAATAGAAGGGAAACCTATACAAAGAAATAGAAATAACTAAGCAATCTGAAATGG 

ACTTTAAATAATGATGTTTACAATTCTCTAAGAGGAAAAGGAGCATTAGCATCAGTGAAA 

CAMAGTAGGGCTATAGAMAAACMTACTTATGAAAAAAACCAATTGGAAATTTTTAGA 

TGGAAAAGCGCGAAGTAAAAAATTCAACCCCATGGTCTAAAAGAATAAAACTGCACCCAG 

CTGGMNGGGAAMTTANTTMTTTTTACGAAGAAACCAATTAAATCTTACCAGAATGTA 

MGGAGATMAGATNTTTTAMTAMTCNGGAGTTTAGGAGATNTAAACTTTTTTCCATA 

TGGAGTNTTTTAAATATCCATATGGTTATTATGTATTCCATATATATCCCNAAAGGGGAA 

AGGG 

Sequence 2956 

CCCTTAGCGTGGTCGCGGCCGAGGTACAAACTATTCTCTTTCCTATTGCATTTTCCCTTG 

AGAACATGAACTGACAAGGAACAAGGTGAAGGAGTTTGACTGGACTTTAGGGATAATATA 

ACTCCCTTAAAAGAAGTGTTAAAATGAAGGAGAGATTTTACAAATGGCAAGTAGCATAAA 

AAATGGGAATTGTAAAGAATTAGTATAGATGATTTTTGACTGGGGAAACTTAAACACTAA 

TTACTTTCAGAAAGMTTTGAGATGATTTTCTTTAGTAGGGGTGATAGTGGTAAGGCCCT 

TCTATGAAGT ATGTG ATGGATTGGACTACTTCTGAGAGCCCTTTCTTCTCTTAAAATCTT 

ACGGTAAAAG 1 1 1 1 1 GMGMTCATA7TTCMGAACTTTT7TCTTCTTCCACATTAGCCA 

TCTTCAGATAATAAGAAACCAGATACCTA 

Sequence 2957 
CCCTTTCGAGCGGCCGCCCGGGCAGGTACGCGGGGATGGC

TAAGMGTCTGTCTTTTAACTGACTACATTATTTATATGAAATAGAATTCTAGTATATTT 
TGACTCAGTTTTCCCAACTTATTTCGTATTTTCTT

CATCAACCCCCTTCTTTTGGGTTGGTTGTGTTTTACGCACCTCCTCCTCTTTTCCTCACT 
CCACTGGTTTGCMGTTATACATTTGATTTCTTTTCTTTCAGTGCCATGCTCTGATAAAA 
TAAAGGACATTTATTACAGAAAA 
Sequence 2958 

CCCTTAGCGTGGTCGCGGCCGAGGTACACAAGTCACGATTATACAGTTTAATGAGTAAAC 

ACTGATCACACCCATATAAACCACTGCCAGGCCAATAAGTAAAATCTTACAAGCAACCCC 

AAAMMCTCTAGGCCTCTACCTTTAGAATGCGTTTGCAGTATTTAGGACTATTGTAGTT 

CC AAMTTA CTAAAAGTAATGAATCATTMTGTATGTGTATTACATGTTAGAAATTTTAT 

TTATTTTTTGAAAGCATGCATGTTTATCATTCTTMTATACTTTCTCCTGTTTAA 

TATAAACCTTAAACTGTAGTGACTNTTAGGAGAGTGAAATTCATGAATATAAAGGTAGTC 

CATAAMCCGTGATTTGTAGATTTTAAATCTGAAAAACATTCTCATATGCACCATAAATT 

TTGTCACTGTTCTATTTGATTTGCNTATCTCTACAGGTGMTGMTTTGTTTNNTTTGAA 

GAGGATGGGGGMGCNTTTNCCATTTTAAACAGGGATAAAAACTGAGGGAAANTGA 

Sequence 2959 

CCCTTAGCGTGGTCGCGGCCGAGGTACAAATTTATTAAGCTCTAGGTTCATCAGATGACC 

AGGAAAACATAGACTATCAGGACACAGACAAAGCAGAACTGCAGAATTATAGAAAATTCA 

AGGTCTCATGGAGGAAAAGCAGTATCTACAATAAGAAAGAGACCCCAAACAGCACTCTAG 

ATTTCTTTAAGAGGGATTATATTTCCTCCTTC 

CTCCCTCTTTTCATCCTCTTCTCCTTTTATCTmCATCATTCCm 

CTTAGTTACTACTATGTT 

Sequence 2960 

CCCTTAGCGTGGTCGCGGCCGAGGTACAAAATTGCTTGAGTCTGAAGAACCTGCTANGGA 

GCATATACATCTTCAATTAACCTACAACTGGTCTTCAGTAAAACCTCTGTCCCTGNCACA 

CTGMTNTGGTGTMGTCATTTTTAAAACTTCTGGCCAAMCCCMCCATMTATGCTTT 

CTTGTAAGCCANCTTCATATACCTAAAGAAAACATGTGAAAGGCCGGGCACCGATGGCTC 

ACTCCTGTAATCCCAGCACTTTGGGAAGACGAGGGGG 

Sequence 2961 

CCCTTAGCGTGGTCGCGGCCGAGGTACCCACTGATGTCGTCATGGTGGCAAGTCAGTGTT 

GGCCCTGTCCTAACCACAGGGAAGGGGTTCTCTACAGAAAAGTAGAGTTTAGATCTGAGA 

AGGTAGTTGAGAATGAGTGCCGTATAGTGMTMTAGTAGATTTTTACCCCATTTTACCC 

CTTTGTCTTTGGTTGTATATATATTCACATACCATTCTMTTATACATATAGTTTCAAAA 

ATGCCTCTGCTTCTCCMACACAACTATACTTTATATATATTTTCAAAAACAGTTCTACC 

TCCTTCTTCCAAAGATAAATATCTGTGTTTTGTGTCCTCCATGGCCACCCCTAAGAGAGG 

TGCMGGAGGATTTCATTCCTGGGAATTACCTACTTTAATAGCCT 

Sequence 2962 

CCCTTAGCGTGGTCGCGGCCGAGGTAC I ( I I M I I I M M I I I I I I I I I I I GGAGATGGA 

GTCTCCCTCTATCGCCCAGGCTGGAGTGCAGTAGCACAGTCTCGGCTGCCAGGCCTCCCC 

AATCTCTTCTTCTCACAGGGCAGGGCTGAACAGTATCTCCTCCTGGCTGCATCCGAGCCC 

ANAAGTCAAAACCTAAGGGGTCCCTGCTTGGCTGGGACAGAGGCCGCANACCTAGGTCCA 

GCTNTGACTCGACAGNGGCACATCCCTGTGAAGTAGGGCTTGGGGGCTTCCCATGAACAG 

CTTTCANAGTGAGTGCAGTAGAGTGGGGCTGGAGGACAGTTTTGACAGATCTTGAGCAGG 

TGCTTGMTTTTAACCCMCATCCACCCTTACCCATGACCCAGGTGACATTTCAGAGGAG 

GGTTCACCTGCCCTGCAAAGTAAGAGCTGCAAAATCA 

Sequence 2963 

CCCTTAGCGTGGTCGCGGCCGAGGTACCAATGAGGAAGAGAGTTTTGTAAAAGCAAGCAG 
GCTGGCTGTCATTAAAAGTTATCTTAAATAACTTGTCGCAGCCCACTCCCACCAGTCCTG 
AAAATAGTTCATTCAAGTGGTAAAACAAGACAAGGCAGAGGACGATGTCTCTAACTTCCC 
TGATGTGAGATCTAAAGCCCCATTCCTMCTCTTGGTTTTAGAATTGAGGAAAGTGGAAT 
MATGCA7TTGGAAAGGATCTGTTNTCTTCCCAGGTTTCCTGCCTGGGTTGAAAATAAAG

GGTTCAGGGGATGGAMCAACTTTGAGGAACATAAAGAGGTATTGGGGGTTCATCAATTC 

ATCTTTGTTTCAAGATGGGTCCCCCCCACCCTCCNCAATGCAAGTTMTTAGGGAGATAA 

TTTTAGCCTACCTTTGGGMTTNATNTNTAAANATAAAGTTATCTTTTTTTM 

TCACTTTMTTGNCCNCCCCTGNATTTGATMNGAAAAGTAGTT7TTTTCTTATTTG 

NCTNTTTTNGANTGNCTATTNGGMNNCNTTTTATTTTGGGNACTTTTTTGGG 

AAAGGAAAGTTAAATT 

Sequence 2964 

CCCTTAGCGTGGTCGCGGCCGAGGTACAAAGAAGCTGAGTGATTTGTTGAGGTCCCAGTG 

GAGTGTCAGACCTAMCTTCCAGTTTCCCAATTACTTAAATAAGGTTCTTCTCATCAGAC 

CCTCTTTCCTCATCTCTCTACTAAAAAACTACAGTGAATAACTTCCACTGTCATCAAATA 

GAMCTTTCTGTGCTGGTTTCCCCTACCCCACCTTTCTGTGCTTGATAACAGAAGCTGTT 

GGGAGCCCCACCTTCAGTCTTATTTGTCCTTCCTTGCNGGTTTTTCTTTG 

Sequence 2965 

CCCTTGGCCGCCCGGGCAGGTACAATAATGGGATTGTGGTTATAAGGGAGAATGCCTCTA 

TTCTTCAGAGAGGCACACAGATTTACTTGCTTTTAAATGGTCTAGCAAAAATAAAAACAA 

ATCCAC TCATA AAACTAATATGACAAAGTAGATGGTTCTATAGGTGTTCAATGCATCATC 

CTTTCMTTTTCTAMTATTTGAAATTTTCTTAATAAAMGATAGGGAMCAAAGG 

TTGGAGGAAACAAAAACAAAAACAAAACAAGAAGATCCAGTCTAATGGAAAGATGCAATG 

GACTCAGGGATCCTGCAGCCAAAG 

Sequence 2966 

CCCTTAGCGTGGTCGCGGCCGAGGTACTGTTGTTCCAGACTGTATGATTGAAATAGCTGT 

TATTTTCCCAGTTTCTGTAGATCACATATAGGAAGTTCTGCATAATCATAGTGATGAAAA 

CTCATGTT TAAA AAATCCATATATAATTAGCGTGATACAACACAGCATCTTAACACTGAG 

GCmMGTmAATMTTCTGTTATTCTCAGTMCACTGAAAGTTGCCTGTGCTCTTTC 

TGTCACACAATTGATTCCAATGTATTTTAAMTGTGTTTTTCTGAGTCATTTGTTGCCTT 

GTTACTTTTTAAA 

Sequence 2967 

GGNACTTAGCTGGGAATCTGCATCTTTAGAGNTCAAMCTGTGCAGTTTTTCCTCTGATT 

AGGTATTATNATGTTGAACTCCAMGGGTCATTTACATTCCTTTATAACACTGGAGCTCT 

GGTG TATCG AATTGNGTTATGAGATMGCCACTNGCAGGGACTTGAACATTCCATTTTCT 

TTAGATTTTGTNGTATCAGCATGTGAATATGCTGAATACAACTTGTATCCTAAAGCATAC 

AANCTATAACATTTTCACGTTGGACTCAAATTTC 

Sequence 2968 

CCCTTAGCGTGGTCGCGGCCGAGGTACCTGGAAGGCCCCACTGTTACTTCTTCATACAGG 

CAAAAGCTAATCAACTATTTCACCTTGCAGCTAAGTCAGGACAGAGTGACTTGGAGAAGT 

GCAGATGMCTCCCGTGGCTTTTTCAGCAGCAGGGAAGTAAACAGAAGCTGCATGATTGC 

CTTCTTMTCTCTTTGTGTCTCAAMCCTTTATAAAAGGGGACACTTTGCTGAGTTGCTG 

AGTTATTGGCAGTTTGTTGGCAAAGACAAAAGTGCAATGGCAACAGAATACTTCGATTCA 

TTGAAGCAGTATGAGAAAAACTGCGAAGGCGAGGACAACATGAGTTGCTTAGCTGATCTT 

TATGAAACCTTGGGGCGATTTCTCAAGGATCTAGGCCTTCTCAGTCAGGTAAGCTCAGCA 

GATCMTGGC MGCC AGAGACAGGTGGGATTTTTGTGTTCACTCTTCTCTTTTTCCAAGA 

TGTTMTGTGGTTTTCATTTGTTGTGCTATMTCCTTAATATTTAATCAGATGACAGATC 

TTGCAGCATTTCACTCACTTTAACTGGCCTTTCACCCCTTT 

Sequence 2969 

CCCTTCGACGGCCGCCGGGCAGGTACTTGAGTCTCTTAACTCTGTAAAATCAAAACCTAA 

AGAM ACAAAAAGATTATCCCTTGCATTCGMTACTGTTTAGTCAGTGTATATCAATTTT 

TTTTTACTGTATCACTGCATTCAGAGGTATCAMTTTTGCTTGCAGCGCTGTAAMGTTT 

ACCAGTCAGATTTGTCGGCCCGGCATGGTGGCTCACACCTGTAATCCCAGCACTTTGAGA 

GGCCAAGGCGGGAGGATTGCTTGAGGCTGAAAGTTTGAGACCAGCCTGGGCGACATGATG 

AAACTCCATCTCTACTAAAAACACAAAAATTGGCCAGGCTTGGTGCCATGCACCCGTGGT
CTCAGCTACTTGGGAGGTTGAGTGGGGAGGATCGTTTGAGCCCAGGAGATTGAGGCTTCA
GTGAG 

Sequence 2970 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACTTGTGTTGTTTGAATGGTATTTTGAAAGCAA 

TTGCATAGTTACTATTGATTCCTCAAAATGCATAATGGGGATGCTTCTGACCGACTTGCC 

TTTTAGCAAGTTGAAGGTAGGTATGCACGACAGGGCTCTGACTTCACCAGAACTAATAAG 

TTAGATTGCAATAGTAGGAGACAGATATAAATAATGAAGGAAGCTAGAGTTATGACGGCT 

TTCTGGAAAAATGTTGGAATTGAAAGAAAATGTGGAAAGAGAGTTGAGTGGAAGAAAGAC 

TAATAGAGTAGGAGTATTAGTTTTATGAGAATGTGAGTAAATCTAATATTGGATAAATGA 

ATCTMGGCTACTAAT ACTA CATTTTTGCMCTTACATTTTTGAGTATTTGGGTAGTAAA 

TCACTCCGATAAA MCTTT TAGAGTGGTTCACCTCTCAGTCACATATTTTGTGGTTTTAT 

CTACATATACTTGGTTTTTAAGTCCAAAAATATTGGTTGGATTTCTGGTATACCCTACNT 

TTGT AAAAAG CATAA ACTAGCATTTAAAATTAATCNAATTAATTGCATGTT A I I 1 1 I 1 1 1 

TTTTAAAGCCATTTTCCTGGNGATATATGGCTTTTGNAATTTTA 

Sequence 2971 

CCCTTAGCGGCCGCCCGGGCAGGTACGTGTCTGTGGTCCCGACTACTGGGGAGGCCAAGG 

TGGGAGAATTGCTTACGACCAGGAGTTTGAGTCCAGCCTGGGCAACATAGCAACACCCCA 

TATCTTATTTAAAAAAAAGCACTCTATGTTTGTTTTGGGCCTCTTTGAGAACATTTACTA 

TGAMTTTTATTTATCTTTGTTTCCTTTCAGTTCTTCTTTTGTTTATTGTGATTATCTTT 

CATATAACAATGTGCT TAAAT AGCTGGTGGTTCTTGATAGGTCACTCAACAAGTGAGCAA 

ACTGAT GGAT CCTGTGTTTTGTGAGCAGTGTTTATTGGCTGTGATAGAGGGTGGGGACCT 

AGATGCTTTTAGGGGATCCTCCTGTTATCCATTTCTG 

Sequence 2972 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACTGGAAGAATGAGATAATTTATTTGGCAAAAT 

TCATATGTATATGTTATGCCAAACACTTGGCTAATATCTGGATGAGATGGGCAATTTACA 

GACTATGTAATCAACTAGGGTTCTATCTGTAGTTGTGTTATGAGAAACAATGGTTCATTA 

ATTCACGGAAATGTGAATTAAAAATGTCATTAAGCGAAAGACTCTGTTAGCTTGAATTCC 

TATTGGGATATGCMGACACATAACATTTATGCATATTTGTGATGTTCTAATCCATTTTT 

AGGGAATAGGATATTATATAAAGCACCCTAGTTTAAACAAATACATGATTACGGTTGTCA 

ATAGAGATAGATATAAATAGTGACCTGTGTMTTATTTTGCTAATGCTGAMTTTTTTTT 

CAGACAAGTGGATCCAAAA TCTACC AATGATCTTGCATTACATTAAGGGGAAAAGTATCT 

CCTTAATGGGGAATCTTTAC I I I I I I I 1 1 1 CCAAATTGTGCCTATCACATGATCTTATGG 

ATAGATAATGGTTGGTATTCCTGGTGGATTCTTATGNGCAAAAAAAAGAGAACTTGATAC 

ATTGGTTAGTTACATTGCTCTCTTTCTGNGCATCAAGCTTCAACTATCAAGATTTCTAAG 

TGGCAAAAAAGTTTGGGGGCTGNGTTATAGAAGTGGAATAAAATNTTACTGNACATCATT 

CTTTTAATTAAAATCCCN 

Sequence 2973 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACAACAGCTTCCTAACTGGTGTCTCCCTTCCTC 

TNAGCCNTGCTGTGCATCATTCTCGCACCCTGCAGAGAGTTATGTCTTCACAATGCAATT 

TTGGTGTTACCCCTTAACTTAAAAACTTCCAGTGCACTTAGGACAAAATCCAATATTCTT 

AAGCCCCCAAAGACTCTTCATGATCTGACCCTATCTTGCTTTCAACATTCATCCCCTCCA 

ATGTTCTCTCTCAGGTTCTAAGCTTTCCACAAATTGCTACATCTGCAAAGATTACTGCCC 

CCAGCCTTACTCCATCTTCTTTACCTAAATCAATCTACTCAGTTTTCAATTCATTCTTGG 

ATTAAACTTCAAGTTCTTTAGG GG AAG CCCTTT 

Sequence 2974 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACTGATCATTCAATGAGAGAATATGTCAAAAAA 

TAGTTGGTGCATAGTAGATGCTCAAGAAATGTCAAATTCTCCTGCCTTTTCACTTAGATT 

CTTAAAAGTATGAATGGTTTCATATCCTCATTGACAAAGTATTTCTCTGAAAAATTCTAA 

GAATAAAAMTCCTACATCTCTGAATAATTACATTCATGACTCAGGGTGAATTAAAGCTT 

CACTTTGCCTGGAAAGACGCAAAAGAAACTTGAAGACAAAACAAGAAGCAGATCACAAAG 

ACATAAAGTATAGTG

Sequence 2975

CCCTTAGCGTGGTCGCGGCCGAGGTACACACTTCTGTAACTTAATATGTCATTTGGATAC 

CTTTTCTCMGMGACTCAMTAMTAATTTTATTATGCCTTAAATATTACCAAGTCAGA 

ATACTCACAGATGGATAC TCTG TCCAAATGAGTATTTGTGATTGCGCATTTACACACACA 

TATMTTTGTCATTGGTAATTTTGTTGCAGAAATTATTGTGAGAAMTAAATTTTAAAAG 

AAMTACGATTCTATGMTACATTTTATATTTTACCTTTTTCTGGCACATACTGTTGCAT 

CAMCTGATTMTTAATGTAATTATGTTAAACTTCTCCAGTTAAAAAAGAGAATATACAC 

TGCAAAGCTGAATGTGTATACTACTTGTTGAA 

Sequence 2976 

CCCTTAGCGTGGTCGCGGCCGAGGTACTTAAAGAGCTCGTGGTGTGTGCCAGGAGTGTGT 

TTTCCATGCA TTCTT CCTGAGAAGTAGGTTTGATTATACAGTATCTATTTTACAGATAAA 

GGTTGAGGTCTTTTCTTTCTAAGGTGTTATTAGACAGCTATAGACATGGACACCAAAACT 

NGNCGAGCCTNCGGCATTGTAAAGGCTTGGCTTACCTTCCATCACCCTCAGCAGTTCTGC 

TGGAAGGGAAAAATGGAMCAGGGCCCTTAATTTTTTCTCATTATTGGATTAATTGGAAG 

CACTAGGCATGGTNTGCTT 

Sequence 2977 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACATTTAAAAGGTGATGCTAATACTTTAAAATG 

TTTAAGATATAGATTTAAAAAGCATTGTAAATTGTATACTGCAGTGTCGTCTACATGGCA 

TTGGACAGGACATAATGTAAAACATAAAAGTGCAATTGTTACACTTACATATGATAGTGA 

ATGGCAACGTGACCMTTTTTGTCTCAAGTTAAAATACCAAAAACTATTACAGTGTCTAC 

TGGATTTAT GTCT ATATGACAAATCTTGATACTGCATCCACAACATTACTGGCGTGCTTT 

TTGCTTTGCTTTTGTGTGCTTTTGTGTGTCTGCCTATTAATACGTCCGCTACTTTTGTCT 

GTGTCTACATA CACA TCATTAATACTATTGGTATTACTATTGTGGATAACAGCAGCCTCT 

GCGTTTAGGTGTTTTATTGGATATATTGGATTTGNTTATATACCATTATTTT 

Sequence 2978 

CCCTTAGCGTGGTCGCGGCCGAGGTACATAACACATTTATGTGTCTATGTGTATATACTA 

TTTATGTAAACATATAATATATGCAAAACATATATGCATATACACATACCGATATATGCA 

MTCTGTATA TAGA AAAACTTAGTTTACTCAAGATTATTAAAATTGATCCTGGGAACAAA 

ACAATACACCTTTTCTAGGCATTTATCCCTACTATAGAGACAGAGAAAGAAAGATTGTGA 

GAGGGAGACACACAQATTCAGATGAGCCTTCAGGGAATTGTATGMCTTTTGAACCAGAG 

ATGGAGMGCCAGAGCCCAGCCtGTTCCCACTTTAATAAACATTTTTTTCTTCMTTAAT 

TMGGTTTGGGAAAGAAAAAAAAACCCCCAGAAGAGAAAGTAATTTAACATGCCATGCAT 

ACTTTACAGATGGATCCTCCNTCNGATGCCTATTAAATCTATGCATGATTGCNCTGGAAA 

AMTTAATTGCATTTGGTTCTCACTATTGGGAAAATAGGGTGCCAACAGCTCAAAAATAA 

AAANGGAAAATGAAACCANGGTGGNTGNTGGTGGTTTA 

Sequence 2979 

ACGCGGGCTCCMTrrMTTCTTTTAGTTGMTGAGCTTCTTTCTCGCTTGTTACAGACT 

CACTTAGGGCTCATTCCTCCCTGCCTTAGTCCCAGTATGGTCTCAGTAATGACATGCAGT 

ATGGGGACTTTGGACTCCTGTTTGAACAAACCAGCTTTGAGATGATAAAAATTTGAATAT 

GMTTGAGTTCTTGAAGATAAGAAATTGTTAATTTTGTCAGGTTTGTTAATGGCATGGTA 

GTTACTCTAAAAACAAATTTCTGGGCGAGTGTAGTGGCTCATGCCTGCAATCCTAGTACC 

TCGGCCGCGACCACCGCTAAGGGCGAATTCC 

Sequence 2980 

CCCGCGTCCGCTTTCTTATTTGGAAAGAAAACAACACTCACCACGGCATTCACAATGCTG 

GGGACCTTATGAATGGGAATGGAGAGGCTGCACTAATGTAAGTGGGGTCAACATCCCTGC 

TGGAGGGC AGAT GCAAGTGGGGTCAACATCCCTGCTGGAGGGCAGATGCAATTTTGGCAA 

ACTMTGACTTTTGGMGCCATTTTCTAGACTGCATTTTTCTCTTCTTTCT^ 

TTTCTCAAGGGGAAAAAGAAAGCAGAGGTGTAAATGGTATCTGGCCGTTCGAAGAGCCCG 

TCTGGGAGACTCTCTGCTGACCACAGCACTGTCTCTCCCCAGGGCCCAGTCCCACAGTGT 

CACTGAGTCCAGAGTCGGGACTCCAACAAGCCCTCCGCATTCCAGCATGACTGTGGTAAA 

AACAAAAAAAATCTATmATTCTAGTCCATCACATTTTCTATATTTGCTAM
CNATCACTCCATAAATGATCGGTTAAAATNAAAAATATAGGCTGGGTACAGCAGCTTATG

CCCTGTAATCCCATGCTTTGGGACACTGAGGCTGGAAGATCACTTTGAGGCCANGAGTTC 

NAGACCAGCCTGGGCAACATAAAAMGACCCATNTTTTCCAAAATTAAAAATAATTNGCT 

GGGGGGAGCCTAATTCCAC TACTT GGGAAGCTGAAGGCTGGAGGATTGCTTGACCCTGGG 

AGTGAGANGCTACANTGAGCTTTTATTGCCCCACTGCNCTTCANCTTTGGCCAACAAAAT 

GANAACCCTNTCCGTAAAMATCNNCNCNNNTCAAAMNTTTTAAAAAAAAAAAAAAAA 

Sequence 2981 

CCCTTAGCGTGGTCGCGGCCGAGGTACMTMTCGTTTTTGGAATGATGGAGTAAATGGT 

GGATTTCATGCCTCAACTTGAATACACTCTCTGGTGGCTGAAACCGAGGAAGAATGTTGC 

TCCAMATATTAACGGGTGATAACTTGCTTTTCTATTTTAAAATAGMAAATTAATTCTC 

CCATTCTC TTGCATG GAMGMCCACTCAAACGGAAATTATTTTGTATTTAGTCACTATT 

TAGT TATTA l I I I I ICCAAAATTMGGCTCATGCTCAGMGCTATTTTTCTTAATATGTT 

TGAGTTTTTAAAAATTGGAAAAAATATCTCTTCCACCAAGMTTTGGCAAAAGAAAGCTA 

GAATTAAACACCGCAATAATAAATTTCACAACACTATGAAAAG 

Sequence 2982 

CCCTTAGCGT GGTCG CGGCCGAGGTACACTNACAGCCCTGTCCTGAGCCTGTGTTAAAGC 

CGGCACTCTATTTTTCTGGTGTCATGTCAGCTGCTCACATGTCTGTCCCTCACTCCATGC 

CCCACATCCCCTTGTTCCAGGGTCCGGCTCGCAGTCAGCACTTTTACCATGGGAATGCAT 

GCCTTAGTTTAGCCCACCTTCCCTGGGAAAAAGANCTGGATATACACACAACATACCCAT 

GTCTCMTTGTCTGGTATGCMGTGATATGACATATTTGTGTTMTAAATTTGNGNCTTA 

TATGCATTTGTGTGCAGGCAGTCACTCAMCTGGGACACTTTTGATCTAAATAAATACTA 

AGGNGGATCCTTGGACAATTGGCAAAGCCAGGACTGTCAGATAAATGGTCCCCTTGG 

Sequence 2983 

CCCTTAGCGTGGTCGCGGCCGAGGTACTAAAAACTAGTAATGCAAAGGCAGCAATGTTAG 

CMMTTTAAAGAGTTATACCGGGGTGAGTTTTACAGAATTAGTAAGACCATTTAAAAGT 

MTMATCMCGTGTTGCGATTGGTGTATTGCTGCATTTGGACTTACACCCAGTATAGCT 

GACAGTATAAAAACACTATTACAACAATATTGTTTATATTTACACATTCAAAGTTTAGCA 

TGTTCATGGGGAATGGTTGTGTTACTATTAGTAAGATATAAATGTGGAAAAAATAGAGAA 

ACAATTGAAAAATTGCTGTCTA 

Sequence 2984 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACGCGGGGAGCGCGCCGGAAGAAAAACCAGCAA 

GAAGGCGGCGGGGGAAGATGGCGGTCCTGGGGTAGAGTTTGCAAGCTTTCTGACTAGGCT

AGTCGAGCAACTATTCGGGTCATGGCGTCAAACTCAACTAAGTCTTTCCTGGCAGATGCC 

GGCTATGGCGAACAGGAACTGGATGCCAACTCTGCCCTTATGGAATTGGACAAAGGCCTA 

AGATCTGGCAAACTTGGTGMCAGTGTGMGCAGTTGTTCGCTTTCCCAGACTTTTTCAG 

MGTATCCATTCCCTATTCTTATCAATTCTGCATTCCTAMGTTAGCTGATATTTTCAGA 

GTTGGAMTAATTTCCTGAGGCTATGTGTTCTTAAAGTTACCCAACAAAGTGAGAAACAT 

TTGGAGMGATTCTAAATGTGGATGMTTTGNGAAGAGMTTTTTTCTGGGATTCATAGT 

AATGATCCTGTGGCAAGAGCCATCACCCTCCGGAT 

Sequence 2985 

CCCTTTCGAGCGGCCGCCCGGGCAGGTAC I I I 1 1 I I I I I I I I I I I I I I I I 1 1 GGTGGACT 

CAGGGTCTTAC TCTGT CACCCAGGCTGGAGTGCAGTGGTGCAATCTCAGTTTAAACCCCC 

MGTCTGTGGTATTTTGTTATAGTCCCCCTGGAAATTAATACAGGATTCATGGTCTGCCA 

AAGTCTCATTCCCTGCTCANACCGACAATGGATAGTGGGATCCTTTGTGATATGTGCTCT 

G TGAGTTANAA GTAGTAACTTANAGCTGTTGAGGACAATACCTTTGCACCCCATCCACTG 

AATTTTGTTTTCAAAGATACAGTCTGAMCTCCCAGAGGGTGAGTGCCTTTCCAGTGTGT 

CAGAGCCATTAGCGGCTGAGCAGACCGCTGAGCATGTANATTTCTAGGTCAGGGCCCTCT 

CCCCAMGAATCATACTCCCTGGTTGAACTGAACCTAAATCAGGTTTAAAACTAGACAGA 

TGCCAACCAACTTAAAATTCCAAAGATATNCAAGACCTGCCCATGGC 

Sequence 2986 

CCCTTAGCGTGGTCGCGGCCGAGGTACCAACAGCAGTGAACCAGGGCCTANTTGTCAGCA
ACCAANACC TNTGCTGACCCA NAAGCANACAGGTGACCCTTTCACCAAAACTTCAATCAT

CAGTCTTCTCTTTTCTTTTTTTGAAACANAGTCTTGCTCTGTCGCCCATGCTGGAGTGCA 

GTGGCGCGATCTCNACTCACTGCAAGCTCTGCCTTCCAAGTTCACGCCATTCTCCTGCCT 

CAGCC TCCCGAGTAACTGGGACTACAGGCGCCCGCCACCATGCCCGGCTMTTTTTTGTA 

TTITTCATANANACATGGTTTCACCATGTTAGCCATGATGGTCTCGATCTCCTGACCTCG 

TGATCCACATGCCTTGGCCTCCCAAAGTGCTGGGATTACAGGTGTGAGCCACTGCGCCCG 

GCTAGTTTTCTC 

Sequence 2987 

CCCmCGAGCGGCCGCCCGGGCAGGTACATTTAAAAGGTGATGCTAATACTTTAAAACG 

TTTMGATATAGATTTAAAAAGCATTGTAAATTGTATACTGCAGTGTCGTCTACATGGCA 

TTGGACAGGACATAAT GTAAA ACATAAAAGTGCAATTGTTACACTTACATATGATAGTGA 

ATGGCAACGTGACCAATTTTTGTCTCAAGTTAAAATACCAAAAACTATTACAGTGTCTAC 

TGGATTTATGTCTATATGACAAATCTTGATACTGCATCCACAACATTATTGGCGTGCTTT 

TTGCTTTGCTTTTGTGTGCTTTTGTGTGTCTGCCTATTAATACGTCCGCTGCTTTTGTCT 

GTGTCTACATACACATCATTAATACTATTGGTATTACTATTGTGGATAACAGCAGCCTCT 

GCGTTTAGGTGTTTTATTGNATATATTGGATTTGNTTATATACCATTATTTTTAATACA 

ACACATGCACGCTTTT

Sequence 2988 

GGTACAAGAGTTGGAAAAACAGAGTTGAAAGGAAAAATGAGGATAATCTTATTTAAAAGA 

TTATTGAATGCCTAAGTGCCTTTAATAACTCCACAAGGACCCTCATGTAAACAAAATTCA 

GCAATAGCAACAAAACAAAGCCTCTGAAATGCTCATTGTTCCCCAGTGAGATAAGCTTTA 

AAGAGCANAAATACGAAAAACAAAACCAAATNTCAAGTNAAACTTAAAAATGTTACTTTA 

TGGGAGAGGACAGGCAAGGCGGCAAGAACATAGAAGCAGTGCCACCTGAAGCTGGCATGA 

TMGGTTGATTTAACTGCACTGGAAGTGACAGAATCTGAAAGAACGACTTTTCAAATGTN 

AGATAAATCCATTTCAAANNATNGTTAAACTTGTGAATTCAACAGAAAAACTGTAAAACT 

TTCNGNGGAGGCTGTTAAGAGCAAAATTGTTCCCTCAGAGCANAGGACTCATTGATGCCG 

AAGATAGAGGAACATGGCCAACCCAATTAAAA 

Sequence 2989 

CCCTTAGCGTGGTCGCGGCCGAGGTACAAAGAGAAGAACTCATGTCAATGGAAAGTCTGT 

TTATCCTTAAAGCCAQATTTGTAGACTAGACAGTTCCAAMTATCCCTTGGCAATCCCTT 

TTGCAMGTMTTGGAAAATGCTGTGCCMGTGATAAAAAGCATTTCAAGTTTCTGCCTA 

ATGTCTGTTATAGTTTTCAAAGCCCTGMTTGCAAAACAAGATTAAAACTAGTTCTGAAA 

TAMCAACMGTATTTACTGAAGAACAACTATGAGGCACTCTGCTGGAACTTTGGGGATT 

TGAAGATGTGTMGACTGAGCTTACAATTTAGTTGGGGAGATCAGACGTGTGCACAGCCA 

AGGTTAAGCTCTAATACAGCATGAAAATATCTGAAATGTCAAAAAACACCAGCAACTGTT 

TCCATCTCANGGAGTGAGGCTACTATCCTCCAAAGGCAAGTCCTAGCCCCATCAAAA 

Sequence 2990 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACATATACATTATGTAATTAAAAAGCGTGCATG 

TGTATGTATTAAAAATAATGGTATATAAACAAATACAATATATACMTAAAACACCTAAA 

CGCAGAGGCTGCTGTTATCCACAATAGTAATACTAATAGTATTAATGATGTGTATGTAGA 

CACAGACAAAAGCAGCGGACGTATTAATAGGCAGACACACAAAAGCACACAAAAGCAAAG 

CAAAAAGCACGCCAGTAATGTTGTGGATGCAGTATCAAGATTTGTCATATAGACATAAAT 

CCAGTAGACACTGTMTAGTTTTTGGTATTTTMCTTGAGACAAAMTTGGTCACGTTGC 

CATTCACTATCATATGTMGTGTMCAATTGCACTTTTATGTTTTACATTATGTCCTGTC 

CMTGCCATGTAGACGACACTGCAGTATACMTTTACAATGCTTTTTAAATCTATATCTT 

AAACATTTTAAAGGATTAGTATCACCTT 

Sequence 2991 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACCAGGACTGTCCTAGTGCGGGGAATAAAAGTG 
AAATTATAGTCCTTTCTCCTGAAGAAATCAGGGTCTAGTGGAAAGGAAGGTGGGACCGGA 
AGGAAGCTGGCCGGTGGAGCTTGGCCCTGTGCTAAGCATTCCTATCCACTTAACTGTGGG 
TAATCGTCTGAAACTCCCTACAGACCTGGGAGTAAGCTATTGTATTGAAGGGGCTGTATG
GAGTCAGGGAGAGAAGCCAGGCTTTANAGTCAGATGGCTCTAATCGTGCATCTCACCCTC

TCTGTGTCCCCAAGCTCTCCCTCTGTAAGATGGGGACAATGACAGCACCCACC

Sequence 2992

CCCTTAGCGTGGTCGCGGCCGAGGTACATTGTATACTGCAGTGTCGTCTACATGGCATTG 

GACAGGACATAAT GTAAA ACATAAMGTGCAATTGTTACACTTACATATGATAGTGAATG 

GCAACGTGACCAATTTTTGTCTCAAGTTAAAATACCAAAAACTATTACAGTGTCTACTGG 

ATTTAT GTCT ATATGACAAATCTTGATACTGCATCCACAACATTATTGGCGTGCTTTTTG 

CTrTGCTTTTGTGTGCTTTTGTGTGTCTGCCTATTAATACGTCCGCTGCTTTTGTCTGTG 

TCTACATACACATCAT 

Sequence 2993 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACTTTTCTCCAAGTCAATGAATCCAATCTTTAA 

TGGATCTGGGATGCTCTCTGACCAGGTCCTGTTATGAGTTTCCTGTGTTATAATTAAAAA 

CCCCAAACTGATTAAGCATGAGAGGATACTTCCTGCTCTCAGATAAATTATTTATTATAT 

TCT GTAT TATCCTTGTAATTTTTCTGGATATGAATAAATAACAATGTATCTCAGAAAAAC 

ACATTTTGAAMGCCTTATAMGTTAAMCTTTAAATGCTTTTCATCAAAATGTTATGAC 

CAATAAAAATGCCATTCATGTATCTCATCACACTTTCTATACATTTTGTAGTGATATGTA 

TTTATACATACTGCAGTTGGMATAGGGMTGCTTTTTTGGTTTCCCACGGTAGGCTGCA 

GGTTTTAAAAGCAGAAGTCCTGAACCTCTGAAGGCAACATAAGAAAAGTATTCATTTGAA 

ACCTG 

Sequence 2994 

CCCTTAGCGTGGTCGCGGCCGAGGTACTTGTGTGGGTGTCTTACCTTGCATTAGMTATC 

CTAGGCCTTTCTAATTCTTCCTCTTGAATTCCCTGTGTCACTCATCTGAGGTCATCTCTT 

GGATTTAAATACATTTGGTCTGAAATTCTCAAACTGCTGAATAAAATGCCATANTAAGTC 

CAACTATATGGTTTCATGTAGGGTAAAGTAACCATTGTGGAAGGATTACGATAAAACTGT

GTGACAATAAATAGAATAAAAGATGGCAAAACAATCTGTAAATTACAAAGTGTTATAAAA 

ATGCATTGMTTTCTAGTGTTCTTACACTTTCATATTTCTCATTGGCACATCAATCANAC

CTTGCCTTCCTAAAGTGGGA GGAA GAAAAAGGCTGCACCTGTCTGGGGCTGAAGGTCTGG 

GCAGAAACACTAGCCAAAACTTTTACTAAATCTTCATGTCACCCCCCCAAACTNATTAGG

GGCCCAMCTTATTTCATTATCTTCACTATCCCCCAGGGTTTCCCTCTCCNTATTTTTTC 

ATGCCCCACCTTTCAGCACTTTTCATGGTTNCTTNGTCTTTAAAAAACCCCCAA 

Sequence 2995 

CCCTTCGGCCGCCCGGGCAGGTACATTTAAAAGGTGATGCTAATACTTTAAAATGTTTAA 

GATATAGATTTAAAAAGCATTGTAAATTGTATACTGCAGTGTCGTCTGCATGGCATTGGA 

CAGGACATAAT GTAAA ACATAAAAGTGCAATTGTTACACTTACATATGATAGTGAATGGC 

AACGTGACCAATTTTTGTCTCAAGTTAAAATACCAAAAACTATTACAGTGTCTACTGGAT 

TTAT GTCT ATATGACAAATCTTGATACTGCATCCACAACATTACTGGCGTGCTTTTTGCT 

TTGCTTTTGTGTGCTTTTGT

Sequence 2996 

CCCTTAGCGTGGTCGCGGCCGAGGTACCATTTGGGATTATGGTTTTTAAAAGCTTGAGAA 

TATTACTTATTTACTTTTGCAGCTTTCCATGCAGCTG I I I 1 1 I I I I CTCTCTGTTTTTGA 

CTTTGGANATAAAGTTACTGATTTTTATCTTTACTTTTCTCTTTATTTTGTGCCTTGGTC 

TTGAAAAAGTAATAAATATTTCTTACTGAGGCCCCCTCCTGCTTTGTTCTGCCATCTAAG 

GTGATTACAATGCTCAAGTCTCTTTGTTGGTTGCCCTTGACTAATGCTTTTCAGCAGCCC 

CGGGCTCAGCTCCTGGTTCCCAGTATGGCACAATGACCAGGCAGATATCTCGACACAACT 

CTACTACTTCTTCGACATCTTCTGGTGGATACAGACAAACTCCCTCTGTGACTGCTCAAT 

TTTCTGCTCAGCCTCATGTTAATGGGAGGTCCACTTTATTCTCAAAATTCAAGTAAGCTT 

GT GCTrr GCAGGCATACAGCAACAGGGGGAGCTCCTTTTGGAAGGAAATTATTTGAAACT 

GGTTTTTCTTGCCAGGGGATAGACAACCAATTGAAAAACCATGAAGGNTNCAAGGGTTAT 

TTTTGGGT TTTANG AMTTAAGNCCCTTGAAATCATACCNATTTTTAAANANGTAATTGN 

TTTACCACCTTTTTTNTTCCCA 

Sequence 2997

CCCTTTCGAGCGGCCGCCCGGGCAGGTACATGTTCCTTCCCTTACTTTGGGGAGTCTAGG

TTGTGMTTTGAMCMAATCAGATTCTTTTCATCTTCCCTTTCCCCTCAAATTCCTGAG 

AAAACCTCCAACCTTCTAAATTTATAGCAAATCAACTATAATTATGTGTTTCCATTTGAA 

ATTCMGCTAMATMCATACTTTAAAAAGTGTATCTTAAAAATCATATTTCGCTTCAAA 

AAAACTTGTTAAAAGTAATTTGCATCAGATCCTGGAGTCGACTTGAAGAATTCTCCTACA 

TCTGACACCCAGTT AGGCC CTTTGAGAAAGAGAGAAAAGAGAATTTTTTAATGCATGTTT 

GATTATGGCCATCTCTTTTCTTAGAAGGTAGAAGATAGCACCATGCCGATTCGTCGAACT 

GTGAATTCTACCCGGGAAACTCC 

Sequence 2998 

CCCTTAGCGTGGTCGCGGCCGAGGTACATTTAAAAGGTGATGCTAATACTTTAAAATGTT 

TAAGATATAGATTTAAAAAGCATTGTAAATTGTATACTGCAGTGTCGTCTACATGGCATT 

GGACAGGACATAAT GTAAA ACATAAAAGTGCAATTGTTACACTTACATATGATAGTGAAT 

GGCAACGTGACCAATTTTTGTCTCAAGTTAAAATACCAAAAACTATTACAGTGTCTACTG 

GATTTAT GTCT ATATGACAMTCTTGATACTGCATCCACMCATTATTGGCGTGCTTTTT 

GCTTTGCTTTTGTGTGCTnTGTG 

Sequence 2999 

ACATCTGCACATTGTGCAGGTTAGTTACATATGTATACATGTGCCATGCTGGTGCACTGC 

ACCCACTAAATCGTCATCTAGAAGTGTTTTCTMTTTTATTTGTAATTTCTTCTTTGTTC 

CATTGGCTGTTGAAGAATGTGTTGTTTGGCCGAGCACAGTGGCCGACGCCTGTGATCCCA 

GCACTTTGGGAGGCCGAGGCAGGTGGATCACGAGGTCAGGAGATCCCCGCGTACATCTAT 

TTAAAAGTCAATACACAGGAAAAAGGTCTGGAAGAATATATACTAGAAAAGAATGGTTAC 

CTCTAGGGGATGGAGTGTCAAGGTGAATTTTGGTTTATTCATACTGTCTAAATGATTGCT 

TAATA ACTC AGGTATTGCTTTCAAAAAGTAAGAGGAAGGGTGGGAAATACCCTGTTCTCT 

CANAATTTTAAGTCCAGAGTGCTGGGGAGAAAAANATGGCAGCACTTGANCGGGACTGAA 
AAAGT 

Sequence 3000 

CCCTTAGCGTGGTCGCGGCCGAGGTACGCGGGTAGGAGCAGGAGGAGCAGTGGGGGAAGC 

TGGTATTATTTATACCACTCTTTGTATTTTATCACTTTTTATAAGTCTGAAATTATTTCA 

AAA TGAAA ATGTAAAAAGAACAAGGAGTTTCTCACTATTCTTTGTCCCCCTAAAAAGACC 

ATCi 1 1 1 i AAAAAGATTAAGGAATTCATACAAAATAAGTAAGAATTAACAATTATAGGAA 

GTTTGGGCCAAGCACAGTGGCTCATGCCTGTAATCCCAGCGCTTTGGGAAGCTGAGGTGG

GTGGATTGCTTGGGCCTAAGAGTTCAAGACTANCCCTGGTAACAGCGAGGACCCTTGTCT 

CTACAAAAAATAAAAAATACTAAC 

Sequence 3001 

CCCTTAGCGTGGTCGCGGCCGAGGTACAGATGGGGTTTCACTGTGTTAGCCAGGATGGTC

TTGATCTCCTGACCTCGTGATCTGCCCGCCTCGGTCTCCCAAAGTGCTGCGATTACAGGC 

GTGAGCCACTGTGCCCAGGCCTATTGGGGGTTATGCAGGTTCCTTTGCTTGGCCATGGTA 

GATGGCTAAGTCTTTGCTTCCAAGTTCTTTTGACCATGAGACATGTAGAGCCAGGGAGGA 

MGAGAAAACCCAAACCAGGTCACTGTTACCTGTGAATAACATCTTGATCGAGTTCTGAG 
A 

Sequence 3002 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACATTTTCATATTCTCCACCTTTGTTCCTTGCC 

CTTTGCCTTCAMGATATGTAGATGTTCCTTAACTCCAAAAATTATAAAAACAAACAAAA

AAACTTTACCACTGTTCATTATCAMGTTACTATTCATTTATCCATTCAATATTTAGAGT 

AGCTAACAAGACAAAACAAAATAATTGCTGCTTTCGTGGCTTATAGTGGGGAAGGGTGAT

AAA CACCATGA TGAAGGAGAGAAGAGTGCTATTGTGGGGGCACAAATTATGGGGATCTGA 

TCCI 1 1 I I I I I AAGGGAGGCAT 

Sequence 3003 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACATTTAAAAGGTGATGCTAATACTTTAAAATG 

TTTAAGATATAGATTTAAAAAGCATTGTAAATTGTATACTGCAGTGTCGTCTACATGGCA 

TTGGACAGGACATAATGTAAAACATAAAAGTGCAATTGTTACACTTACATATGATAGTGA
ATGGCAACGTGACCAATTTTTGTCTCAAGTTAAAATACCAAAAACTATTACAGTGTCTAC
TGGATTTATGTCTATATGACAAATCTTGATACTGCATCCACAACATTACTGGCGTGCTTT 
TTGCTTTGCTTTTGTGTGC

Sequence 3004

CCCTTAGCGTGGTCGCGGCCGAGGTACCATGTTCTAAAGGCAATCGAGTCATAATACACT 

GAAAGCAGTCCAAGAGCAGCAAGAGACTTTGCTGCAGCTGTAGATCACTGGATCGGTAAC 

TCGTAGTTAAAGGAAATCCCTACCCACTCCCATCCAGCCCCTACCCCCTAATTATGGGAC 

AAAATAGCCAATTCAATGAGATTTAAACAGGAAGTTCACAAGAGGAACTCTTATACCTAT 

GAGGACCCATTAACCAGGGATTCCACAACCAGTAGAATATTATCTACTGGTAGCTATTTA 

AGCCTTACCTGACAGGTCTCCAAGCCAGCCGCAGCAGCCACATCCCTCTCTGCGCCATAA 

ACGTCTCGCCTCAGTTTCCTGAAATTCCCAACTGAGAANGGAATATAGCCAGCAGGCACT 

CCCAACCCTGTTATGATTCTACAGTTCTACTGTTAGTAAATTTCCCAGGGGCATCACAAT 

CTGTGTCCTTACACATGCATACACAGAGGAGCTTCCTTTTTANGGATGAGCTCCTCATTA 

CAGGGC 

Sequence 3005 

CCCTTAGCGTGGTCGCGGCCGAGGTACTAAGTCTATCGCCATGCTGTGTCACGTGGGCAG 

GTCTTCTTGGCATTCATCTTGAACATGACCCAAATCCTGGCTCCATCTTCCCTCTTACTT 

TTCCTTTGCCCCATTTTTCTTCCTTATTTTTTATGTATGTTTACCTTGCCCTGTAGGCGC 

ATTTGTAAGCTGGCTCAAATCCTTTTGGGAACAAGGTGGGTTATAAATGCACACGTTCCT 

ACACAGGATGTTGTTAGTCATTTCATMGGTCTCTTGTGGAGGAGATTAGCCAATAGCTT 

CTTTTGTAACCTTAGGCA 

Sequence 3006 

C CCTTA GCGTGGTCGTGGCCGAGGTACAAAACACATTTGGTCCTGAGATTTTTCAGAAGC 

AATnTAAGTGTAATAGGAATCCTCAGAAGGAAGAAAGTAAACATGTTCTGAGCAACTCC 

TCCTCCAGGCACTACAATAAAAGAGAAGGGAACTGGACCTGGAAAGACAGACAGTCCATA 

GAGAAGTGGAGAGGGAAGAGGAAGAGAGCATGCTGCTTAATGCAGGTTGACATTTCTGTT 

TGTGAAGCTAATTAAATTCC AGAAGG ATTAAATACTTAATACTTTACCGTCTGTATGTAT 

GTATGTATTTATTTATTTATATTTTTTTGAGATGGAGTCTCACTCTGTCACCCANGCTGG 

AGTGCAAAGGCCCAATCTTGGCTCACTGCAACCTCCACCTTCCAGGTTCAAGCGATTCTT 

CTG ACTCAC CTCCAAGTAGCTGGGATTACAGGCACACACCACCATGCNGAGCTAATTTTT 

TGNATI I I IANTAAAANACANGGGTTCCCCGGIMTACCCGGATGGGCTCAATCTCCTGACC 

TCNGGATCTGGCTGTNTCGGGCCTCCCAANTGCTTGGATANAGGCGTGAGCCCCTGGGCC 

CCACTTATACCCTTNATMNTAAATAmAATCCCCAAGNmAMTTTTNNTTATCCCT 

TGTCGGGGTAAACCTTTCCCNAAAAAGG 

Sequence 3007 

CCCTTAGCGTGGTCGCGGCCGAGGTACATGATTCTACACTGAATCTGCATTTCACTCCCA 

TATCTATACCAGAAGGTTATCAGTGGAAGAGAAATTCAGTTATCTTGAATGGACATGATC 

TTCTCAGGAGCAGTCAGTGGTTAATTGGGACAAGAAAACACAAGTCATTATCATTGAGAA 

ATCTGAAGCAAATTGAGGCAGGTTGTCACCTTTCACCAGGAAACAAATTAGCCCTGTCTT 

TAAAAAGACTTCTTTCTCTCTCTGCTGACTGGGGATTCCTGTCAACTGCCAATGAAATAG 

GGAGAGTGTAGATTAATGGC 

Sequence 3008 

CCCTTAGCGTGGTCGCGGCCGAGGTACTAACCATATGAAAATAGTTGCTACCTTGTTGGA 

CAGGACAGATATACAAAGCATTTCCATTTTTACAGAAAATGTGTATGGATACCATGGAGG 

TTTGAAATAAATGCTGAGTTCATTCAAACTATAGTGAAATACAGCATGTGAAAAAGTAAT 

TACTTATTGACTATAATAAATGCTTCATAAAATATAATAGATGCTGATTCTGGCCAAGAT 

TGACAAAATCTAATGCAGCCTACATGTCTCAGTGGTTAATACTAAGACTCTGAATAAAAT 

ACAAGCAAATAAACA 

Sequence 3009 

CCCTTAGCGTGGTCGCGGCCGAGGTACCTTTGCCAGTTTCTGGCATATTAACAGCTGAGA 
TCAGATAAGTGAACAGCGATAGAGGTAGTGAGGTTTAGGATTCAGGTCTGCCACCGGGAG 



